Mastodon

Month: July 2011

  • Using off-the-shelf OCR to re-extract data

    Having just written a lengthy blog post / rant about publishing data for another blog (I’ll link to it later if/when it gets published). I thought I’d post a technical demonstration of my issues here. I want need to extract simple matrices of numbers from research papers for my PhD research. Theoretically, I shouldn’t even…