Proof-Editing Shakespeare Entry from Encyclopaedia Britannica 11th Edition
Since the previous post we’ve succeeded in using tesseract and we now have a nice plain text version of the EB entry on shakespeare:
http://knowledgeforge.net/shakespeare/svn/trunk/shksprdata/ancillary/britannica-11th.txt
What we now need to do is ‘proof’ this to correct the OCR errors. This kind of think is perfect for distributed volunteers so if you’d like to help out just step up and starting correcting with one of the sections. To make it especially easy for people to make edits the text has in a temporary location on the Open Knowledge Foundation wiki (only the first five pages for the time being):
http://okfn.org/wiki/tmp/BritannicaShakespeare
2 comments September 19th, 2007