OCR stands for Optical Character Recognition. OCR is a software tool that can extract print text from some documents.
When will OCR work well?
OCR does not work on handwriting. It only works for printed or typed text, meaning text created by a typewriter, printing press, or other mechanical means. OCR will do best on consistent and clear images of modern typefaces.
Do I still need to review pages started with OCR?
Yes! OCR is imperfect. It may not work well for some or all parts of a typed page, but it can be a great starting point. If you start a page with OCR, you should read the text closely before submitting. If you are reviewing a OCR-ed page, you also still need to review.
We always want to use volunteer time effectively. When the Library of Congress digitizes a large group of printed pages, it will usually OCR them. The materials in By the People campaigns are not good candidates for applying OCR at scale, either because they are handwritten, a mixed collection of handwritten and print materials, or printed on paper or in a typeface that does not produce accurate OCR results. However, OCR can still be a useful starting point for some typed pages. Use it if it if you like it or skip it if you don’t!
You can help by transcribing a new page, adding tags to this page, or coming back later to review this page's transcription.
Nice Job!
This page has been submitted for review.
What do you want to do next?
Nice Job!
Thanks for your help - we've saved your decision.
What do you want to do next?
Are you sure?
Clicking "Transcribe with OCR" will remove all existing transcription text and replace it with automatically generated text. We recommend saving existing text in a separate document if you may want to revisit it.
Campaign Tips
Transcribing Sheet Music of the Musical Theater
In this campaign, we aim to transcribe nearly all text found in these 16,000 pieces of sheet music from 1880-1922.
Our goal in transcribing sheet music text is enabling keyword search and improving discovery.
This transcription project may not suitable for all volunteers. Some items include potentially offensive and racist language and views, which reflect both their time and circumstances.
DO transcribe
Capture the sheet music's descriptive text, including titles, lyrics, publication information, and advertisements.
You should also transcribe tempo markings, such as moderato, allegro, allegretto, etc.
Transcribe these in the order they appear on the page.
DON'T transcribe
Do not transcribe musical notation, such as time and key signatures or dynamic markings.
Do not transcribe hyphens within lyrics.
Learn more
We've created an illustrated guide to transcribing sheet music to help you along.
Because of potentially offensive and racist language and views, which reflect both their time and circumstances, this transcription project may not suitable for all volunteers.
We have many other collections available for transcription on our Campaigns page.
Need more help? Check out the How-To Guide
You can access full instructions at any time while transcribing or reviewing. Just click the blue How-To Guide button above the transcription box.
The guide also includes campaign descriptions and other helpful context under "About This Campaign".
View or print instructions in a separate webpage by visiting How-To at the top of your screen on any page.