Skip to main content



Tech Report HCIL-2009-07

Hu, C., Rose, A., Bederson, B. (March 2009)
Locating Text in Scanned Books
To be published at JCDL 2009
HCIL-2009-07

Text location in scanned documents is important for selection, search, and other interactions with visual presentations of scanned books. In this paper, we describe a work flow to extract and verify text locations using commercial software, along with free software products and human proofing. Our method uses Adobe Acrobat’s OCR functionality, but can be easily adapted to other OCR software products. To help mid-sized digital libraries, we are making our solution available as open source software.



Nora Project Screenshot

When You're Hot, You're Hot -- And the Computer Knows It
Read article

Tech Reports
Video Reports
Annual Symposium

News
Seminars + Events
Calendar
HCIL Seminar Series
Annual Symposium
HCIL Service Grants
Events Archives
Awards
Job Openings
For the Press
HCIL Overview
Collaborators
Collaborating Groups + People
Academic Visitors
Become a Member
Our Lighter Side
HCIL Store
Give the HCIL a Hand
HCIL T-shirts for Sale
Join our Mailing List
Contact Us
Visit Us
HCIL Memories Page
Faculty/ Staff
Students
Ph.D. Alumni
Past Members
Research Areas
Communities
Design Process
Digital Libraries
Education
Physical Devices
Public Access
Visualization
Research Histories
Faculty Listed by Research
Project Highlights
Project Screenshots
Online Tech Reports
Video Reports
Books
Products
Presentations
Studying HCI
Graduate Studies in HCI
Visiting Scholars
Class Websites
Sponsor our Research
Sponsor our Annual Symposium
Active Sponsorship
Industrial Visitors