Thumbnail
Access Restriction
Open

Author Reynaert, Martin
Source CiteSeerX
Content type Text
File Format PDF
Language English
Subject Domain (in DDC) Computer science, information & general works ♦ Data processing & computer science
Subject Keyword Ocr-induced Typographical Variation ♦ Particular Focus Word ♦ Representative Sample ♦ Non-interactive Ocr Post-correction ♦ Simple Text-induced Filtering Technique ♦ Pronounce Tickle ♦ Contemporary Ocr-ed Dutch Text Corpus ♦ True Positive ♦ Typographical Variant ♦ Effective Conclusion ♦ Giga-scale Digitization Project ♦ Performance Score ♦ Undesirable Ocr-induced Typographical Variation Present ♦ Dutch Spelling ♦ Predefined Levenshtein Distance ♦ Historical Newspaper Article ♦ Ocr-error Resolution ♦ Text-induced Corpus Clean-up ♦ High-frequency Word ♦ Large Text Collection ♦ False Positive ♦ Correction Mechanism ♦ Non-interactive System
Educational Role Student ♦ Teacher
Age Range above 22 year
Educational Use Research
Education Level UG and PG ♦ Career/Technical Study
Learning Resource Type Article
Publisher Date 2008-01-01
Publisher Institution In Proceedings of the 9th international conference on Computational linguistics and intelligent text processing, CICLing'08