Thumbnail
Access Restriction
Open

Author Prószéky, Gábor ♦ Naszódi, Mátyás ♦ Kis, Balázs
Source CiteSeerX
Content type Text
File Format PDF
Language English
Subject Domain (in DDC) Computer science, information & general works ♦ Data processing & computer science
Subject Keyword Syntactic Post-processing ♦ Treating Error ♦ Morphologic Recognition Assistant ♦ Disambi-guated Segmentation ♦ Text Acquired ♦ Shape Code ♦ Syntactic Analysis ♦ Recognition Assistance ♦ Recognition Source Conti-nuous Speech ♦ Specific Group ♦ Various Recognition Process ♦ Unified Linguistic Framework ♦ Error Re-gardless ♦ Successful Framework ♦ Un-derspecified Symbol ♦ Recognition Process ♦ Underspecified Symbol
Description Texts acquired from recognition sources—conti-nuous speech/handwriting recognition and OCR—generally have three types of errors re-gardless of the characteristics of the source in particular. The output of the recognition process may be (1) poorly segmented or not segmented at all; (2) containing underspecified symbols (where the recognition process can only indicate that the symbol belongs to a specific group), e.g. shape codes; (3) containing incorrectly identified sym-bols. The project presented in this paper addresses these errors by developing of a unified linguistic framework called the MorphoLogic Recognition Assistant that provides feedback and corrections for various recognition processes. The framework uses customized morpho-syntactic and syntactic analysis where the lexicons and their alphabets correspond to the symbol set acquired from the recognition process. The successful framework must provide three services: (1) proper disambi-guated segmentation, (2) disambiguation for un-derspecified symbols, (3) correction for incorr-ectly recognized symbols. The paper outlines the methods of morpho-syntactic and syntactic post-processing currently in use.
Educational Role Student ♦ Teacher
Age Range above 22 year
Educational Use Research
Education Level UG and PG ♦ Career/Technical Study
Learning Resource Type Article
Publisher Institution Proc. of the 19th international conference on Computational linguistics - Vol 2.pp 1