Access Restriction Open
Author Aranzabe, María Jesús ♦ Díaz de Ilarraza, Arantza ♦ Gonzalez-Dios, Itziar
Source CiteSeerX
Content type Text
File Format PDF
Subject Domain (in DDC) Computer science, information & general works ♦ Data processing & computer science
Subject Keyword Automatic Text Simplification ♦ Advanced Application ♦ Long Sentence ♦ Syntactic Simplification ♦ Text Simplification System ♦ General Coverage Tool ♦ Machine Translation ♦ Basque Language ♦ First Proposal ♦ Morphological Feature ♦ Finite Relative Clause ♦ New Utility ♦ Research Line ♦ Standard Basque ♦ Specific Phenomenon ♦ Finite Temporal Clause ♦ Target Audience ♦ Agglutinative Language ♦ Long One
Abstract The analysis of long sentences is a source of problems in advanced applications such as machine translation. With the aim of solving these problems, we have analysed long sentences from two corpora written in Standard Basque in order to perform syntactic simplification. The result of this analysis led us to design a proposal for producing shorter sentences out of long ones. To perform this task, we present an architecture for a text simplification system based on previously developed general-coverage tools (giving them a new utility) and on hand-written rules specific to syntactic simplification. Since Basque is an agglutinative language, these rules are based on morphological features. In this work we focus on specific phenomena such as appositions, finite relative clauses and finite temporal clauses. The proposed simplification does not exclude any target audience and could be used for both humans and machines. This is the first proposal for automatic text simplification in Basque, and it opens a research line for the language in NLP.
Educational Role Student ♦ Teacher
Age Range Above 22 years
Educational Use Research
Education Level UG and PG ♦ Career/Technical Study