Information extraction/summarization

Definition
The purpose of information extraction / summarization is to extract some portion(s) of the translated text, either manually or automatically, for subsequent processing or storage. Information extraction is typically concerned with filling templates by identifying atomic elements of events. In contrast, summarization aims to provide a self-contained and internally cohesive text which serves as a selective account of the original.
Relevant qualities - from part 2
The most important features for this type of work are:

Fidelity (2.2.1.2.1/179)- is the translated output an accurate reflection of the input, are there even small distortions of meaning.

For summarization, coherence (2.2.1.1.1.3/182) and cohesion (2.2.1.1.1.4/503) provide useful cues.

Adaptability or customizability (2.2.6.1/221) - this customizability differs slightly from the customizability for document routing/sorting type of work: can the system be tuned to recognize some types of material (based on domain, content, or genre) and translate them with more care.

Stakeholders
End users of extraction application. Consumers of extracted information. Business process planners.
References

Hovy 1999

White, Cardie et al. 2000


View or add comments (115)