White and O'Connell, 1994 (DARPA measures): 5-point scale of syntactic correctness.
ALPAC: 5-point scale of syntactic correctness.
Percentage of phenomena correctly treated.
List of error types.
Average string edit distance per sentence or for all tokens in the text.
Flanagan, 1994. (See also the LOGOS error list in the same AMTA proceedings).
Loffler-Laurian, 1983 (in French).
See also Arnold et al, eds., 1993 ('Machine Translation' 1993 vol. 8:1-2, special issue on evaluation).