According to White 2000, the purpose of declarative evaluation is to measure the ability of an MT system to handle texts representative of an actual end-user. It purports to measure the actual performance of a system external to the particulars of the feasibility of the approach or of the development process.
As with feasibility and internal evaluation, we look at coverage of linguistic phenomena and handling of samples of real text. However, these generally do not use constrained test patterns, and they are not directly used to determine the extensibility of the system, but how good it is right now. Declarative evaluations generally test for the functionality attributes of intelligibility, (how fluent or understandable it appears to be) and fidelity (the accurateness and completeness of the information conveyed).
Translation process models (2.1.1/402)
Linguistic resources and utilities (2.1.2/403)
Suitability (2.2.1.1/168)
Accuracy (2.2.1.2/177)
Well-formedness (2.2.1.3/186)