NotesFAQContact Us
Collection
Advanced
Search Tips
Showing all 9 results Save | Export
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Haberman, Shelby J. – ETS Research Report Series, 2019
Measures of agreement are compared to measures of prediction accuracy within a general context. Differences in appropriate use are emphasized, and approaches are examined for both numerical and nominal variables. General estimation methods are developed, and their large-sample properties are compared.
Descriptors: Measurement Techniques, Classification, Prediction, Accuracy
Peer reviewed Peer reviewed
Direct linkDirect link
Rap, Robyn; Paxton, Pamela – Sociological Methods & Research, 2021
Questions on voluntary association memberships have been used extensively in social scientific research for decades. Researchers generally assume that these respondent self-reports are accurate, but their measurement has never been assessed. Respondent characteristics are known to influence the accuracy of other self-report variables such as…
Descriptors: Accuracy, Measurement Techniques, Error of Measurement, Voluntary Agencies
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Choi, Ikkyu; Hao, Jiangang; Deane, Paul; Zhang, Mo – ETS Research Report Series, 2021
"Biometrics" are physical or behavioral human characteristics that can be used to identify a person. It is widely known that keystroke or typing dynamics for short, fixed texts (e.g., passwords) could serve as a behavioral biometric. In this study, we investigate whether keystroke data from essay responses can lead to a reliable…
Descriptors: Accuracy, High Stakes Tests, Writing Tests, Benchmarking
Peer reviewed Peer reviewed
Direct linkDirect link
Longford, Nicholas Tibor – Journal of Educational and Behavioral Statistics, 2016
We address the problem of selecting the best of a set of units based on a criterion variable, when its value is recorded for every unit subject to estimation, measurement, or another source of error. The solution is constructed in a decision-theoretical framework, incorporating the consequences (ramifications) of the various kinds of error that…
Descriptors: Decision Making, Classification, Guidelines, Undergraduate Students
Peer reviewed Peer reviewed
Direct linkDirect link
Beauchaine, Theodore P. – Journal of Clinical Child and Adolescent Psychology, 2007
Taxometric procedures provide an empirical means of determining which psychiatric disorders are typologically distinct from normal behavioral functioning. Although most disorders reflect extremes along continuously distributed behavioral traits, identifying those that are discrete has important implications for accurate diagnosis, effective…
Descriptors: Identification, Psychopathology, Adolescents, Etiology
Peer reviewed Peer reviewed
Klauer, Karl Christoph; Batchelder, William H. – Psychometrika, 1996
A general approach to the analysis of nominal-scale ratings is discussed that is based on a simple measurement error model for a rater's judgments. The basic measurement error model gives rise to an agreement model for the agreement matrix of two or more raters. (SLD)
Descriptors: Classification, Data Analysis, Equations (Mathematics), Error of Measurement
Peer reviewed Peer reviewed
Wilson, Noel – Education Policy Analysis Archives, 1998
Explores the ways in which error in measurement related to educational standards and the classification of people in educational settings is obscured in most of the practical events involving the assessment of individuals. Establishes the centrality of the measurement of educational standards to the "production" and control of the individual in…
Descriptors: Classification, Educational Assessment, Educational Environment, Elementary Secondary Education
Jaeger, Richard M. – 1975
Three new indicators of psychometric quality for objectives-based statewide assessments are proposed. These measures provide indication of the stability of reported data on item and objectives mastery, the validity of assessment items for members of various cultural groups, and the convergent validity of prescribed objectives mastery scores. The…
Descriptors: Classification, Cultural Influences, Educational Assessment, Error of Measurement
Karkee, Thakur B.; Wright, Karen R. – Online Submission, 2004
Different item response theory (IRT) models may be employed for item calibration. Change of testing vendors, for example, may result in the adoption of a different model than that previously used with a testing program. To provide scale continuity and preserve cut score integrity, item parameter estimates from the new model must be linked to the…
Descriptors: Measures (Individuals), Evaluation Criteria, Testing, Integrity