NotesFAQContact Us
Collection
Advanced
Search Tips
Laws, Policies, & Programs
Race to the Top1
What Works Clearinghouse Rating
Showing all 15 results Save | Export
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Yanxuan Qu; Sandip Sinharay – ETS Research Report Series, 2023
Though a substantial amount of research exists on imputing missing scores in educational assessments, there is little research on cases where responses or scores to an item are missing for all test takers. In this paper, we tackled the problem of imputing missing scores for tests for which the responses to an item are missing for all test takers.…
Descriptors: Scores, Test Items, Accuracy, Psychometrics
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Hathcoat, John D. – Practical Assessment, Research & Evaluation, 2013
The semantics, or meaning, of validity is a fluid concept in educational and psychological testing. Contemporary controversies surrounding this concept appear to stem from the proper location of validity. Under one view, validity is a property of score-based inferences and entailed uses of test scores. This view is challenged by the…
Descriptors: Test Validity, Educational Testing, Psychological Testing, Scores
Peer reviewed Peer reviewed
Direct linkDirect link
Berk, Ronald A. – Journal of Faculty Development, 2016
Recently, student outcomes have bubbled to the top of debates about how to evaluate teaching in community and liberal arts colleges, universities, and professional schools, but even more international attention has been riveted on how outcomes are being used to evaluate teachers and administrators K-12 (Harris, 2012; Rowen & Raudenbush, 2016;…
Descriptors: Value Added Models, Academic Achievement, Outcomes of Education, Teacher Evaluation
Peer reviewed Peer reviewed
Direct linkDirect link
Sinharay, Sandip; Puhan, Gautam; Haberman, Shelby J. – Multivariate Behavioral Research, 2010
Diagnostic scores are of increasing interest in educational testing due to their potential remedial and instructional benefit. Naturally, the number of educational tests that report diagnostic scores is on the rise, as are the number of research publications on such scores. This article provides a critical evaluation of diagnostic score reporting…
Descriptors: Educational Testing, Scores, Reports, Psychometrics
Peer reviewed Peer reviewed
Direct linkDirect link
Mori, Kazuo; Uchida, Akitoshi – Research in Education, 2012
Longitudinal change in the average Z scores for four groups of pupils sorted by quartiles was examined for its stability over three years. The data, collected from 1998 to 2009, was obtained from nine cohorts of Japanese junior high school pupils totaling 1,962 subjects. It showed illusionary declines among the mid-range pupils but improvements…
Descriptors: Foreign Countries, Junior High School Students, Cohort Analysis, Evaluation Problems
Peer reviewed Peer reviewed
Direct linkDirect link
Roberts, Mary Roduta; Gierl, Mark J. – Educational Measurement: Issues and Practice, 2010
This paper presents a framework to provide a structured approach for developing score reports for cognitive diagnostic assessments ("CDAs"). Guidelines for reporting and presenting diagnostic scores are based on a review of current educational test score reporting practices and literature from the area of information design. A sample diagnostic…
Descriptors: Diagnostic Tests, Scores, Technical Writing, Cognitive Tests
Peer reviewed Peer reviewed
Direct linkDirect link
Chen, Yi-Hsin; Ferron, John M.; Thompson, Marilyn S.; Gorin, Joanna S.; Tatsuoka, Kikumi K. – Educational Research and Evaluation, 2010
Traditional comparisons of test score means identify group differences in broad academic areas, but fail to provide substantive description of how the groups differ on the specific cognitive attributes required for success in the academic area. The rule space method (RSM) allows for group comparisons at the cognitive attribute level, which…
Descriptors: Foreign Countries, Academic Achievement, Probability, Algebra
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Rock, Donald A. – ETS Research Report Series, 2012
This paper provides a history of ETS's role in developing assessment instruments and psychometric procedures for measuring change in large-scale national assessments funded by the Longitudinal Studies branch of the National Center for Education Statistics. It documents the innovations developed during more than 30 years of working with…
Descriptors: Models, Educational Change, Longitudinal Studies, Educational Development
Peer reviewed Peer reviewed
Direct linkDirect link
Bramley, Tom; Gill, Tim – Research Papers in Education, 2010
The rank-ordering method for standard maintaining was designed for the purpose of mapping a known cut-score (e.g. a grade boundary mark) on one test to an equivalent point on the test score scale of another test, using holistic expert judgements about the quality of exemplars of examinees' work (scripts). It is a novel application of an old…
Descriptors: Scores, Psychometrics, Measurement Techniques, Foreign Countries
Peer reviewed Peer reviewed
Direct linkDirect link
Gierl, Mark J.; Cui, Ying – Measurement: Interdisciplinary Research and Perspectives, 2008
One promising application of diagnostic classification models (DCM) is in the area of cognitive diagnostic assessment in education. However, the successful application of DCM in educational testing will likely come with a price--and this price may be in the form of new test development procedures and practices required to yield data that satisfy…
Descriptors: Educational Testing, Classification, Psychometrics, Test Construction
ACT, Inc., 2008
ACT is often asked whether student scores on the ACT[R] test can be used to make "norm-referenced" or "standards-referenced" comparisons. Norm-referenced interpretations compare students to one another, while standards-referenced interpretations measure student performance against predefined content standards. This brief shows…
Descriptors: College Entrance Examinations, Educational Assessment, Student Evaluation, Academic Standards
ACT, Inc., 2009
It is essential for all students to be ready for college and career when they graduate from high school. Postsecondary educators expect high school graduates to be prepared academically for success in postsecondary education, which in turn influences success in the work world. Employers continue to call for workers to have the tools needed to…
Descriptors: Job Satisfaction, Career Choice, Academic Achievement, High School Graduates
Peer reviewed Peer reviewed
Brennan, Robert L. – Educational Measurement: Issues and Practice, 1997
The history of generalizability theory (G theory) is told from the perspective of one researcher's experiences, describing psychometric and scientific perspectives that influenced the development of G theory and its adoption. Work that remains to be done in the field is outlined. (SLD)
Descriptors: Educational Testing, Generalizability Theory, Measurement, Psychometrics
Peer reviewed Peer reviewed
Traub, Ross E. – Educational Measurement: Issues and Practice, 1997
Classical test theory is founded on the proposition that measurement error, a random latent variable, is a component of the observed score random variable. This article traces the history of the development of classical test theory, beginning in the early 20th century. (SLD)
Descriptors: Educational History, Educational Testing, Error of Measurement, Psychometrics
Lyman, Howard B. – 1998
The first edition of this book was written to give information about testing to people whose work gave them access to test results, but whose training included little or nothing about the use and interpretation of tests. Later editions have been intended for a broader audience as the need for understanding what test scores really mean has…
Descriptors: Educational Testing, Norm Referenced Tests, Performance Based Assessment, Psychometrics