NotesFAQContact Us
Collection
Advanced
Search Tips
Showing all 10 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Sophie Litschwartz – Society for Research on Educational Effectiveness, 2021
Background/Context: Pass/fail standardized exams frequently selectively rescore failing exams and retest failing examinees. This practice distorts the test score distribution and can confuse those who do analysis on these distributions. In 2011, the Wall Street Journal showed large discontinuities in the New York City Regent test score…
Descriptors: Standardized Tests, Pass Fail Grading, Scoring Rubrics, Scoring Formulas
Peer reviewed Peer reviewed
Direct linkDirect link
Allalouf, Avi – International Journal of Testing, 2014
The Quality Control (QC) Guidelines are intended to increase the efficiency, precision, and accuracy of the scoring, analysis, and reporting process of testing. The QC Guidelines focus on large-scale testing operations where multiple forms of tests are created for use on set dates. However, they may also be used for a wide variety of other testing…
Descriptors: Quality Control, Scoring, Test Theory, Scores
Peer reviewed Peer reviewed
Direct linkDirect link
France, Stephen L.; Batchelder, William H. – Educational and Psychological Measurement, 2015
Cultural consensus theory (CCT) is a data aggregation technique with many applications in the social and behavioral sciences. We describe the intuition and theory behind a set of CCT models for continuous type data using maximum likelihood inference methodology. We describe how bias parameters can be incorporated into these models. We introduce…
Descriptors: Maximum Likelihood Statistics, Test Items, Difficulty Level, Test Theory
Mitchell, Alison M.; Truckenmiller, Adrea; Petscher, Yaacov – Communique, 2015
As part of the Race to the Top initiative, the United States Department of Education made nearly 1 billion dollars available in State Educational Technology grants with the goal of ramping up school technology. One result of this effort is that states, districts, and schools across the country are using computerized assessments to measure their…
Descriptors: Computer Assisted Testing, Educational Technology, Testing, Efficiency
Haberman, Shelby J. – Educational Testing Service, 2011
Alternative approaches are discussed for use of e-rater[R] to score the TOEFL iBT[R] Writing test. These approaches involve alternate criteria. In the 1st approach, the predicted variable is the expected rater score of the examinee's 2 essays. In the 2nd approach, the predicted variable is the expected rater score of 2 essay responses by the…
Descriptors: Writing Tests, Scoring, Essays, Language Tests
Marzano, Robert J. – 2000
There has been little discussion of two conventions common within classroom assessment: the convention of representing student's performance on an assessment using a single score; and the convention of using the average score to summarize a student's performance over a set of assessments. This paper attempts to demonstrate that the assumptions…
Descriptors: Elementary Secondary Education, Scoring, Teacher Made Tests, Test Theory
Oxford-Carpenter, Rebecca L.; Schultz-Shiner, Linda J. – 1985
This paper addresses practical Army problems in reading assessment from a theory base reflecting the most recent research on reading comprehension. Military and occupational research shows that reading proficiency is related to job performance. Reading assessment is a key issue in the Army due to changes in the reading ability levels of the Army…
Descriptors: Armed Forces, Military Personnel, Postsecondary Education, Psychometrics
Kokkota, V. A. – 1989
This book contrasts non-Soviet approaches to language testing and provides definitions from four Soviet language test experts. The role of foreign language teaching, the function of tests, and theoretical problems are discussed, with considerable focus on communicative competence. The book discusses test standardization and classification and…
Descriptors: Communicative Competence (Languages), Foreign Countries, Language Skills, Language Tests
Kingsbury, G. Gage; And Others – Technological Horizons in Education, 1988
Explores what some deem the best way to objectively determine what a student knows. Adaptive Testing has been around since the early 1900's, but only with the advent of computers has it been effectively applied to day to day educational management. Cites a pilot study in Portland, Oregon, public schools. (MVL)
Descriptors: Administration, Computer Uses in Education, Diagnostic Teaching, Individual Needs
Peer reviewed Peer reviewed
Sackett, Paul R.; Wilk, Steffanie L. – American Psychologist, 1994
Reviews the literature on subgroup norming in testing and examines several types of score-adjustment methods. The authors discuss social and policy perspectives as well as the scientific and theoretical underpinnings of score adjustment. (GLR)
Descriptors: Civil Rights Legislation, Employment Practices, Equal Opportunities (Jobs), Literature Reviews