ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	1
Since 2006 (last 20 years)	5

Source

American Psychologist	1
Communique	1
Educational Testing Service	1
Educational and Psychological…	1
International Journal of…	1
Society for Research on…	1
Technological Horizons in…	1

Author

Allalouf, Avi	1
Batchelder, William H.	1
France, Stephen L.	1
Haberman, Shelby J.	1
Kingsbury, G. Gage	1
Kokkota, V. A.	1
Marzano, Robert J.	1
Mitchell, Alison M.	1
Oxford-Carpenter, Rebecca L.	1
Petscher, Yaacov	1
Sackett, Paul R.	1
Schultz-Shiner, Linda J.	1
Sophie Litschwartz	1
Truckenmiller, Adrea	1
Wilk, Steffanie L.	1
More ▼

Publication Type

Reports - Descriptive	10
Journal Articles	5
Books	1
Speeches/Meeting Papers	1

Education Level

Audience

Researchers

Location

New York	1
New York (New York)	1
USSR	1

Laws, Policies, & Programs

Assessments and Surveys

Armed Services Vocational…	1
General Aptitude Test Battery	1
Test of English as a Foreign…	1

What Works Clearinghouse Rating

Showing all 10 results Save | Export

A General Method for Adjusting Test Score Distributions to Account for Rescoring and Retesting

Peer reviewed

Direct link

Sophie Litschwartz – Society for Research on Educational Effectiveness, 2021

Background/Context: Pass/fail standardized exams frequently selectively rescore failing exams and retest failing examinees. This practice distorts the test score distribution and can confuse those who do analysis on these distributions. In 2011, the Wall Street Journal showed large discontinuities in the New York City Regent test score…

Descriptors: Standardized Tests, Pass Fail Grading, Scoring Rubrics, Scoring Formulas

ITC Guidelines on Quality Control in Scoring, Test Analysis, and Reporting of Test Scores

Peer reviewed

Direct link

Allalouf, Avi – International Journal of Testing, 2014

The Quality Control (QC) Guidelines are intended to increase the efficiency, precision, and accuracy of the scoring, analysis, and reporting process of testing. The QC Guidelines focus on large-scale testing operations where multiple forms of tests are created for use on set dates. However, they may also be used for a wide variety of other testing…

Descriptors: Quality Control, Scoring, Test Theory, Scores

Maximum Likelihood Item Easiness Models for Test Theory without an Answer Key

Peer reviewed

Direct link

France, Stephen L.; Batchelder, William H. – Educational and Psychological Measurement, 2015

Cultural consensus theory (CCT) is a data aggregation technique with many applications in the social and behavioral sciences. We describe the intuition and theory behind a set of CCT models for continuous type data using maximum likelihood inference methodology. We describe how bias parameters can be incorporated into these models. We introduce…

Descriptors: Maximum Likelihood Statistics, Test Items, Difficulty Level, Test Theory

Computer-Adaptive Assessments: Fundamentals and Considerations

Direct link

Mitchell, Alison M.; Truckenmiller, Adrea; Petscher, Yaacov – Communique, 2015

As part of the Race to the Top initiative, the United States Department of Education made nearly 1 billion dollars available in State Educational Technology grants with the goal of ramping up school technology. One result of this effort is that states, districts, and schools across the country are using computerized assessments to measure their…

Descriptors: Computer Assisted Testing, Educational Technology, Testing, Efficiency

Use of e-rater[R] in Scoring of the TOEFL iBT[R] Writing Test. Research Report. ETS RR-11-25

Download full text

Haberman, Shelby J. – Educational Testing Service, 2011

Alternative approaches are discussed for use of e-rater[R] to score the TOEFL iBT[R] Writing test. These approaches involve alternate criteria. In the 1st approach, the predicted variable is the expected rater score of the examinee's 2 essays. In the 2nd approach, the predicted variable is the expected rater score of 2 essay responses by the…

Descriptors: Writing Tests, Scoring, Essays, Language Tests

Analyzing Two Assumptions Underlying the Scoring of Classroom Assessments.

Download full text

Marzano, Robert J. – 2000

There has been little discussion of two conventions common within classroom assessment: the convention of representing student's performance on an assessment using a single score; and the convention of using the average score to summarize a student's performance over a set of assessments. This paper attempts to demonstrate that the assumptions…

Descriptors: Elementary Secondary Education, Scoring, Teacher Made Tests, Test Theory

Military Reading Assessment: What Theory Tells Us.

Oxford-Carpenter, Rebecca L.; Schultz-Shiner, Linda J. – 1985

This paper addresses practical Army problems in reading assessment from a theory base reflecting the most recent research on reading comprehension. Military and occupational research shows that reading proficiency is related to job performance. Reading assessment is a key issue in the Army due to changes in the reading ability levels of the Army…

Descriptors: Armed Forces, Military Personnel, Postsecondary Education, Psychometrics

Lingvo-didakticheskoe testirovanie (Language Testing).

Download full text

Kokkota, V. A. – 1989

This book contrasts non-Soviet approaches to language testing and provides definitions from four Soviet language test experts. The role of foreign language teaching, the function of tests, and theoretical problems are discussed, with considerable focus on communicative competence. The book discusses test standardization and classification and…

Descriptors: Communicative Competence (Languages), Foreign Countries, Language Skills, Language Tests

Computerized Adaptive Testing: A Four-year-old Pilot Shows that CAT Can Work.

Kingsbury, G. Gage; And Others – Technological Horizons in Education, 1988

Explores what some deem the best way to objectively determine what a student knows. Adaptive Testing has been around since the early 1900's, but only with the advent of computers has it been effectively applied to day to day educational management. Cites a pilot study in Portland, Oregon, public schools. (MVL)

Descriptors: Administration, Computer Uses in Education, Diagnostic Teaching, Individual Needs

Within-Group Norming and Other Forms of Score Adjustment in Preemployment Testing.

Peer reviewed

Sackett, Paul R.; Wilk, Steffanie L. – American Psychologist, 1994

Reviews the literature on subgroup norming in testing and examines several types of score-adjustment methods. The authors discuss social and policy perspectives as well as the scientific and theoretical underpinnings of score adjustment. (GLR)

Descriptors: Civil Rights Legislation, Employment Practices, Equal Opportunities (Jobs), Literature Reviews

Scoring	10
Test Theory	10
Test Interpretation	3
Test Items	3
Test Reliability	3
Testing	3
Computer Assisted Testing	2
Difficulty Level	2
Language Tests	2
Scoring Formulas	2
Test Validity	2
Accuracy	1
Administration	1
Answer Keys	1
Armed Forces	1
Civil Rights Legislation	1
Communicative Competence…	1
Comparative Analysis	1
Computer Uses in Education	1
Correlation	1
Cutting Scores	1
Diagnostic Teaching	1
Educational Benefits	1
Educational Technology	1
Efficiency	1
More ▼