ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	6
Since 2016 (last 10 years)	11
Since 2006 (last 20 years)	28

Descriptor

Scoring	72
Test Theory	72
Test Items	24
Test Construction	21
Test Reliability	19
Item Response Theory	17
Item Analysis	14
Psychometrics	14
Measurement Techniques	13
Statistical Analysis	12
Computer Assisted Testing	11
Difficulty Level	11
Test Validity	11
Testing	11
Latent Trait Theory	10
Test Interpretation	10
Comparative Analysis	9
Evaluation Methods	9
Models	9
Foreign Countries	8
Multiple Choice Tests	8
Scores	8
Equated Scores	7
Mathematical Models	7
Student Evaluation	7
More ▼

Publication Type

Reports - Research	38
Journal Articles	34
Speeches/Meeting Papers	13
Reports - Descriptive	10
Reports - Evaluative	10
Books	4
Guides - Non-Classroom	3
Information Analyses	3
Tests/Questionnaires	3
Dissertations/Theses -…	2
ERIC Digests in Full Text	2
ERIC Publications	2
Opinion Papers	2
Book/Product Reviews	1
Collected Works - General	1
Guides - Classroom - Learner	1
Numerical/Quantitative Data	1
Reference Materials -…	1
More ▼

Education Level

Higher Education	9
Postsecondary Education	6
Secondary Education	5
Elementary Education	4
High Schools	3
Grade 3	2
Grade 4	2
Grade 8	2
Intermediate Grades	2
Junior High Schools	2
Middle Schools	2
Adult Education	1
Early Childhood Education	1
Grade 10	1
Grade 9	1
Primary Education	1
More ▼

Audience

Researchers	7
Practitioners	2
Teachers	2
Students	1

Location

New York	2
Alabama	1
Australia	1
California	1
Canada	1
Colorado	1
Florida	1
Illinois	1
Indiana	1
Jordan	1
New York (New York)	1
Nigeria	1
Singapore	1
Sweden	1
Thailand	1
USSR	1
United States	1
More ▼

Laws, Policies, & Programs

Elementary and Secondary…

Assessments and Surveys

National Assessment of…	3
ACT Assessment	2
Alabama High School…	1
Armed Services Vocational…	1
General Aptitude Test Battery	1
Graduate Record Examinations	1
Preliminary Scholastic…	1
SAT (College Admission Test)	1
Test of English as a Foreign…	1
Thematic Apperception Test	1

What Works Clearinghouse Rating

Showing 1 to 15 of 72 results Save | Export

Comparison of the Results of the Generalizability Theory with the Inter-Rater Agreement Coefficients

Peer reviewed
PDF on ERIC

Download full text

Eser, Mehmet Taha; Aksu, Gökhan – International Journal of Curriculum and Instruction, 2022

The agreement between raters is examined within the scope of the concept of "inter-rater reliability". Although there are clear definitions of the concepts of agreement between raters and reliability between raters, there is no clear information about the conditions under which agreement and reliability level methods are appropriate to…

Descriptors: Generalizability Theory, Interrater Reliability, Evaluation Methods, Test Theory

Identifying Enemy Item Pairs Using Natural Language Processing

Peer reviewed

Direct link

Becker, Kirk A.; Kao, Shu-chuan – Journal of Applied Testing Technology, 2022

Natural Language Processing (NLP) offers methods for understanding and quantifying the similarity between written documents. Within the testing industry these methods have been used for automatic item generation, automated scoring of text and speech, modeling item characteristics, automatic question answering, machine translation, and automated…

Descriptors: Item Banks, Natural Language Processing, Computer Assisted Testing, Scoring

Evidence for Validity and Reliability of a Research-Based Assessment Instrument on Measurement Uncertainty

Peer reviewed

Direct link

Gayle Geschwind; Michael Vignal; Marcos D. Caballero; H.? J. Lewandowski – Physical Review Physics Education Research, 2024

The Survey of Physics Reasoning on Uncertainty Concepts in Experiments (SPRUCE) was designed to measure students' proficiency with measurement uncertainty concepts and practices across ten different assessment objectives to help facilitate the improvement of laboratory instruction focused on this important topic. To ensure the reliability and…

Descriptors: Measurement, Ambiguity (Context), Scientific Concepts, Physics

Establishing a Physics Concept Inventory Using Computer Marked Free-Response Questions

Peer reviewed
PDF on ERIC

Download full text

Parker, Mark A. J.; Hedgeland, Holly; Jordan, Sally E.; Braithwaite, Nicholas St. J. – European Journal of Science and Mathematics Education, 2023

The study covers the development and testing of the alternative mechanics survey (AMS), a modified force concept inventory (FCI), which used automatically marked free-response questions. Data were collected over a period of three academic years from 611 participants who were taking physics classes at high school and university level. A total of…

Descriptors: Test Construction, Scientific Concepts, Physics, Test Reliability

A General Method for Adjusting Test Score Distributions to Account for Rescoring and Retesting

Peer reviewed

Direct link

Sophie Litschwartz – Society for Research on Educational Effectiveness, 2021

Background/Context: Pass/fail standardized exams frequently selectively rescore failing exams and retest failing examinees. This practice distorts the test score distribution and can confuse those who do analysis on these distributions. In 2011, the Wall Street Journal showed large discontinuities in the New York City Regent test score…

Descriptors: Standardized Tests, Pass Fail Grading, Scoring Rubrics, Scoring Formulas

Validation of a Coupled, Multiple Response Assessment for Upper-Division Thermal Physics

Peer reviewed

Direct link

Rainey, Katherine D.; Vignal, Michael; Wilcox, Bethany R. – Physical Review Physics Education Research, 2022

Currently there are no assessment instruments available for upper-division thermal physics, though several introductory assessments are currently available. Notably missing from these introductory assessment are items targeting statistical mechanics. This leaves a gap in the content that can be assessed by upper-division thermal physics faculty.…

Descriptors: Physics, Science Instruction, Thermodynamics, College Science

A Design for Comparing CTT and IRT in Test Assembly, Scoring and Argumentation: Differences among Reliability, Information and Validation

Peer reviewed

Direct link

Alqarni, Abdulelah Mohammed – Journal on Educational Psychology, 2019

This study compares the psychometric properties of reliability in Classical Test Theory (CTT), item information in Item Response Theory (IRT), and validation from the perspective of modern validity theory for the purpose of bringing attention to potential issues that might exist when testing organizations use both test theories in the same testing…

Descriptors: Test Theory, Item Response Theory, Test Construction, Scoring

Invariance Person Estimate of Basic Education Certificate Examination: Classical Test Theory and Item Response Theory Scoring Perspective

Peer reviewed
PDF on ERIC

Download full text

Ayanwale, Musa Adekunle; Adeleke, Joshua Oluwatoyin; Mamadelo, Titilayo Iyabode – Journal of the International Society for Teacher Education, 2019

A scoring framework that does not reflect true performance of an examinee would ultimately result in an abnormal score. This study assessed invariance person estimates of 2017 Nigerian National Examinations Council Basic Education Certificate Examination Mathematics Multiple Choice using classical test theory (CTT) and item response theory (IRT)…

Descriptors: Test Theory, Item Response Theory, Scoring, National Competency Tests

Accuracy of a Classical Test Theory-Based Procedure for Estimating the Reliability of a Multistage Test. Research Report. ETS RR-17-02

Peer reviewed
PDF on ERIC

Download full text

Kim, Sooyeon; Livingston, Samuel A. – ETS Research Report Series, 2017

The purpose of this simulation study was to assess the accuracy of a classical test theory (CTT)-based procedure for estimating the alternate-forms reliability of scores on a multistage test (MST) having 3 stages. We generated item difficulty and discrimination parameters for 10 parallel, nonoverlapping forms of the complete 3-stage test and…

Descriptors: Accuracy, Test Theory, Test Reliability, Adaptive Testing

ITC Guidelines on Quality Control in Scoring, Test Analysis, and Reporting of Test Scores

Peer reviewed

Direct link

Allalouf, Avi – International Journal of Testing, 2014

The Quality Control (QC) Guidelines are intended to increase the efficiency, precision, and accuracy of the scoring, analysis, and reporting process of testing. The QC Guidelines focus on large-scale testing operations where multiple forms of tests are created for use on set dates. However, they may also be used for a wide variety of other testing…

Descriptors: Quality Control, Scoring, Test Theory, Scores

A Strategy for Replacing Sum Scoring

Peer reviewed

Direct link

Ramsay, James O.; Wiberg, Marie – Journal of Educational and Behavioral Statistics, 2017

This article promotes the use of modern test theory in testing situations where sum scores for binary responses are now used. It directly compares the efficiencies and biases of classical and modern test analyses and finds an improvement in the root mean squared error of ability estimates of about 5% for two designed multiple-choice tests and…

Descriptors: Scoring, Test Theory, Computation, Maximum Likelihood Statistics

An Alternative Approach to Test Analysis and Interpretation

Download full text

Powell, J. C. – International Association for Development of the Information Society, 2013

This reflection paper challenges current test scoring practices on the grounds that most wrong-answer selections are thoughtful not random, presenting research supporting this proposition. An alternative test scoring system is presented, described and its outcomes discussed. This new scoring system increases the number of variables considered,…

Descriptors: Test Theory, Test Interpretation, Scoring, Multiple Choice Tests

Psychometric Report for the Early Fractions Test Administered with Third- and Fourth-Grade Students in Fall 2016. Research Report No. 2017-10

Download full text

Schoen, Robert C.; Liu, Sicong; Yang, Xiaotong; Paek, Insu – Grantee Submission, 2017

The Early Fractions Test is a paper-pencil test designed to measure mathematics achievement of third- and fourth-grade students in the domain of fractions. The purpose, or intended use, of the Early Fractions Test is to serve as a student pretest covariate and a test of baseline equivalence in the larger study. In this report, we discuss our…

Descriptors: Mathematics Achievement, Fractions, Mathematics Tests, Grade 3

Maximum Likelihood Item Easiness Models for Test Theory without an Answer Key

Peer reviewed

Direct link

France, Stephen L.; Batchelder, William H. – Educational and Psychological Measurement, 2015

Cultural consensus theory (CCT) is a data aggregation technique with many applications in the social and behavioral sciences. We describe the intuition and theory behind a set of CCT models for continuous type data using maximum likelihood inference methodology. We describe how bias parameters can be incorporated into these models. We introduce…

Descriptors: Maximum Likelihood Statistics, Test Items, Difficulty Level, Test Theory

Computer-Adaptive Assessments: Fundamentals and Considerations

Direct link

Mitchell, Alison M.; Truckenmiller, Adrea; Petscher, Yaacov – Communique, 2015

As part of the Race to the Top initiative, the United States Department of Education made nearly 1 billion dollars available in State Educational Technology grants with the goal of ramping up school technology. One result of this effort is that states, districts, and schools across the country are using computerized assessments to measure their…

Descriptors: Computer Assisted Testing, Educational Technology, Testing, Efficiency

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5

International Journal of…	3
ETS Research Report Series	2
Educational Measurement:…	2
Journal of Educational…	2
Online Submission	2
Physical Review Physics…	2
ProQuest LLC	2
Studies in Educational…	2
American Psychologist	1
Anatomical Sciences Education	1
Australian Journal of…	1
College Board	1
College Composition and…	1
Communique	1
Educational Testing Service	1
Educational and Psychological…	1
European Journal of…	1
European Journal of Science…	1
Grantee Submission	1
Instructional Science	1
International Association for…	1
International Journal of…	1
Journal of Applied Testing…	1
Journal of Computers in…	1
Journal of Educational…	1
More ▼

Cook, Linda L.	2
Abedalaziz, Nabeel	1
Adeleke, Joshua Oluwatoyin	1
Aksu, Gökhan	1
Algina, James	1
Allalouf, Avi	1
Alqarni, Abdulelah Mohammed	1
Ayanwale, Musa Adekunle	1
Badjadi, Nour El Imane	1
Baker, Eva L.	1
Batchelder, William H.	1
Beaujean, A. Alexander	1
Becker, Kirk A.	1
Bhaskar, R.	1
Braithwaite, Nicholas St. J.	1
Buhr, Dianne C.	1
Callinan, Sarah	1
Chase, Clinton I.	1
Childs, Ruth A.	1
Cohen, Allan S., Comp.	1
Cope, Ronald T.	1
Costantino, Giuseppe	1
Crocker, Linda	1
Cunningham, Everarda	1
More ▼