ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	5
Since 2006 (last 20 years)	11

Descriptor

Scoring	24
Test Items	24
Test Theory	24
Difficulty Level	9
Test Construction	8
Item Analysis	7
Item Response Theory	6
Models	6
Psychometrics	6
Latent Trait Theory	5
Measurement Techniques	5
Statistical Analysis	5
Comparative Analysis	4
Multiple Choice Tests	4
Test Reliability	4
Computer Assisted Testing	3
Correlation	3
Goodness of Fit	3
Mathematical Models	3
Simulation	3
Testing	3
Classification	2
College Entrance Examinations	2
Computation	2
Culture Fair Tests	2
More ▼

Source

ProQuest LLC	2
Studies in Educational…	2
College Board	1
Communique	1
ETS Research Report Series	1
Educational Measurement:…	1
Educational and Psychological…	1
European Journal of Science…	1
Grantee Submission	1
Instructional Science	1
Journal of Applied Testing…	1
Journal of Educational and…	1
Journal on Educational…	1
Psychometrika	1
Technological Horizons in…	1
More ▼

Publication Type

Reports - Research	14
Journal Articles	13
Reports - Descriptive	3
Reports - Evaluative	3
Speeches/Meeting Papers	3
Dissertations/Theses -…	2
ERIC Digests in Full Text	2
ERIC Publications	2
Tests/Questionnaires	1

Education Level

Secondary Education	3
Elementary Education	2
Grade 4	2
High Schools	2
Higher Education	2
Intermediate Grades	2
Postsecondary Education	2
Early Childhood Education	1
Grade 3	1
Grade 8	1
Junior High Schools	1
Middle Schools	1
Primary Education	1
More ▼

Audience

Researchers

Location

California	1
Colorado	1
Florida	1
Illinois	1
Indiana	1
New York	1
Singapore	1
Sweden	1
United States	1

Laws, Policies, & Programs

Assessments and Surveys

National Assessment of…	2
ACT Assessment	1
Graduate Record Examinations	1
Preliminary Scholastic…	1
SAT (College Admission Test)	1

What Works Clearinghouse Rating

Showing 1 to 15 of 24 results Save | Export

Identifying Enemy Item Pairs Using Natural Language Processing

Peer reviewed

Direct link

Becker, Kirk A.; Kao, Shu-chuan – Journal of Applied Testing Technology, 2022

Natural Language Processing (NLP) offers methods for understanding and quantifying the similarity between written documents. Within the testing industry these methods have been used for automatic item generation, automated scoring of text and speech, modeling item characteristics, automatic question answering, machine translation, and automated…

Descriptors: Item Banks, Natural Language Processing, Computer Assisted Testing, Scoring

Establishing a Physics Concept Inventory Using Computer Marked Free-Response Questions

Peer reviewed
PDF on ERIC

Download full text

Parker, Mark A. J.; Hedgeland, Holly; Jordan, Sally E.; Braithwaite, Nicholas St. J. – European Journal of Science and Mathematics Education, 2023

The study covers the development and testing of the alternative mechanics survey (AMS), a modified force concept inventory (FCI), which used automatically marked free-response questions. Data were collected over a period of three academic years from 611 participants who were taking physics classes at high school and university level. A total of…

Descriptors: Test Construction, Scientific Concepts, Physics, Test Reliability

A Design for Comparing CTT and IRT in Test Assembly, Scoring and Argumentation: Differences among Reliability, Information and Validation

Peer reviewed

Direct link

Alqarni, Abdulelah Mohammed – Journal on Educational Psychology, 2019

This study compares the psychometric properties of reliability in Classical Test Theory (CTT), item information in Item Response Theory (IRT), and validation from the perspective of modern validity theory for the purpose of bringing attention to potential issues that might exist when testing organizations use both test theories in the same testing…

Descriptors: Test Theory, Item Response Theory, Test Construction, Scoring

A Strategy for Replacing Sum Scoring

Peer reviewed

Direct link

Ramsay, James O.; Wiberg, Marie – Journal of Educational and Behavioral Statistics, 2017

This article promotes the use of modern test theory in testing situations where sum scores for binary responses are now used. It directly compares the efficiencies and biases of classical and modern test analyses and finds an improvement in the root mean squared error of ability estimates of about 5% for two designed multiple-choice tests and…

Descriptors: Scoring, Test Theory, Computation, Maximum Likelihood Statistics

Psychometric Report for the Early Fractions Test Administered with Third- and Fourth-Grade Students in Fall 2016. Research Report No. 2017-10

Download full text

Schoen, Robert C.; Liu, Sicong; Yang, Xiaotong; Paek, Insu – Grantee Submission, 2017

The Early Fractions Test is a paper-pencil test designed to measure mathematics achievement of third- and fourth-grade students in the domain of fractions. The purpose, or intended use, of the Early Fractions Test is to serve as a student pretest covariate and a test of baseline equivalence in the larger study. In this report, we discuss our…

Descriptors: Mathematics Achievement, Fractions, Mathematics Tests, Grade 3

Maximum Likelihood Item Easiness Models for Test Theory without an Answer Key

Peer reviewed

Direct link

France, Stephen L.; Batchelder, William H. – Educational and Psychological Measurement, 2015

Cultural consensus theory (CCT) is a data aggregation technique with many applications in the social and behavioral sciences. We describe the intuition and theory behind a set of CCT models for continuous type data using maximum likelihood inference methodology. We describe how bias parameters can be incorporated into these models. We introduce…

Descriptors: Maximum Likelihood Statistics, Test Items, Difficulty Level, Test Theory

Computer-Adaptive Assessments: Fundamentals and Considerations

Direct link

Mitchell, Alison M.; Truckenmiller, Adrea; Petscher, Yaacov – Communique, 2015

As part of the Race to the Top initiative, the United States Department of Education made nearly 1 billion dollars available in State Educational Technology grants with the goal of ramping up school technology. One result of this effort is that states, districts, and schools across the country are using computerized assessments to measure their…

Descriptors: Computer Assisted Testing, Educational Technology, Testing, Efficiency

Rating Quality Studies Using Rasch Measurement Theory. Research Report 2013-3

Download full text

Engelhard, George, Jr.; Wind, Stefanie A. – College Board, 2013

The major purpose of this study is to examine the quality of ratings assigned to CR (constructed-response) questions in large-scale assessments from the perspective of Rasch Measurement Theory. Rasch Measurement Theory provides a framework for the examination of rating scale category structure that can yield useful information for interpreting the…

Descriptors: Measurement Techniques, Rating Scales, Test Theory, Scores

Impact of Psychometric Decisions on Assessment Outcomes in an Alternate Assessment

Direct link

Rao, Vasanthi – ProQuest LLC, 2012

In 1997, based on the amendments to Individuals with Disabilities Education Act (IDEA), all states were faced with a statutory requirement to develop and implement alternate assessments for students with disabilities unable to participate in the statewide large-scale assessment. States were given the challenge of creating, implementing, and…

Descriptors: Alternative Assessment, Psychometrics, Item Response Theory, Models

The Contestant Perspective on Taking Tests: Emanations from the Statue within

Peer reviewed

Direct link

Dorans, Neil J. – Educational Measurement: Issues and Practice, 2012

Views on testing--its purpose and uses and how its data are analyzed--are related to one's perspective on test takers. Test takers can be viewed as learners, examinees, or contestants. I briefly discuss the perspective of test takers as learners. I maintain that much of psychometrics views test takers as examinees. I discuss test takers as a…

Descriptors: Testing, Test Theory, Item Response Theory, Test Reliability

Evaluating IRT- and CTT-Based Methods of Estimating Classification Consistency and Accuracy Indices from Single Administrations

Direct link

Deng, Nina – ProQuest LLC, 2011

Three decision consistency and accuracy (DC/DA) methods, the Livingston and Lewis (LL) method, LEE method, and the Hambleton and Han (HH) method, were evaluated. The purposes of the study were: (1) to evaluate the accuracy and robustness of these methods, especially when their assumptions were not well satisfied, (2) to investigate the "true"…

Descriptors: Item Response Theory, Test Theory, Computation, Classification

Conjunctive Item Response Theory Kernels.

Peer reviewed

Jannarone, Robert J. – Psychometrika, 1986

Conjunctive item response models are introduced such that: (1) sufficient statistics for latent traits are not necessarily additive in item scores; (2) items are not necessarily locally independent; and (3) existing compensatory (additive) item response models including the binomial, Rasch, logistic, and general locally independent model are…

Descriptors: Cognitive Processes, Hypothesis Testing, Latent Trait Theory, Mathematical Models

The Central Role of Content Representation in Test Validity.

Download full text

Sireci, Stephen G. – 1995

The purpose of this paper is to clarify the seemingly discrepant views of test theorists and test developers about terminology related to the evaluation of test content. The origin and evolution of the concept of content validity are traced, and the concept is reformulated in a way that emphasizes the notion that content domain definition,…

Descriptors: Construct Validity, Content Validity, Definitions, Item Analysis

Conditional Covariance Theory and DETECT for Polytomous Items. Research Report. ETS RR-04-50

Peer reviewed
PDF on ERIC

Download full text

Zhang, Jinming – ETS Research Report Series, 2004

This paper extends the theory of conditional covariances to polytomous items. It has been mathematically proven that under some mild conditions, commonly assumed in the analysis of response data, the conditional covariance of two items, dichotomously or polytomously scored, is positive if the two items are dimensionally homogeneous and negative…

Descriptors: Test Items, Test Theory, Correlation, National Competency Tests

The Trait in Latent Trait Theory.

Levine, Michael V. – 1982

Significant to a latent trait or item response theory analysis of a mental test is the determination of exactly what is being quantified. The following are practical problems to be considered in the formulation of a good theory: (1) deciding whether two tests measure the same trait or traits; (2) analyzing the relative contributions of a pair of…

Descriptors: Item Analysis, Latent Trait Theory, Mathematical Models, Measurement Techniques

Previous Page | Next Page »

Pages: 1 | 2

Alqarni, Abdulelah Mohammed	1
Batchelder, William H.	1
Becker, Kirk A.	1
Bhaskar, R.	1
Braithwaite, Nicholas St. J.	1
Cook, Linda L.	1
Deng, Nina	1
Dillard, Jesse F.	1
Dorans, Neil J.	1
Engelhard, George, Jr.	1
France, Stephen L.	1
Frary, Robert B.	1
Hambleton, Ronald K.	1
Hedgeland, Holly	1
Ismail, Ainon	1
Jannarone, Robert J.	1
Jordan, Sally E.	1
Kao, Shu-chuan	1
Kehoe, Jerard	1
Kingsbury, G. Gage	1
Levine, Michael V.	1
Liu, Sicong	1
Mitchell, Alison M.	1
O'Brien, Michael L.	1
More ▼