ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	13
Since 2017 (last 10 years)	50
Since 2007 (last 20 years)	131

Descriptor

Probability	205
Test Items	205
Item Response Theory	77
Difficulty Level	46
Models	44
Simulation	35
Foreign Countries	32
Scores	32
Psychometrics	28
Statistical Analysis	28
Test Construction	27
Computation	25
Item Analysis	24
Mathematics Tests	24
Classification	23
Mathematical Models	23
Multiple Choice Tests	23
Comparative Analysis	22
Computer Assisted Testing	22
Test Bias	22
Adaptive Testing	21
Evaluation Methods	21
Ability	19
Achievement Tests	18
Scoring	17
More ▼

Publication Type

Journal Articles	154
Reports - Research	121
Reports - Evaluative	45
Reports - Descriptive	20
Speeches/Meeting Papers	16
Guides - Classroom - Teacher	7
Tests/Questionnaires	6
Dissertations/Theses -…	4
Books	3
Guides - Non-Classroom	2
Numerical/Quantitative Data	2
Collected Works - Proceedings	1
Opinion Papers	1
Reference Materials -…	1
Reports - General	1
More ▼

Education Level

Higher Education	27
Postsecondary Education	19
Elementary Education	13
Secondary Education	13
Elementary Secondary Education	8
Junior High Schools	7
Middle Schools	7
High Schools	6
Grade 8	5
Grade 12	3
Grade 7	3
Intermediate Grades	3
Grade 11	2
Grade 4	2
Grade 6	2
Early Childhood Education	1
Grade 3	1
Grade 5	1
Grade 9	1
Kindergarten	1
Two Year Colleges	1
More ▼

Audience

Practitioners	7
Teachers	4
Researchers	2

Location

South Africa	3
Taiwan	3
United Kingdom (England)	3
Australia	2
Canada	2
China	2
Germany	2
Israel	2
Turkey	2
Africa	1
Belgium	1
California	1
Colorado	1
Cyprus	1
Florida	1
Indonesia	1
Iran	1
Italy	1
Italy (Milan)	1
Japan	1
Mexico	1
Netherlands	1
Netherlands (Amsterdam)	1
Oregon	1
Singapore	1
More ▼

Laws, Policies, & Programs

Elementary and Secondary…	1
No Child Left Behind Act 2001	1

Assessments and Surveys

National Assessment of…	8
Program for International…	5
Trends in International…	4
Law School Admission Test	3
Texas Educational Assessment…	3
Graduate Record Examinations	2
Armed Services Vocational…	1
SAT (College Admission Test)	1
Stanford Early School…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 205 results Save | Export

Guesses and Slips as Proficiency-Related Phenomena and Impacts on Parameter Invariance

Peer reviewed

Direct link

Xiangyi Liao; Daniel M Bolt – Educational Measurement: Issues and Practice, 2024

Traditional approaches to the modeling of multiple-choice item response data (e.g., 3PL, 4PL models) emphasize slips and guesses as random events. In this paper, an item response model is presented that characterizes both disjunctively interacting guessing and conjunctively interacting slipping processes as proficiency-related phenomena. We show…

Descriptors: Item Response Theory, Test Items, Error Correction, Guessing (Tests)

Evaluation of Response Probabilities along Studied Latent Dimensions: A Polytomous Item Extension

Peer reviewed

Direct link

Raykov, Tenko; Huber, Chuck; Marcoulides, George A.; Pusic, Martin; Menold, Natalja – Measurement: Interdisciplinary Research and Perspectives, 2021

A readily and widely applicable procedure is discussed that can be used to point and interval estimate the probabilities of particular responses on polytomous items at pre-specified points along underlying latent continua. The items are assumed thereby to be part of unidimensional multi-component measuring instruments that may contain also binary…

Descriptors: Probability, Computation, Test Items, Responses

Testing for Differential Item Functioning under the "D"-Scoring Method

Peer reviewed

Direct link

Dimitrov, Dimiter M.; Atanasov, Dimitar V. – Educational and Psychological Measurement, 2022

This study offers an approach to testing for differential item functioning (DIF) in a recently developed measurement framework, referred to as "D"-scoring method (DSM). Under the proposed approach, called "P-Z" method of testing for DIF, the item response functions of two groups (reference and focal) are compared by…

Descriptors: Test Bias, Methods, Test Items, Scoring

Correcting for Extreme Response Style: Model Choice Matters

Peer reviewed

Direct link

Martijn Schoenmakers; Jesper Tijmstra; Jeroen Vermunt; Maria Bolsinova – Educational and Psychological Measurement, 2024

Extreme response style (ERS), the tendency of participants to select extreme item categories regardless of the item content, has frequently been found to decrease the validity of Likert-type questionnaire results. For this reason, various item response theory (IRT) models have been proposed to model ERS and correct for it. Comparisons of these…

Descriptors: Item Response Theory, Response Style (Tests), Models, Likert Scales

What Is Actually Equated in "Test Equating"? A Didactic Note

Peer reviewed

Direct link

van der Linden, Wim J. – Journal of Educational and Behavioral Statistics, 2022

The current literature on test equating generally defines it as the process necessary to obtain score comparability between different test forms. The definition is in contrast with Lord's foundational paper which viewed equating as the process required to obtain comparability of measurement scale between forms. The distinction between the notions…

Descriptors: Equated Scores, Test Items, Scores, Probability

How to Obtain the Most Error-Free Estimate of Reliability? Eight Sources of Deflation in the Estimates of Reliability to Avoid

Peer reviewed
PDF on ERIC

Download full text

Metsämuuronen, Jari – Practical Assessment, Research & Evaluation, 2022

The reliability of a test score is usually underestimated and the deflation may be profound, 0.40 - 0.60 units of reliability or 46 - 71%. Eight root sources of the deflation are discussed and quantified by a simulation with 1,440 real-world datasets: (1) errors in the measurement modelling, (2) inefficiency in the estimator of reliability within…

Descriptors: Test Reliability, Scores, Test Items, Correlation

Goodman-Kruskal Gamma and Dimension-Corrected Gamma in Educational Measurement Settings

Peer reviewed
PDF on ERIC

Download full text

Metsämuuronen, Jari – International Journal of Educational Methodology, 2021

Although Goodman-Kruskal gamma (G) is used relatively rarely it has promising potential as a coefficient of association in educational settings. Characteristics of G are studied in three sub-studies related to educational measurement settings. G appears to be unexpectedly appealing as an estimator of association between an item and a score because…

Descriptors: Educational Assessment, Measurement, Item Analysis, Correlation

An Improved Inferential Procedure to Evaluate Item Discriminations in a Conditional Maximum Likelihood Framework

Peer reviewed

Direct link

Clemens Draxler; Andreas Kurz; Can Gürer; Jan Philipp Nolte – Journal of Educational and Behavioral Statistics, 2024

A modified and improved inductive inferential approach to evaluate item discriminations in a conditional maximum likelihood and Rasch modeling framework is suggested. The new approach involves the derivation of four hypothesis tests. It implies a linear restriction of the assumed set of probability distributions in the classical approach that…

Descriptors: Inferences, Test Items, Item Analysis, Maximum Likelihood Statistics

Interval Estimation of Item Response Probabilities along Studied Latent Dimensions

Peer reviewed

Direct link

Raykov, Tenko; Marcoulides, George A.; Pusic, Martin – Measurement: Interdisciplinary Research and Perspectives, 2021

An interval estimation procedure is discussed that can be used to evaluate the probability of a particular response for a binary or binary scored item at a pre-specified point along an underlying latent continuum. The item is assumed to: (a) be part of a unidimensional multi-component measuring instrument that may contain also polytomous items,…

Descriptors: Item Response Theory, Computation, Probability, Test Items

The Reliability of the Posterior Probability of Skill Attainment in Diagnostic Classification Models

Peer reviewed

Direct link

Johnson, Matthew S.; Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2020

One common score reported from diagnostic classification assessments is the vector of posterior means of the skill mastery indicators. As with any assessment, it is important to derive and report estimates of the reliability of the reported scores. After reviewing a reliability measure suggested by Templin and Bradshaw, this article suggests three…

Descriptors: Reliability, Probability, Skill Development, Classification

Maintaining Score Scales over Time: A Comparison of Five Scoring Methods

Peer reviewed

Direct link

Kim, Stella Yun; Lee, Won-Chan – Applied Measurement in Education, 2023

This study evaluates various scoring methods including number-correct scoring, IRT theta scoring, and hybrid scoring in terms of scale-score stability over time. A simulation study was conducted to examine the relative performance of five scoring methods in terms of preserving the first two moments of scale scores for a population in a chain of…

Descriptors: Scoring, Comparative Analysis, Item Response Theory, Simulation

Changing the Success Probability in Computerized Adaptive Testing: A Monte-Carlo Simultion on the Open Matrices Item Bank

Peer reviewed
PDF on ERIC

Download full text

Hanif Akhtar – International Society for Technology, Education, and Science, 2023

For efficiency, Computerized Adaptive Test (CAT) algorithm selects items with the maximum information, typically with a 50% probability of being answered correctly. However, examinees may not be satisfied if they only correctly answer 50% of the items. Researchers discovered that changing the item selection algorithms to choose easier items (i.e.,…

Descriptors: Success, Probability, Computer Assisted Testing, Adaptive Testing

Testing the Within-State Distribution in Mixture Models for Responses and Response Times

Peer reviewed

Direct link

Kuijpers, Renske E.; Visser, Ingmar; Molenaar, Dylan – Journal of Educational and Behavioral Statistics, 2021

Mixture models have been developed to enable detection of within-subject differences in responses and response times to psychometric test items. To enable mixture modeling of both responses and response times, a distributional assumption is needed for the within-state response time distribution. Since violations of the assumed response time…

Descriptors: Test Items, Responses, Reaction Time, Models

Response Quality in Nonprobability and Probability-Based Online Panels

Peer reviewed

Direct link

Cornesse, Carina; Blom, Annelies G. – Sociological Methods & Research, 2023

Recent years have seen a growing number of studies investigating the accuracy of nonprobability online panels; however, response quality in nonprobability online panels has not yet received much attention. To fill this gap, we investigate response quality in a comprehensive study of seven nonprobability online panels and three probability-based…

Descriptors: Probability, Sampling, Social Science Research, Research Methodology

Preparation Techniques for Multiple-Choice Possibility Measurement Tools (MCPMT) in Education

Peer reviewed
PDF on ERIC

Download full text

Ismail, Yilmaz – Educational Research and Reviews, 2020

This study draws on the understanding that when the correlation between variables is not known yet the non-linear expectation in the correlation between the variables is present, non-linear measurement tools can be used. In education, possibility measurement tools can be used for non-linear measurement. Multiple-choice possibility measurement…

Descriptors: Multiple Choice Tests, Measurement Techniques, Student Evaluation, Test Items

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 14

Educational and Psychological…	20
Journal of Educational and…	15
Journal of Educational…	12
Applied Psychological…	11
Psychometrika	9
Applied Measurement in…	8
Measurement:…	8
ETS Research Report Series	7
International Journal of…	7
Practical Assessment,…	5
ProQuest LLC	4
Educational Sciences: Theory…	2
Eurasian Journal of…	2
Hacettepe University Journal…	2
Journal of Experimental…	2
National Council of Teachers…	2
Assessment & Evaluation in…	1
Assessment for Effective…	1
Athens Journal of Education	1
Child Development	1
Cognition	1
Computers & Education	1
E-Journal of Instructional…	1
EURASIA Journal of…	1
EURASIA Journal of…	1
More ▼

van der Linden, Wim J.	9
Veldkamp, Bernard P.	4
Wilcox, Rand R.	4
Gierl, Mark J.	3
Marcoulides, George A.	3
Raykov, Tenko	3
Sijtsma, Klaas	3
Wyse, Adam E.	3
von Davier, Matthias	3
Beretvas, S. Natasha	2
Chang, Hua-Hua	2
Chen, Yi-Hsin	2
De Boeck, Paul	2
Dimitrov, Dimiter M.	2
Emons, Wilco H. M.	2
Ferdous, Abdullah A.	2
Henson, Robert	2
Johnson, Matthew S.	2
Liu, Yan	2
Lord, Frederic M.	2
Metsämuuronen, Jari	2
Mislevy, Robert J.	2
Plake, Barbara S.	2
Pusic, Martin	2
More ▼