ERIC - Search Results

Showing 1 to 15 of 20 results

Evaluation of Response Probabilities along Studied Latent Dimensions: A Polytomous Item Extension

Peer reviewed

Raykov, Tenko; Huber, Chuck; Marcoulides, George A.; Pusic, Martin; Menold, Natalja – Measurement: Interdisciplinary Research and Perspectives, 2021

A readily and widely applicable procedure is discussed that can be used to point and interval estimate the probabilities of particular responses on polytomous items at pre-specified points along underlying latent continua. The items are assumed thereby to be part of unidimensional multi-component measuring instruments that may contain also binary…

Descriptors: Probability, Computation, Test Items, Responses

Guesses and Slips as Proficiency-Related Phenomena and Impacts on Parameter Invariance

Peer reviewed

Xiangyi Liao; Daniel M Bolt – Educational Measurement: Issues and Practice, 2024

Traditional approaches to the modeling of multiple-choice item response data (e.g., 3PL, 4PL models) emphasize slips and guesses as random events. In this paper, an item response model is presented that characterizes both disjunctively interacting guessing and conjunctively interacting slipping processes as proficiency-related phenomena. We show…

Descriptors: Item Response Theory, Test Items, Error Correction, Guessing (Tests)

Testing for Differential Item Functioning under the "D"-Scoring Method

Peer reviewed

Dimitrov, Dimiter M.; Atanasov, Dimitar V. – Educational and Psychological Measurement, 2022

This study offers an approach to testing for differential item functioning (DIF) in a recently developed measurement framework, referred to as "D"-scoring method (DSM). Under the proposed approach, called "P-Z" method of testing for DIF, the item response functions of two groups (reference and focal) are compared by…

Descriptors: Test Bias, Methods, Test Items, Scoring

Correcting for Extreme Response Style: Model Choice Matters

Peer reviewed

Martijn Schoenmakers; Jesper Tijmstra; Jeroen Vermunt; Maria Bolsinova – Educational and Psychological Measurement, 2024

Extreme response style (ERS), the tendency of participants to select extreme item categories regardless of the item content, has frequently been found to decrease the validity of Likert-type questionnaire results. For this reason, various item response theory (IRT) models have been proposed to model ERS and correct for it. Comparisons of these…

Descriptors: Item Response Theory, Response Style (Tests), Models, Likert Scales

What Is Actually Equated in "Test Equating"? A Didactic Note

Peer reviewed

van der Linden, Wim J. – Journal of Educational and Behavioral Statistics, 2022

The current literature on test equating generally defines it as the process necessary to obtain score comparability between different test forms. The definition is in contrast with Lord's foundational paper which viewed equating as the process required to obtain comparability of measurement scale between forms. The distinction between the notions…

Descriptors: Equated Scores, Test Items, Scores, Probability

How to Obtain the Most Error-Free Estimate of Reliability? Eight Sources of Deflation in the Estimates of Reliability to Avoid

Peer reviewed

Metsämuuronen, Jari – Practical Assessment, Research & Evaluation, 2022

The reliability of a test score is usually underestimated and the deflation may be profound, 0.40 - 0.60 units of reliability or 46 - 71%. Eight root sources of the deflation are discussed and quantified by a simulation with 1,440 real-world datasets: (1) errors in the measurement modelling, (2) inefficiency in the estimator of reliability within…

Descriptors: Test Reliability, Scores, Test Items, Correlation

Goodman-Kruskal Gamma and Dimension-Corrected Gamma in Educational Measurement Settings

Peer reviewed

Metsämuuronen, Jari – International Journal of Educational Methodology, 2021

Although Goodman-Kruskal gamma (G) is used relatively rarely it has promising potential as a coefficient of association in educational settings. Characteristics of G are studied in three sub-studies related to educational measurement settings. G appears to be unexpectedly appealing as an estimator of association between an item and a score because…

Descriptors: Educational Assessment, Measurement, Item Analysis, Correlation

An Improved Inferential Procedure to Evaluate Item Discriminations in a Conditional Maximum Likelihood Framework

Peer reviewed

Clemens Draxler; Andreas Kurz; Can Gürer; Jan Philipp Nolte – Journal of Educational and Behavioral Statistics, 2024

A modified and improved inductive inferential approach to evaluate item discriminations in a conditional maximum likelihood and Rasch modeling framework is suggested. The new approach involves the derivation of four hypothesis tests. It implies a linear restriction of the assumed set of probability distributions in the classical approach that…

Descriptors: Inferences, Test Items, Item Analysis, Maximum Likelihood Statistics

Interval Estimation of Item Response Probabilities along Studied Latent Dimensions

Peer reviewed

Raykov, Tenko; Marcoulides, George A.; Pusic, Martin – Measurement: Interdisciplinary Research and Perspectives, 2021

An interval estimation procedure is discussed that can be used to evaluate the probability of a particular response for a binary or binary scored item at a pre-specified point along an underlying latent continuum. The item is assumed to: (a) be part of a unidimensional multi-component measuring instrument that may contain also polytomous items,…

Descriptors: Item Response Theory, Computation, Probability, Test Items

Maintaining Score Scales over Time: A Comparison of Five Scoring Methods

Peer reviewed

Kim, Stella Yun; Lee, Won-Chan – Applied Measurement in Education, 2023

This study evaluates various scoring methods including number-correct scoring, IRT theta scoring, and hybrid scoring in terms of scale-score stability over time. A simulation study was conducted to examine the relative performance of five scoring methods in terms of preserving the first two moments of scale scores for a population in a chain of…

Descriptors: Scoring, Comparative Analysis, Item Response Theory, Simulation

Changing the Success Probability in Computerized Adaptive Testing: A Monte-Carlo Simulation on the Open Matrices Item Bank

Peer reviewed

Hanif Akhtar – International Society for Technology, Education, and Science, 2023

For efficiency, Computerized Adaptive Test (CAT) algorithm selects items with the maximum information, typically with a 50% probability of being answered correctly. However, examinees may not be satisfied if they only correctly answer 50% of the items. Researchers discovered that changing the item selection algorithms to choose easier items (i.e.,…

Descriptors: Success, Probability, Computer Assisted Testing, Adaptive Testing

Testing the Within-State Distribution in Mixture Models for Responses and Response Times

Peer reviewed

Kuijpers, Renske E.; Visser, Ingmar; Molenaar, Dylan – Journal of Educational and Behavioral Statistics, 2021

Mixture models have been developed to enable detection of within-subject differences in responses and response times to psychometric test items. To enable mixture modeling of both responses and response times, a distributional assumption is needed for the within-state response time distribution. Since violations of the assumed response time…

Descriptors: Test Items, Responses, Reaction Time, Models

Response Quality in Nonprobability and Probability-Based Online Panels

Peer reviewed

Cornesse, Carina; Blom, Annelies G. – Sociological Methods & Research, 2023

Recent years have seen a growing number of studies investigating the accuracy of nonprobability online panels; however, response quality in nonprobability online panels has not yet received much attention. To fill this gap, we investigate response quality in a comprehensive study of seven nonprobability online panels and three probability-based…

Descriptors: Probability, Sampling, Social Science Research, Research Methodology

Equality of Admission Tests Using Kernel Equating under the Non-Equivalent Groups with Covariates Design

Peer reviewed

Altintas, Ozge; Wallin, Gabriel – International Journal of Assessment Tools in Education, 2021

Educational assessment tests are designed to measure the same psychological constructs over extended periods. This feature is important considering that test results are often used for admittance to university programs. To ensure fair assessments, especially for those whose results weigh heavily in selection decisions, it is necessary to collect…

Descriptors: College Admission, College Entrance Examinations, Test Bias, Equated Scores

Closed Formula of Test Length Required for Adaptive Testing with Medium Probability of Solution

Peer reviewed

Kárász, Judit T.; Széll, Krisztián; Takács, Szabolcs – Quality Assurance in Education: An International Perspective, 2023

Purpose: Based on the general formula, which depends on the length and difficulty of the test, the number of respondents and the number of ability levels, this study aims to provide a closed formula for the adaptive tests with medium difficulty (probability of solution is p = 1/2) to determine the accuracy of the parameters for each item and in…

Descriptors: Test Length, Probability, Comparative Analysis, Difficulty Level
