ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	3
Since 2006 (last 20 years)	9

Descriptor

Item Response Theory	46
Statistical Distributions	46
Estimation (Mathematics)	17
Test Items	15
Mathematical Models	14
Models	13
Ability	11
Simulation	11
Computer Simulation	10
Equations (Mathematics)	10
Goodness of Fit	9
Comparative Analysis	8
Scores	7
Adaptive Testing	6
Bayesian Statistics	6
Equated Scores	6
Item Bias	6
Maximum Likelihood Statistics	6
Probability	6
Computer Assisted Testing	5
Sample Size	5
Error of Measurement	4
Test Length	4
Computation	3
Difficulty Level	3
More ▼

Source

Applied Psychological…	10
Journal of Educational…	6
Educational and Psychological…	4
Psychometrika	4
Journal of Educational and…	2
Journal of Outcome Measurement	2
ACT, Inc.	1
Cognitive Science	1
International Educational…	1
National Center for Research…	1
Online Submission	1
More ▼

Publication Type

Reports - Evaluative	46
Journal Articles	30
Speeches/Meeting Papers	12

Education Level

Higher Education	2
Postsecondary Education	2

Audience

Location

Japan

Laws, Policies, & Programs

Assessments and Surveys

National Assessment of…	2
ACT Assessment	1
Law School Admission Test	1
Work Keys (ACT)	1

What Works Clearinghouse Rating

Showing 1 to 15 of 46 results Save | Export

Model Misspecification and Robustness of Observed-Score Test Equating Using Propensity Scores

Peer reviewed

Direct link

Wallin, Gabriel; Wiberg, Marie – Journal of Educational and Behavioral Statistics, 2023

This study explores the usefulness of covariates on equating test scores from nonequivalent test groups. The covariates are captured by an estimated propensity score, which is used as a proxy for latent ability to balance the test groups. The objective is to assess the sensitivity of the equated scores to various misspecifications in the…

Descriptors: Models, Error of Measurement, Robustness (Statistics), Equated Scores

Grades Are Not Normal: Improving Exam Score Models Using the Logit-Normal Distribution

Peer reviewed
PDF on ERIC

Download full text

Arthurs, Noah; Stenhaug, Ben; Karayev, Sergey; Piech, Chris – International Educational Data Mining Society, 2019

Understanding exam score distributions has implications for item response theory (IRT), grade curving, and downstream modeling tasks such as peer grading. Historically, grades have been assumed to be normally distributed, and to this day the normal is the ubiquitous choice for modeling exam scores. While this is a good assumption for tests…

Descriptors: Grades (Scholastic), Scores, Statistical Distributions, Models

Lord's Equity Theorem Revisited

Peer reviewed

Direct link

van der Linden, Wim J. – Journal of Educational and Behavioral Statistics, 2019

Lord's (1980) equity theorem claims observed-score equating to be possible only when two test forms are perfectly reliable or strictly parallel. An analysis of its proof reveals use of an incorrect statistical assumption. The assumption does not invalidate the theorem itself though, which can be shown to follow directly from the discrete nature of…

Descriptors: Equated Scores, Testing Problems, Item Response Theory, Evaluation Methods

IRT-Estimated Reliability for Tests Containing Mixed Item Formats

Peer reviewed

Direct link

Shu, Lianghua; Schwarz, Richard D. – Journal of Educational Measurement, 2014

As a global measure of precision, item response theory (IRT) estimated reliability is derived for four coefficients (Cronbach's a, Feldt-Raju, stratified a, and marginal reliability). Models with different underlying assumptions concerning test-part similarity are discussed. A detailed computational example is presented for the targeted…

Descriptors: Item Response Theory, Reliability, Models, Computation

Local Equating Using the Rasch Model, the OPLM, and the 2PL IRT Model--or--What Is It Anyway if the Model Captures Everything There Is to Know about the Test Takers?

Peer reviewed

Direct link

von Davier, Matthias; González B., Jorge; von Davier, Alina A. – Journal of Educational Measurement, 2013

Local equating (LE) is based on Lord's criterion of equity. It defines a family of true transformations that aim at the ideal of equitable equating. van der Linden (this issue) offers a detailed discussion of common issues in observed-score equating relative to this local approach. By assuming an underlying item response theory model, one of…

Descriptors: Equated Scores, Transformations (Mathematics), Item Response Theory, Raw Scores

A New Statistic for Evaluating Item Response Theory Models for Ordinal Data. CRESST Report 839

Download full text

Cai, Li; Monroe, Scott – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2014

We propose a new limited-information goodness of fit test statistic C[subscript 2] for ordinal IRT models. The construction of the new statistic lies formally between the M[subscript 2] statistic of Maydeu-Olivares and Joe (2006), which utilizes first and second order marginal probabilities, and the M*[subscript 2] statistic of Cai and Hansen…

Descriptors: Item Response Theory, Models, Goodness of Fit, Probability

Validities of the Signed and Unsigned Lecture Questionnaires Using the Item Response Theory

Download full text

Hirose, Hideo – Online Submission, 2011

Teachers often raise a question that whether the lecture questionnaires are necessary or not. In this paper, we first show the recent statistical analysis for the official unsigned questionnaire evaluation results took in our faculty. We have found that: (1) the evaluation scores of lectures by students have been rising up year by year, which…

Descriptors: Item Response Theory, Questionnaires, Statistical Analysis, Course Evaluation

Linking Item Parameters to a Base Scale. ACT Research Report Series, 2009-2

Download full text

Kang, Taehoon; Petersen, Nancy S. – ACT, Inc., 2009

This paper compares three methods of item calibration--concurrent calibration, separate calibration with linking, and fixed item parameter calibration--that are frequently used for linking item parameters to a base scale. Concurrent and separate calibrations were implemented using BILOG-MG. The Stocking and Lord (1983) characteristic curve method…

Descriptors: Standards, Testing Programs, Test Items, Statistical Distributions

Stochastic Order in Dichotomous Item Response Models for Fixed, Adaptive, and Multidimensional Tests.

Peer reviewed

van der Linden, Wim J. – Psychometrika, 1998

Dichotomous item response theory (IRT) models can be viewed as families of stochastically ordered distributions of responses to test items. This paper explores several properties of such distributions, especially those related to transfer to other distributions. Results are formulated as a series of theorems and corollaries that apply to…

Descriptors: Item Response Theory, Responses, Statistical Distributions, Test Items

The Distribution of Person Fit Using True and Estimated Person Parameters.

Peer reviewed

Nering, Michael L. – Applied Psychological Measurement, 1995

A person-fit method that allows researchers to identify nonfitting response vectors is the l(z) statistic. Simulation results show that l(z) may not perform as expected when estimated person parameters are used rather than true person parameters. Other considerations in using true and estimated person parameters are discussed. (SLD)

Descriptors: Estimation (Mathematics), Item Response Theory, Research Methodology, Responses

Test Equating Procedures: A Primer on the Logic and Applications of Test Equating.

Download full text

Buras, Avery – 1996

The logic and uses of test equating are discussed, including three methods of test equating. The focus is on the conceptual underpinnings of each test equating method, rather than on the mathematics of the procedures. Additional consideration is given to the assumptions of each method and its respective strengths and weaknesses. A commonly…

Descriptors: Equated Scores, Item Response Theory, Models, Raw Scores

Rasch Measurement Theory, the Method of Paired Comparisons, and Graph Theory.

Download full text

Garner, Mary; Engelhard, George, Jr. – 1997

This paper considers the following questions: (1) what is the relationship between the method of paired comparisons and Rasch measurement theory? (2) what is the relationship between the method of paired comparisons and graph theory? and (3) what can graph theory contribute to the understanding of Rasch measurement theory? It is specifically shown…

Descriptors: Comparative Analysis, Estimation (Mathematics), Graphs, Item Response Theory

An Investigation of the Sampling Distributions of Equating Coefficients.

Peer reviewed

Baker, Frank B. – Applied Psychological Measurement, 1996

Using the characteristic curve method for dichotomously scored test items, the sampling distributions of equating coefficients were examined. Simulations indicate that for the equating conditions studied, the sampling distributions of the equating coefficients appear to have acceptable characteristics, suggesting confidence in the values obtained…

Descriptors: Equated Scores, Item Response Theory, Sampling, Statistical Distributions

Approximating the Conditional Distribution of Person Fit Indexes for Checking the Rasch Model.

Peer reviewed

Bedrick, Edward J. – Psychometrika, 1997

A simple approximation to the conditional distribution of goodness-of-fit statistics for the Rasch model is presented that is used when item difficulties are known. The approximation, which is easily programmed, gives relatively accurate assessments of conditional p-values for tests of 10 or more items. (Author/SLD)

Descriptors: Difficulty Level, Goodness of Fit, Item Response Theory, Statistical Distributions

Detecting Differential Item Functioning with Five Standardized Item-Fit Indices in the Rasch Model.

Peer reviewed

Seol, Hyunsoo – Journal of Outcome Measurement, 1999

Examined five Rasch-model-based item-fit indices in terms of their distributional properties and the power of detecting item bias or differential item functioning. Results indicate that, although these five standardized item-fit indices did not depart significantly from a normal distribution, the Type I error rates were not reasonable. (Author/SLD)

Descriptors: Goodness of Fit, Item Bias, Item Response Theory, Statistical Distributions

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4

van der Linden, Wim J.	5
Baker, Frank B.	2
Dodd, Barbara G.	2
Harwell, Michael R.	2
Luecht, Richard M.	2
Nering, Michael L.	2
Schumacker, Randall E.	2
Smith, Richard M.	2
Tate, Richard L.	2
van Krimpen-Stoop, Edith M.…	2
Arthurs, Noah	1
Batley, Rose-Marie	1
Bedrick, Edward J.	1
Boss, Marvin W.	1
Buras, Avery	1
Bush, M. Joan	1
Cai, Li	1
Camilli, Gregory	1
Engelhard, George, Jr.	1
Garner, Mary	1
González B., Jorge	1
Hirose, Hideo	1
Ito, Kyoko	1
Janosky, Janine E.	1
More ▼