Search results: 35 journal articles from the Journal of Educational Measurement.
Herborn, Katharina; Mustafic, Maida; Greiff, Samuel – Journal of Educational Measurement, 2017
Collaborative problem solving (CPS) assessment is a new academic research field with a number of educational implications. In 2015, the Programme for International Student Assessment (PISA) assessed CPS with a computer-simulated human-agent (H-A) approach that claimed to measure 12 individual CPS skills for the first time. After reviewing the…
Descriptors: Cooperative Learning, Problem Solving, Computer Simulation, Evaluation Methods
Wang, Shiyu; Lin, Haiyan; Chang, Hua-Hua; Douglas, Jeff – Journal of Educational Measurement, 2016
Computerized adaptive testing (CAT) and multistage testing (MST) have become two of the most popular modes in large-scale computer-based sequential testing. Though most designs of CAT and MST exhibit strengths and weaknesses in recent large-scale implementations, there is no simple answer to the question of which design is better because different…
Descriptors: Computer Assisted Testing, Adaptive Testing, Test Format, Sequential Approach
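The heart of any CAT design is an item-selection rule. Below is a minimal sketch of the most common rule, maximum Fisher information under a 2PL model; the pool size, parameter ranges, and function names are illustrative, not taken from the article.

```python
import numpy as np

def fisher_info_2pl(theta, a, b):
    """Fisher information of each 2PL item at ability theta."""
    p = 1.0 / (1.0 + np.exp(-a * (theta - b)))
    return a ** 2 * p * (1.0 - p)

def next_item(theta_hat, a, b, administered):
    """Pick the unadministered item with maximum information at theta_hat."""
    info = fisher_info_2pl(theta_hat, a, b)
    info[list(administered)] = -np.inf   # mask items already given
    return int(np.argmax(info))

# toy pool of 20 items: discriminations a, difficulties b
rng = np.random.default_rng(0)
a = rng.uniform(0.8, 2.0, 20)
b = rng.normal(0.0, 1.0, 20)
print(next_item(0.0, a, b, administered={3, 7}))
```

An MST design would instead pre-assemble modules and route between them on the basis of stage scores, which is what makes the two designs hard to compare in general.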
Wilson, Mark; Gochyyev, Perman; Scalise, Kathleen – Journal of Educational Measurement, 2017
This article summarizes assessment of cognitive skills through collaborative tasks, using field test results from the Assessment and Teaching of 21st Century Skills (ATC21S) project. This project, sponsored by Cisco, Intel, and Microsoft, aims to help educators around the world equip students with the skills to succeed in future career and…
Descriptors: Cognitive Ability, Thinking Skills, Evaluation Methods, Educational Assessment
Zimmerman, Donald W. – Journal of Educational Measurement, 2009
This study was an investigation of the relation between the reliability of difference scores, considered as a parameter characterizing a population of examinees, and the reliability estimates obtained from random samples from the population. The parameters in familiar equations for the reliability of difference scores were redefined in such a way…
Descriptors: Computer Simulation, Reliability, Population Groups, Scores
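The excerpt does not reproduce Zimmerman's redefined parameters, but the classical-test-theory formula the study builds on is standard and can be sketched directly; the function and variable names below are illustrative.

```python
def difference_score_reliability(sx, sy, rx, ry, rxy):
    """Classical reliability of D = X - Y from the component standard
    deviations (sx, sy), reliabilities (rx, ry), and the observed-score
    correlation rxy."""
    num = sx ** 2 * rx + sy ** 2 * ry - 2 * rxy * sx * sy
    den = sx ** 2 + sy ** 2 - 2 * rxy * sx * sy
    return num / den

# equal SDs, reliabilities .80, correlation .60 -> reliability .50
print(difference_score_reliability(1.0, 1.0, 0.80, 0.80, 0.60))
```

The formula makes visible why difference scores are often unreliable: the more highly correlated the two measures, the smaller the denominator's advantage over the numerator.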
Kim, Seonghoon; Feldt, Leonard S. – Journal of Educational Measurement, 2008
This article extends the Bonett (2003a) approach to testing the equality of alpha coefficients from two independent samples to the case of m ≥ 2 independent samples. The extended Fisher-Bonett test and its competitor, the Hakstian-Whalen (1976) test, are illustrated with numerical examples of both hypothesis testing and power…
Descriptors: Tests, Comparative Analysis, Hypothesis Testing, Error of Measurement
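One plausible way such an m-sample test can be assembled is sketched below, assuming Bonett's log-complement transform ln(1 − α̂) with approximate variance 2k/((k − 1)(n − 2)) for a k-item test and n examinees; this is a reconstruction of the general machinery, not the article's exact statistic.

```python
import numpy as np
from scipy.stats import chi2

def alpha_equality_test(alphas, ns, ks):
    """Chi-square test of H0: all m population alphas are equal, using
    y = ln(1 - alpha_hat), taken here to have approximate variance
    2k / ((k - 1)(n - 2))."""
    alphas, ns, ks = (np.asarray(x, float) for x in (alphas, ns, ks))
    y = np.log(1.0 - alphas)
    w = (ks - 1) * (ns - 2) / (2.0 * ks)      # inverse variances
    ybar = np.sum(w * y) / np.sum(w)          # precision-weighted mean
    stat = np.sum(w * (y - ybar) ** 2)        # ~ chi2(m - 1) under H0
    return stat, chi2.sf(stat, len(alphas) - 1)

stat, p = alpha_equality_test([0.85, 0.78, 0.81], ns=[120, 150, 100], ks=[10, 10, 10])
print(f"chi2 = {stat:.2f}, p = {p:.3f}")
```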
Briggs, Derek C.; Wilson, Mark – Journal of Educational Measurement, 2007
An approach called generalizability in item response modeling (GIRM) is introduced in this article. The GIRM approach essentially incorporates the sampling model of generalizability theory (GT) into the scaling model of item response theory (IRT) by making distributional assumptions about the relevant measurement facets. By specifying a random…
Descriptors: Markov Processes, Generalizability Theory, Item Response Theory, Computation
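A minimal sketch of the flavor of GIRM, assuming PyMC for the MCMC estimation the approach relies on: an IRT scaling model whose item difficulties are treated as a random facet with its own variance component, GT-style. This is an illustration of the idea, not the authors' full specification.

```python
import numpy as np
import pymc as pm

# toy data: 100 persons x 15 items, simulated from a Rasch model
rng = np.random.default_rng(1)
theta_true = rng.normal(0, 1, 100)
b_true = rng.normal(0, 1, 15)
p_true = 1 / (1 + np.exp(-(theta_true[:, None] - b_true[None, :])))
y = (rng.random((100, 15)) < p_true).astype(int)

with pm.Model():
    sigma_b = pm.HalfNormal("sigma_b", 1.0)          # item facet variance component
    theta = pm.Normal("theta", 0.0, 1.0, shape=100)  # person abilities
    b = pm.Normal("b", 0.0, sigma_b, shape=15)       # random item difficulties
    pm.Bernoulli("y", p=pm.math.invlogit(theta[:, None] - b[None, :]), observed=y)
    trace = pm.sample(1000, tune=1000, chains=2)     # MCMC estimation
```

The GT-style move is that items are draws from a distribution with estimated variance sigma_b, so variance components and IRT parameters come out of one estimation.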

Oshima, Takako C.; Miller, M. David – Journal of Educational Measurement, 1990
A bidimensional 2-parameter logistic model was applied to data generated for 2 groups on a 40-item test. Item parameters were the same across groups; the correlation between the 2 traits varied. Results indicate the need for caution in using item-response theory (IRT)-based invariance indexes with multidimensional data for these groups. (TJH)
Descriptors: Computer Simulation, Correlation, Discriminant Analysis, Item Response Theory
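A data-generation sketch in the spirit of this design: a compensatory two-dimensional 2PL with identical item parameters in both groups and a group-specific trait correlation. The parameter ranges are illustrative, not the article's.

```python
import numpy as np

rng = np.random.default_rng(2)
n, items = 1000, 40

# same item parameters for both groups: two discriminations and an intercept
a = rng.uniform(0.5, 1.5, (items, 2))
d = rng.normal(0.0, 1.0, items)

def simulate_group(rho):
    """Draw abilities on two correlated traits, then M2PL responses."""
    cov = np.array([[1.0, rho], [rho, 1.0]])
    theta = rng.multivariate_normal([0, 0], cov, size=n)
    logits = theta @ a.T - d               # compensatory M2PL
    p = 1.0 / (1.0 + np.exp(-logits))
    return (rng.random((n, items)) < p).astype(int)

group1 = simulate_group(rho=0.3)   # trait correlation varies by group
group2 = simulate_group(rho=0.7)
```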

French, Ann W.; Miller, Timothy R. – Journal of Educational Measurement, 1996
A computer simulation study was conducted to determine the feasibility of using logistic regression procedures to detect differential item functioning (DIF) in polytomous items. Results indicate that logistic regression is powerful in detecting most forms of DIF, although it requires large amounts of data manipulation and careful interpretation.…
Descriptors: Computer Simulation, Identification, Item Bias, Test Interpretation
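The standard logistic-regression DIF procedure compares nested models in total score, group, and their interaction. The article's contribution is the polytomous case; the familiar dichotomous version below (assuming statsmodels) shows the machinery.

```python
import numpy as np
import statsmodels.api as sm
from scipy.stats import chi2

def lr_dif_test(item, total, group):
    """Likelihood-ratio DIF test for one dichotomous item: compare
    logit(item ~ total) with logit(item ~ total + group + total:group)."""
    x0 = sm.add_constant(total)
    x1 = sm.add_constant(np.column_stack([total, group, total * group]))
    m0 = sm.Logit(item, x0).fit(disp=0)
    m1 = sm.Logit(item, x1).fit(disp=0)
    g2 = 2 * (m1.llf - m0.llf)        # ~ chi2(2): uniform + nonuniform DIF
    return g2, chi2.sf(g2, df=2)

# toy data with uniform DIF built in for one group
rng = np.random.default_rng(3)
total = rng.integers(0, 41, 500).astype(float)
group = rng.integers(0, 2, 500).astype(float)
p = 1 / (1 + np.exp(-(0.15 * total - 3.0 + 0.5 * group)))
item = (rng.random(500) < p).astype(float)
print(lr_dif_test(item, total, group))
```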

Wilcox, Rand R. – Journal of Educational Measurement, 1987
Four procedures are discussed for obtaining a confidence interval when answer-until-correct scoring is used in multiple choice tests. Simulated data show that the choice of procedure depends upon sample size. (GDC)
Descriptors: Computer Simulation, Multiple Choice Tests, Sample Size, Scoring
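Wilcox's four procedures are not spelled out in the excerpt. For illustration only, here is a standard Wilson score interval for a binomial proportion, the kind of interval whose small-sample coverage such simulation comparisons examine; it is not presented as one of the article's four.

```python
import numpy as np
from scipy.stats import norm

def wilson_ci(successes, n, level=0.95):
    """Wilson score confidence interval for a binomial proportion."""
    z = norm.ppf(0.5 + level / 2)
    phat = successes / n
    center = (phat + z ** 2 / (2 * n)) / (1 + z ** 2 / n)
    half = (z / (1 + z ** 2 / n)) * np.sqrt(phat * (1 - phat) / n + z ** 2 / (4 * n ** 2))
    return center - half, center + half

print(wilson_ci(32, 40))  # e.g., first-attempt-correct count out of 40 items
```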

Oshima, T. C. – Journal of Educational Measurement, 1994
The effect of violating the assumption of nonspeededness on ability and item parameter estimates in item response theory was studied through simulation under three speededness conditions. Results indicate that ability estimation was least affected by speededness but that substantial effects on item parameter estimates were found. (SLD)
Descriptors: Ability, Computer Simulation, Estimation (Mathematics), Item Response Theory
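The excerpt does not detail the three speededness conditions. One common way to impose speededness on simulated 2PL data is to have a fraction of examinees respond at chance to end-of-test items, as sketched below; all rates here are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(4)
n, items = 2000, 40
theta = rng.normal(0, 1, n)
a = rng.uniform(0.8, 1.6, items)
b = rng.normal(0, 1, items)

# unspeeded 2PL responses
p = 1 / (1 + np.exp(-a * (theta[:, None] - b)))
resp = (rng.random((n, items)) < p).astype(float)

# impose speededness: 30% of examinees answer the last quarter of the
# test at chance level (1 in 4) instead of according to ability
cut = int(0.75 * items)
speeded = rng.random(n) < 0.30
resp[speeded, cut:] = (rng.random((int(speeded.sum()), items - cut)) < 0.25).astype(float)
```

Calibrating such data with and without the speeded block is what reveals the distortion in the end-of-test item parameter estimates.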

Frary, Robert B. – Journal of Educational Measurement, 1989
Responses to a 50-item, 4-choice test were simulated for 1,000 examinees under conventional formula-scoring instructions. Based on 192 simulation runs, formula scores and expected formula scores were determined for each examinee, both allowing and not allowing for inappropriate omissions. (TJH)
Descriptors: Computer Simulation, Difficulty Level, Guessing (Tests), Multiple Choice Tests
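A sketch of one such simulation run, assuming examinees answer known items, omit unknown items half the time, and guess blindly on the rest; the formula score is the usual R − W/(k − 1). The behavioral rates are illustrative assumptions, not the article's.

```python
import numpy as np

rng = np.random.default_rng(5)
n, items, k = 1000, 50, 4               # as in the article: 50 items, 4 choices
p_know = rng.beta(4, 3, n)              # each examinee's chance of knowing an item
knows = rng.random((n, items)) < p_know[:, None]

# under formula-scoring instructions: answer known items, omit some
# unknown items, guess (1/k chance of success) on the rest
omits = ~knows & (rng.random((n, items)) < 0.5)
lucky = ~knows & ~omits & (rng.random((n, items)) < 1 / k)

right = knows.sum(1) + lucky.sum(1)
wrong = (~knows & ~omits & ~lucky).sum(1)
formula_score = right - wrong / (k - 1)  # FS = R - W/(k - 1)
```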

Parshall, Cynthia G.; Miller, Timothy R. – Journal of Educational Measurement, 1995
Exact testing was evaluated as a method for conducting Mantel-Haenszel differential item functioning (DIF) analyses with relatively small samples. A series of computer simulations found that the asymptotic Mantel-Haenszel and the exact method yielded very similar results across sample size, levels of DIF, and data sets. (SLD)
Descriptors: Comparative Analysis, Computer Simulation, Identification, Item Bias
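The asymptotic side of this comparison is the familiar Mantel-Haenszel chi-square accumulated over score strata, sketched below; the exact conditional method is not reproduced here. The toy strata are illustrative.

```python
import numpy as np
from scipy.stats import chi2

def mantel_haenszel_chi2(tables):
    """Asymptotic MH chi-square (continuity-corrected) over K score
    strata; each table is [[ref right, ref wrong],
                           [focal right, focal wrong]]."""
    t = np.asarray(tables, float)              # shape (K, 2, 2)
    a = t[:, 0, 0]                             # reference-group rights
    n_ref, n_foc = t[:, 0, :].sum(1), t[:, 1, :].sum(1)
    m_right, m_wrong = t[:, :, 0].sum(1), t[:, :, 1].sum(1)
    ntot = n_ref + n_foc
    e = n_ref * m_right / ntot                 # E(a) per stratum
    v = n_ref * n_foc * m_right * m_wrong / (ntot ** 2 * (ntot - 1))
    stat = (abs(a.sum() - e.sum()) - 0.5) ** 2 / v.sum()
    return stat, chi2.sf(stat, df=1)

strata = [[[30, 10], [22, 18]], [[40, 5], [33, 12]], [[25, 15], [20, 20]]]
print(mantel_haenszel_chi2(strata))
```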
Kamata, Akihito; Tate, Richard – Journal of Educational Measurement, 2005
The goal of this study was the development of a procedure to predict the equating error associated with the long-term equating method of Tate (2003) for mixed-format tests. An expression for the error of an equating based on multiple links, in terms of the errors of the component links, was derived and illustrated with simulated data.…
Descriptors: Computer Simulation, Item Response Theory, Test Format, Evaluation Methods
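The excerpt does not give the derived expression. The baseline case, assuming independent link errors so that variances add along the chain, fits in two lines; the article's expression may well be more elaborate than this.

```python
import numpy as np

def chain_equating_se(link_ses):
    """SE of an equating built from several links, assuming independent
    link errors so that variances add along the chain."""
    return float(np.sqrt(np.sum(np.square(link_ses))))

print(chain_equating_se([0.10, 0.08, 0.12]))   # three-link chain
```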

Reise, Steve P.; Yu, Jiayuan – Journal of Educational Measurement, 1990
Parameter recovery in the graded-response model was investigated using the MULTILOG computer program under default conditions. Results from 36 simulated data sets suggest that at least 500 examinees are needed to achieve adequate calibration under the graded model. Sample size had little influence on the true ability parameter's recovery. (SLD)
Descriptors: Computer Assisted Testing, Computer Simulation, Computer Software, Estimation (Mathematics)
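A sketch of the data-generating side of such a recovery study: responses under Samejima's graded response model for 500 examinees, the suggested minimum. Parameter ranges are illustrative, and the MULTILOG calibration step itself is not reproduced.

```python
import numpy as np

rng = np.random.default_rng(6)
n, items, cats = 500, 20, 5                     # 500 examinees: the suggested minimum
theta = rng.normal(0, 1, n)
a = rng.uniform(1.0, 2.5, items)
b = np.sort(rng.normal(0, 1, (items, cats - 1)), axis=1)   # ordered thresholds

# Samejima GRM: P(X >= c) = logistic(a * (theta - b_c)); category
# probabilities are differences of adjacent cumulative curves
pstar = 1 / (1 + np.exp(-a[None, :, None] * (theta[:, None, None] - b[None, :, :])))
cum = np.concatenate([np.ones((n, items, 1)), pstar, np.zeros((n, items, 1))], axis=2)
probs = cum[:, :, :-1] - cum[:, :, 1:]          # P(X = c), c = 0..cats-1

u = rng.random((n, items, 1))
responses = (u > np.cumsum(probs, axis=2)).sum(axis=2)   # graded scores 0..cats-1
```

Recalibrating such data and comparing estimates to the generating a and b values is the parameter-recovery design the abstract describes.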

Andrich, David – Journal of Educational Measurement, 1989
The distinction between deterministic and statistical reasoning in the application of models to educational measurement is explicated. Issues addressed include the relationship between data and estimation equations, distinction between parameters and parameter estimates, and power of tests of fit of responses across the ability continuum. (TJH)
Descriptors: Computer Simulation, Equations (Mathematics), Estimation (Mathematics), Goodness of Fit