Matlock, Ki Lynn; Turner, Ronna – Educational and Psychological Measurement, 2016
When constructing multiple test forms, the number of items and the total test difficulty are often equivalent. Not all test developers match the number of items and/or average item difficulty within subcontent areas. In this simulation study, six test forms were constructed having an equal number of items and average item difficulty overall.…
Descriptors: Item Response Theory, Computation, Test Items, Difficulty Level
Nelson, Gena; Powell, Sarah R. – Assessment for Effective Intervention, 2018
Though proficiency with computation is highly emphasized in national mathematics standards, students with mathematics difficulty (MD) continue to struggle with computation. To learn more about the differences in computation error patterns between typically achieving students and students with MD, we assessed 478 third-grade students on a measure…
Descriptors: Computation, Mathematics Instruction, Learning Problems, Mathematics Skills
Martin-Fernandez, Manuel; Revuelta, Javier – Psicologica: International Journal of Methodology and Experimental Psychology, 2017
This study compares the performance of two estimation algorithms of new usage, the Metropolis-Hastings Robins-Monro (MHRM) and the Hamiltonian MCMC (HMC), with two consolidated algorithms in the psychometric literature, the marginal likelihood via EM algorithm (MML-EM) and the Markov chain Monte Carlo (MCMC), in the estimation of multidimensional…
Descriptors: Bayesian Statistics, Item Response Theory, Models, Comparative Analysis
Nelson, Gena; Powell, Sarah R. – Grantee Submission, 2017
Though proficiency with computation is highly emphasized in national mathematics standards, students with mathematics difficulty (MD) continue to struggle with computation. To learn more about the differences in computation error patterns between typically achieving students and students with MD, we assessed 478 3rd-grade students on a measure of…
Descriptors: Computation, Mathematics Instruction, Learning Problems, Mathematics Skills
Kim, Sooyeon; Moses, Tim; Yoo, Hanwook – Journal of Educational Measurement, 2015
This inquiry is an investigation of item response theory (IRT) proficiency estimators' accuracy under multistage testing (MST). We chose a two-stage MST design that includes four modules (one at Stage 1, three at Stage 2) and three difficulty paths (low, middle, high). We assembled various two-stage MST panels (i.e., forms) by manipulating two…
Descriptors: Comparative Analysis, Item Response Theory, Computation, Accuracy
Moothedath, Shana; Chaporkar, Prasanna; Belur, Madhu N. – Perspectives in Education, 2016
In recent years, the computerised adaptive test (CAT) has gained popularity over conventional exams in evaluating student capabilities with desired accuracy. However, the key limitation of CAT is that it requires a large pool of pre-calibrated questions. In the absence of such a pre-calibrated question bank, offline exams with uncalibrated…
Descriptors: Guessing (Tests), Computer Assisted Testing, Adaptive Testing, Maximum Likelihood Statistics
Choi, In-Hee; Wilson, Mark – Educational and Psychological Measurement, 2015
An essential feature of the linear logistic test model (LLTM) is that item difficulties are explained using item design properties. By taking advantage of this explanatory aspect of the LLTM, in a mixture extension of the LLTM, the meaning of latent classes is specified by how item properties affect item difficulties within each class. To improve…
Descriptors: Classification, Test Items, Difficulty Level, Statistical Analysis
Frick, Hannah; Strobl, Carolin; Zeileis, Achim – Educational and Psychological Measurement, 2015
Rasch mixture models can be a useful tool when checking the assumption of measurement invariance for a single Rasch model. They provide advantages compared to manifest differential item functioning (DIF) tests when the DIF groups are only weakly correlated with the manifest covariates available. Unlike in single Rasch models, estimation of Rasch…
Descriptors: Item Response Theory, Test Bias, Comparative Analysis, Scores
Kim, Sooyeon; Moses, Tim; Yoo, Hanwook Henry – ETS Research Report Series, 2015
The purpose of this inquiry was to investigate the effectiveness of item response theory (IRT) proficiency estimators in terms of estimation bias and error under multistage testing (MST). We chose a 2-stage MST design in which 1 adaptation to the examinees' ability levels takes place. It includes 4 modules (1 at Stage 1, 3 at Stage 2) and 3 paths…
Descriptors: Item Response Theory, Computation, Statistical Bias, Error of Measurement
Paek, Insu; Cai, Li – Educational and Psychological Measurement, 2014
The present study was motivated by the recognition that standard errors (SEs) of item response theory (IRT) model parameters are often of immediate interest to practitioners and that there is currently a lack of comparative research on different SE (or error variance-covariance matrix) estimation procedures. The present study investigated item…
Descriptors: Item Response Theory, Comparative Analysis, Error of Measurement, Computation
Pohl, Steffi – Journal of Educational Measurement, 2013
This article introduces longitudinal multistage testing (lMST), a special form of multistage testing (MST), as a method for adaptive testing in longitudinal large-scale studies. In lMST designs, test forms of different difficulty levels are used, whereas the values on a pretest determine the routing to these test forms. Since lMST allows for…
Descriptors: Adaptive Testing, Longitudinal Studies, Difficulty Level, Comparative Analysis
Sorensen, Henry L. – ProQuest LLC, 2013
Cut-score setting processes are used to establish the passing standards for all kinds of tests in education and for credentialing. While experts use their best efforts to guide cut-score setting processes to generate valid and reliable results, cut-score participants often have a difficult time understanding the standard at which the cut score is…
Descriptors: Cutting Scores, Standard Setting (Scoring), Comparative Analysis, Difficulty Level
Wu, Yi-Fang – ProQuest LLC, 2015
Item response theory (IRT) uses a family of statistical models for estimating stable characteristics of items and examinees and defining how these characteristics interact in describing item and test performance. With a focus on the three-parameter logistic IRT (Birnbaum, 1968; Lord, 1980) model, the current study examines the accuracy and…
Descriptors: Item Response Theory, Test Items, Accuracy, Computation
Schroeders, Ulrich; Robitzsch, Alexander; Schipolowski, Stefan – Journal of Educational Measurement, 2014
C-tests are a specific variant of cloze tests that are considered time-efficient, valid indicators of general language proficiency. They are commonly analyzed with models of item response theory assuming local item independence. In this article we estimated local interdependencies for 12 C-tests and compared the changes in item difficulties,…
Descriptors: Comparative Analysis, Psychometrics, Cloze Procedure, Language Tests
Özyurt, Hacer; Özyurt, Özcan – Eurasian Journal of Educational Research, 2015
Problem Statement: Learning-teaching activities bring along the need to determine whether they achieve their goals. Thus, multiple choice tests addressing the same set of questions to all are frequently used. However, this traditional assessment and evaluation form contrasts with modern education, where individual learning characteristics are…
Descriptors: Probability, Adaptive Testing, Computer Assisted Testing, Item Response Theory