Showing 1 to 15 of 20 results
Peer reviewed
PDF on ERIC Download full text
Ghio, Fernanda Belén; Bruzzone, Manuel; Rojas-Torres, Luis; Cupani, Marcos – European Journal of Science and Mathematics Education, 2022
In recent decades, the development of computerized adaptive testing (CAT) has allowed more precise measurement with a smaller number of items. In this study, we develop an item bank (IB) to drive the adaptive algorithm and simulate the functioning of a CAT designed to assess domains of mathematical knowledge in Argentinian university students…
Descriptors: Test Items, Item Banks, Adaptive Testing, Mathematics Tests
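To make the CAT mechanics this abstract refers to concrete, the following is a minimal sketch, assuming a 2PL item response model and a hypothetical simulated item bank, of maximum-information item selection; it is not the authors' implementation, and their algorithm and bank may differ in detail.

import numpy as np

def p_2pl(theta, a, b):
    # Probability of a correct response under the 2PL model.
    return 1.0 / (1.0 + np.exp(-a * (theta - b)))

def item_information(theta, a, b):
    # Fisher information of each 2PL item at ability theta.
    p = p_2pl(theta, a, b)
    return a**2 * p * (1.0 - p)

def select_next_item(theta_hat, a, b, administered):
    # Pick the unadministered item with maximum information at theta_hat.
    info = item_information(theta_hat, a, b)
    info[list(administered)] = -np.inf  # mask items already given
    return int(np.argmax(info))

# Usage with a small simulated bank (discriminations a, difficulties b; illustrative only).
rng = np.random.default_rng(0)
a = rng.uniform(0.8, 2.0, size=50)
b = rng.normal(0.0, 1.0, size=50)
print("next item:", select_next_item(theta_hat=0.3, a=a, b=b, administered={4, 17}))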
Peer reviewed
Direct link
Chengyu Cui; Chun Wang; Gongjun Xu – Grantee Submission, 2024
Multidimensional item response theory (MIRT) models have generated increasing interest in the psychometrics literature. Efficient approaches for estimating MIRT models with dichotomous responses have been developed, but constructing an equally efficient and robust algorithm for polytomous models has received limited attention. To address this gap,…
Descriptors: Item Response Theory, Accuracy, Simulation, Psychometrics
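As background for the polytomous MIRT models discussed in this abstract, here is a hedged sketch of category probabilities under a multidimensional graded response model (one common polytomous MIRT formulation); the estimation algorithm the authors develop is not reproduced, and all parameter values below are illustrative.

import numpy as np

def grm_category_probs(theta, a, thresholds):
    # theta: (D,) latent trait vector; a: (D,) discriminations;
    # thresholds: (K-1,) ordered category thresholds.
    # Returns the K category probabilities P(X = 0), ..., P(X = K-1).
    z = a @ theta  # linear predictor a'theta
    cum = 1.0 / (1.0 + np.exp(-(z - np.asarray(thresholds))))  # P(X >= k) for k = 1..K-1
    cum = np.concatenate(([1.0], cum, [0.0]))  # add the k = 0 and k = K boundaries
    return cum[:-1] - cum[1:]  # adjacent differences give category probabilities

probs = grm_category_probs(theta=np.array([0.5, -0.2]),
                           a=np.array([1.2, 0.8]),
                           thresholds=[-1.0, 0.0, 1.5])
print(probs, probs.sum())  # the four probabilities sum to 1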
Peer reviewed
PDF on ERIC Download full text
Yurtçu, Meltem; Güzeller, Cem Oktay – International Journal of Assessment Tools in Education, 2018
This study aims to show the effect on equating error of the number of DIF items and of how those DIF items are distributed across the forms being equated. The mean-mean, mean-standard deviation, Haebara, and Stocking-Lord methods were used as equating methods under a common-item design with equivalent groups. The study included six different simulation…
Descriptors: Error Patterns, Test Items, Item Analysis, Simulation
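For readers unfamiliar with the linking methods named above, here is a hedged sketch, assuming a common-item design with 2PL-type parameters, of the mean-mean and mean-standard deviation (mean-sigma) constants that place Form X item parameters on Form Y's scale; the Haebara and Stocking-Lord methods instead minimize differences between characteristic curves and are not reproduced here.

import numpy as np

def mean_mean(a_x, a_y, b_x, b_y):
    # A from the mean discriminations of the common items, B from their mean difficulties.
    A = np.mean(a_x) / np.mean(a_y)
    B = np.mean(b_y) - A * np.mean(b_x)
    return A, B

def mean_sigma(b_x, b_y):
    # A from the standard deviations of the common-item difficulties, B from their means.
    A = np.std(b_y, ddof=1) / np.std(b_x, ddof=1)
    B = np.mean(b_y) - A * np.mean(b_x)
    return A, B

def rescale(a_x, b_x, A, B):
    # Transform Form X parameters onto Form Y's scale: b -> A*b + B, a -> a/A.
    return np.asarray(a_x) / A, A * np.asarray(b_x) + B

The constants are estimated from the common (anchor) items only and then applied to every Form X item, which is why DIF in those anchor items can distort A and B and inflate equating error.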
Peer reviewed
Direct link
Monroe, Scott – Journal of Educational and Behavioral Statistics, 2021
This research proposes a new statistic for testing latent variable distribution fit for unidimensional item response theory (IRT) models. If the typical assumption of normality is violated, then item parameter estimates will be biased, and dependent quantities such as IRT score estimates will be adversely affected. The proposed statistic compares…
Descriptors: Item Response Theory, Simulation, Scores, Comparative Analysis
Peer reviewed
Direct link
Rutkowski, David; Rutkowski, Leslie; Liaw, Yuan-Ling – Educational Measurement: Issues and Practice, 2018
Participation in international large-scale assessments has grown over time, with the largest, the Programme for International Student Assessment (PISA), including more than 70 education systems that are economically and educationally diverse. To help accommodate large achievement differences among participants, in 2009 PISA offered…
Descriptors: Educational Assessment, Foreign Countries, Achievement Tests, Secondary School Students
Peer reviewed
PDF on ERIC Download full text
Kalender, Ilker; Berberoglu, Giray – Educational Sciences: Theory and Practice, 2017
Admission into university in Turkey is very competitive and features a number of practical problems regarding not only the test administration process itself, but also concerning the psychometric properties of test scores. Computerized adaptive testing (CAT) is seen as a possible alternative approach to solve these problems. In the first phase of…
Descriptors: Foreign Countries, Computer Assisted Testing, College Admission, Simulation
Peer reviewed
PDF on ERIC Download full text
Attali, Yigal; Saldivia, Luis; Jackson, Carol; Schuppan, Fred; Wanamaker, Wilbur – ETS Research Report Series, 2014
Previous investigations of the ability of content experts and test developers to estimate item difficulty have, for the most part, produced disappointing results. These investigations were based on a noncomparative method of independently rating the difficulty of items. In this article, we argue that, by eliciting comparative judgments of…
Descriptors: Test Items, Difficulty Level, Comparative Analysis, College Entrance Examinations
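One standard way to turn comparative difficulty judgments into a scale is a Bradley-Terry-type model; the sketch below, fit by simple gradient ascent on hypothetical judgment data, illustrates that idea and is not the authors' exact procedure.

import numpy as np

def fit_pairwise_difficulty(pairs, n_items, lr=0.1, n_iter=2000):
    # pairs: list of (i, j) meaning item i was judged harder than item j.
    d = np.zeros(n_items)  # latent difficulties on an arbitrary scale
    for _ in range(n_iter):
        grad = np.zeros(n_items)
        for i, j in pairs:
            p = 1.0 / (1.0 + np.exp(-(d[i] - d[j])))  # model P(i judged harder than j)
            grad[i] += 1.0 - p  # gradient of the log-likelihood
            grad[j] -= 1.0 - p
        d += lr * grad / len(pairs)
        d -= d.mean()  # fix the location of the scale
    return d

judgments = [(2, 0), (2, 1), (1, 0), (2, 0), (1, 0)]  # hypothetical data
print(fit_pairwise_difficulty(judgments, n_items=3))  # item 2 comes out hardest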
Peer reviewed
Direct link
Jin, Ying; Kang, Minsoo – Large-scale Assessments in Education, 2016
Background: The current study compared four differential item functioning (DIF) methods to examine their performance in accounting for dual dependency (i.e., person and item clustering effects) simultaneously in a simulation study, an issue that has not been sufficiently studied in the DIF literature. The four methods compared are logistic…
Descriptors: Comparative Analysis, Test Bias, Simulation, Regression (Statistics)
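Of the methods named in this abstract, logistic regression DIF is the easiest to illustrate; the hedged sketch below runs the standard likelihood-ratio test for uniform and nonuniform DIF on simulated single-level data and does not reproduce the multilevel extensions the study examines.

import numpy as np
import statsmodels.api as sm
from scipy import stats

rng = np.random.default_rng(1)
n = 1000
group = rng.integers(0, 2, n)   # reference (0) vs focal (1) group
total = rng.normal(0, 1, n)     # matching variable, e.g., standardized total score
# Simulate an item showing uniform DIF against the focal group.
y = rng.binomial(1, 1 / (1 + np.exp(-(0.8 * total - 0.5 * group))))

X0 = sm.add_constant(np.column_stack([total]))                        # no-DIF model
X1 = sm.add_constant(np.column_stack([total, group, total * group]))  # DIF model
m0 = sm.Logit(y, X0).fit(disp=False)
m1 = sm.Logit(y, X1).fit(disp=False)

lr = 2 * (m1.llf - m0.llf)   # likelihood-ratio statistic
p = stats.chi2.sf(lr, df=2)  # two extra parameters (group, interaction)
print(f"LR = {lr:.2f}, p = {p:.4f}")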
Peer reviewed
Direct link
Bramley, Tom – Research in Mathematics Education, 2017
This study compared models of assessment structure for achieving differentiation across the range of examinee attainment in the General Certificate of Secondary Education (GCSE) examination taken by 16-year-olds in England. The focus was on the "adjacent levels" model, where papers are targeted at three specific non-overlapping ranges of…
Descriptors: Foreign Countries, Mathematics Education, Student Certification, Student Evaluation
Peer reviewed
Direct link
Wei, Hua; Lin, Jie – International Journal of Testing, 2015
Out-of-level testing refers to the practice of assessing a student with a test that is intended for students at a higher or lower grade level. Although the appropriateness of out-of-level testing for accountability purposes has been questioned by educators and policymakers, incorporating out-of-level items in formative assessments for accurate…
Descriptors: Test Items, Computer Assisted Testing, Adaptive Testing, Instructional Program Divisions
Peer reviewed
Direct link
Sachse, Karoline A.; Roppelt, Alexander; Haag, Nicole – Journal of Educational Measurement, 2016
Trend estimation in international comparative large-scale assessments relies on measurement invariance between countries. However, cross-national differential item functioning (DIF) has been repeatedly documented. We ran a simulation study using national item parameters, which required trends to be computed separately for each country, to compare…
Descriptors: Comparative Analysis, Measurement, Test Bias, Simulation
Peer reviewed
PDF on ERIC Download full text
Ali, Usama S.; Chang, Hua-Hua – ETS Research Report Series, 2014
Adaptive testing is advantageous in that it provides more efficient ability estimates with fewer items than linear testing does. Item-driven adaptive pretesting may also offer similar advantages, and verification of such a hypothesis about item calibration was the main objective of this study. A suitability index (SI) was introduced to adaptively…
Descriptors: Adaptive Testing, Simulation, Pretests Posttests, Test Items
Peer reviewed
Direct link
Svetina, Dubravka; Rutkowski, Leslie – Large-scale Assessments in Education, 2014
Background: When studying student performance across different countries or cultures, an important aspect for comparisons is that of score comparability. In other words, it is imperative that the latent variable (i.e., construct of interest) is understood and measured equivalently across all participating groups or countries, if our inferences…
Descriptors: Test Items, Item Response Theory, Item Analysis, Regression (Statistics)
Shin, Chingwei David; Chien, Yuehmei; Way, Walter Denny – Pearson, 2012
Content balancing is one of the most important components of computerized adaptive testing (CAT), especially in K-12 large-scale tests, where a complex constraint structure is required to cover a broad spectrum of content. The purpose of this study is to compare the weighted penalty model (WPM) and the weighted deviation method (WDM) under…
Descriptors: Computer Assisted Testing, Elementary Secondary Education, Test Content, Models
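To show what content balancing involves, here is a simplified, hedged sketch in the spirit of a weighted-deviation selection rule: each candidate item's information is discounted by a weighted penalty for pushing a content area past its target count. The actual WPM and WDM procedures compared in the study handle full constraint sets (including lower bounds) and differ in detail.

import numpy as np

def content_balanced_select(info, content, counts, targets, weights, administered):
    # info: item information at the current ability estimate
    # content: content-area label of each item
    # counts/targets/weights: per-area administered counts, target counts, penalty weights
    best, best_score = None, -np.inf
    for i in range(len(info)):
        if i in administered:
            continue
        area = content[i]
        overshoot = max(0, counts.get(area, 0) + 1 - targets[area])  # deviation if item i is chosen
        score = info[i] - weights[area] * overshoot
        if score > best_score:
            best, best_score = i, score
    return best

info = np.array([0.9, 1.4, 1.1, 0.7])
picked = content_balanced_select(info, content=["alg", "geo", "alg", "geo"],
                                 counts={"alg": 2}, targets={"alg": 2, "geo": 2},
                                 weights={"alg": 1.0, "geo": 1.0}, administered={0})
print("selected item:", picked)  # item 1: most informative choice that stays within content targets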
Peer reviewed
Direct link
Gattamorta, Karina A.; Penfield, Randall D.; Myers, Nicholas D. – International Journal of Testing, 2012
Measurement invariance is a common consideration in the evaluation of the validity and fairness of test scores when the tested population contains distinct groups of examinees, such as examinees receiving different forms of a translated test. Measurement invariance in polytomous items has traditionally been evaluated at the item-level,…
Descriptors: Foreign Countries, Psychometrics, Test Bias, Test Items