ERIC - Search Results

Publication Date

In 2025	0
Since 2024	2
Since 2021 (last 5 years)	4
Since 2016 (last 10 years)	13
Since 2006 (last 20 years)	30

Descriptor

Error of Measurement	50
Test Construction	50
Test Items	50
Item Response Theory	22
Difficulty Level	16
Item Analysis	14
Test Validity	14
Test Reliability	13
Test Format	10
Computer Assisted Testing	9
Adaptive Testing	8
Comparative Analysis	8
Goodness of Fit	8
Scores	8
Statistical Analysis	8
Equated Scores	7
Item Banks	7
Simulation	7
Achievement Tests	6
Mathematical Models	6
Mathematics Tests	6
Measurement Techniques	6
Psychometrics	6
Student Evaluation	6
Computation	5

Publication Type

Reports - Research	31
Journal Articles	21
Reports - Evaluative	10
Speeches/Meeting Papers	8
Reports - Descriptive	6
Numerical/Quantitative Data	5
Dissertations/Theses -…	3
Tests/Questionnaires	3
Information Analyses	1

Education Level

Elementary Education	7
Grade 3	4
Higher Education	4
Middle Schools	4
Postsecondary Education	4
Early Childhood Education	3
Elementary Secondary Education	3
Grade 2	3
Grade 4	3
Junior High Schools	3
Primary Education	3
Secondary Education	3
Grade 5	2
Grade 6	2
Grade 7	2
Grade 8	2
Grade 1	1
Grade 9	1
Intermediate Grades	1
Kindergarten	1

Audience

Researchers

Location

Canada	2
New Mexico	2
Colorado (Boulder)	1
Florida	1
Japan	1
Maine	1
Portugal	1

Laws, Policies, & Programs

No Child Left Behind Act 2001	1
Race to the Top	1

Assessments and Surveys

SAT (College Admission Test)	2
ACT Assessment	1
Graduate Management Admission…	1
Measures of Academic Progress	1
National Assessment of…	1
Test of English for…	1

Showing 1 to 15 of 50 results

Detecting Multidimensional DIF in Polytomous Items with IRT Methods and Estimation Approaches

Peer reviewed

Güler Yavuz Temel – Journal of Educational Measurement, 2024

The purpose of this study was to investigate multidimensional DIF with a simple and nonsimple structure in the context of multidimensional Graded Response Model (MGRM). This study examined and compared the performance of the IRT-LR and Wald test using MML-EM and MHRM estimation approaches with different test factors and test structures in…

Descriptors: Computation, Multidimensional Scaling, Item Response Theory, Models

Addressing Current Methodological Challenges in ILSA's Transition to Adaptive Testing

Montserrat Beatriz Valdivia Medinaceli – ProQuest LLC, 2023

My dissertation examines three current challenges of international large-scale assessments (ILSAs) associated with the transition from linear testing to an adaptive testing design. ILSAs are important for making comparisons among populations and informing countries about the quality of their educational systems. ILSA's results inform policymakers…

Descriptors: International Assessment, Achievement Tests, Adaptive Testing, Test Items

Response Styles in Multiscale Measures

Zebing Wu – ProQuest LLC, 2024

Response style, one common aberrancy in non-cognitive assessments in psychological fields, is problematic in terms of inaccurate estimation of item and person parameters, which leads to serious reliability, validity, and fairness issues (Baumgartner & Steenkamp, 2001; Bolt & Johnson, 2009; Bolt & Newton, 2011). Response style refers to…

Descriptors: Response Style (Tests), Accuracy, Preferences, Psychological Testing

Measuring Language Ability of Students with Compensatory Multidimensional CAT: A Post-Hoc Simulation Study

Peer reviewed

Ozdemir, Burhanettin; Gelbal, Selahattin – Education and Information Technologies, 2022

The computerized adaptive tests (CAT) apply an adaptive process in which the items are tailored to individuals' ability scores. The multidimensional CAT (MCAT) designs differ in terms of different item selection, ability estimation, and termination methods being used. This study aims at investigating the performance of the MCAT designs used to…

Descriptors: Scores, Computer Assisted Testing, Test Items, Language Proficiency

Analyzing Different Module Characteristics in Computer Adaptive Multistage Testing

Peer reviewed

Sahin, Melek Gulsah – International Journal of Assessment Tools in Education, 2020

Computer Adaptive Multistage Testing (ca-MST), which takes advantage of computer technology and an adaptive test form, is widely used and is now a popular topic in assessment and evaluation. This study aims at analyzing the effect of different panel designs, module lengths, and different sequence of a parameter value across stages and change in…

Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Item Response Theory

Different Methods of Adjusting for Form Difficulty under the Rasch Model: Impact on Consistency of Assessment Results. Research Report. ETS RR-19-08

Peer reviewed

Manna, Venessa F.; Gu, Lixiong – ETS Research Report Series, 2019

When using the Rasch model, equating with a nonequivalent groups anchor test design is commonly achieved by adjustment of new form item difficulty using an additive equating constant. Using simulated 5-year data, this report compares 4 approaches to calculating the equating constants and the subsequent impact on equating results. The 4 approaches…

Descriptors: Item Response Theory, Test Items, Test Construction, Sample Size

FIPC Linking across Multidimensional Test Forms: Effects of Confounding Difficulty within Dimensions

Peer reviewed

Kim, Sohee; Cole, Ki Lynn; Mwavita, Mwarumba – International Journal of Testing, 2018

This study investigated the effects of linking potentially multidimensional test forms using the fixed item parameter calibration. Forms had equal or unequal total test difficulty with and without confounding difficulty. The mean square errors and bias of estimated item and ability parameters were compared across the various confounding tests. The…

Descriptors: Test Items, Item Response Theory, Test Format, Difficulty Level

Development and Initial Field Test of the 2016 K-TEEM (Knowledge for Teaching Early Elementary Mathematics) Test. Research Report No. 2019-01

Schoen, Robert C.; Yang, Xiaotong; Tazaz, Amanda M.; Bray, Wendy S.; Farina, Kristy – Grantee Submission, 2019

The "2016 Knowledge for Teaching Early Elementary Mathematics" (2016 K-TEEM) test measures teachers' mathematical knowledge for teaching early elementary mathematics. The 2016 K-TEEM is the third version of the K-TEEM (Schoen, Bray, Wolfe, Tazaz, & Nielsen, 2017). In this report, we present results of the first large-scale field test…

Descriptors: Test Construction, Elementary School Mathematics, Elementary School Teachers, Knowledge Base for Teaching

Item Response Theory: An Introduction to Latent Trait Models to Test and Item Development

Peer reviewed

Bichi, Ado Abdu; Talib, Rohaya – International Journal of Evaluation and Research in Education, 2018

Testing in an educational system performs a number of functions; the results from a test can be used to make a number of decisions in education. It is therefore well accepted in the education literature that testing is an important element of education. To effectively utilize the tests in educational policies and quality assurance its validity and…

Descriptors: Item Response Theory, Test Items, Test Construction, Decision Making

An Information-Correction Method for Testlet-Based Test Analysis: From the Perspectives of Item Response Theory and Generalizability Theory. Research Report. ETS RR-17-27

Peer reviewed

Li, Feifei – ETS Research Report Series, 2017

An information-correction method for testlet-based tests is introduced. This method takes advantage of both generalizability theory (GT) and item response theory (IRT). The measurement error for the examinee proficiency parameter is often underestimated when a unidimensional conditional-independence IRT model is specified for a testlet dataset. By…

Descriptors: Item Response Theory, Generalizability Theory, Tests, Error of Measurement

The Common Instrument: An Assessment to Measure and Communicate Youth Science Engagement in Out-of-School Time

Peer reviewed

Noam, Gil G.; Allen, Patricia J.; Sonnert, Gerhard; Sadler, Philip M. – International Journal of Science Education, Part B: Communication and Public Engagement, 2020

There has been a growing need felt by practitioners, researchers, and evaluators to obtain a common measure of science engagement that can be used in different out-of-school time (OST) science learning settings. We report on the development and validation of a novel 10-item self-report instrument designed to measure, communicate, and ultimately…

Descriptors: Leisure Time, Elementary School Students, Middle School Students, After School Programs

Simulation Study for Evaluating MAP® Growth™ Item Pools with Grade-Level Constraints

Li, Sylvia; Meyer, Patrick – NWEA, 2019

This simulation study examines the measurement precision, item exposure rates, and the depth of the MAP® Growth™ item pools under various grade-level restrictions. Unlike most summative assessments, MAP Growth allows examinees to see items from any grade level, regardless of the examinee's actual grade level. It does not limit the test to items…

Descriptors: Achievement Tests, Item Banks, Test Items, Instructional Program Divisions

The Effect of Anchor Test Construction on Scale Drift

Peer reviewed

Antal, Judit; Proctor, Thomas P.; Melican, Gerald J. – Applied Measurement in Education, 2014

In common-item equating the anchor block is generally built to represent a miniature form of the total test in terms of content and statistical specifications. The statistical properties frequently reflect equal mean and spread of item difficulty. Sinharay and Holland (2007) suggested that the requirement for equal spread of difficulty may be too…

Descriptors: Test Items, Equated Scores, Difficulty Level, Item Response Theory

Bad Questions: An Essay Involving Item Response Theory

Peer reviewed

Thissen, David – Journal of Educational and Behavioral Statistics, 2016

David Thissen, a professor in the Department of Psychology and Neuroscience, Quantitative Program at the University of North Carolina, has consulted and served on technical advisory committees for assessment programs that use item response theory (IRT) over the past couple decades. He has come to the conclusion that there are usually two purposes…

Descriptors: Item Response Theory, Test Construction, Testing Problems, Student Evaluation

Exploring Alternative Test Form Linking Designs with Modified Equating Sample Size and Anchor Test Length. Research Report. ETS RR-13-02

Peer reviewed

Wang, Lin; Qian, Jiahe; Lee, Yi-Hsuan – ETS Research Report Series, 2013

The purpose of this study was to evaluate the combined effects of reduced equating sample size and shortened anchor test length on item response theory (IRT)-based linking and equating results. Data from two independent operational forms of a large-scale testing program were used to establish the baseline results for evaluating the results from…

Descriptors: Test Construction, Item Response Theory, Testing Programs, Simulation

Source

Applied Measurement in…	4
ETS Research Report Series	4
Journal of Educational…	4
Behavioral Research and…	3
ProQuest LLC	3
New Mexico Public Education…	2
American Institutes for…	1
Assessment & Evaluation in…	1
College Entrance Examination…	1
Education and Information…	1
Grantee Submission	1
IEEE Transactions on Education	1
International Journal of…	1
International Journal of…	1
International Journal of…	1
International Journal of…	1
Journal of Educational and…	1
Language Teaching Research	1
NWEA	1
Partnership for Assessment of…	1

Author

Alonzo, Julie	3
Hambleton, Ronald K.	3
Tindal, Gerald	3
Haladyna, Tom	2
Liu, Kimy	2
Patience, Wayne M.	2
Reckase, Mark D.	2
Roid, Gale	2
Allen, Patricia J.	1
Antal, Judit	1
Ban, Jae-Chun	1
Beglar, David	1
Benson, Jeri	1
Bichi, Ado Abdu	1
Bray, Wendy S.	1
Brennan, Robert L.	1
Briggs, Derek C.	1
Bristow, M.	1
Chen, Yu-Jen	1
Cheng, Chien-Fen	1
Cole, Ki Lynn	1
Colton, Dean A.	1
Cook, Linda	1
Cromack, Theodore R.	1