ERIC - Search Results

Publication Date

In 2025	1
Since 2024	5
Since 2021 (last 5 years)	23
Since 2016 (last 10 years)	67
Since 2006 (last 20 years)	156

Descriptor

Item Response Theory	234
Scaling	234
Test Items	82
Test Construction	56
Scores	50
Equated Scores	44
Models	37
Foreign Countries	36
Scoring	34
Comparative Analysis	31
Error of Measurement	31
Test Reliability	31
Educational Assessment	29
Simulation	29
Mathematics Tests	28
Psychometrics	27
Achievement Tests	26
Test Bias	25
Test Validity	25
Grade 4	23
Item Analysis	23
Evaluation Methods	22
Difficulty Level	19
Estimation (Mathematics)	19
Nonparametric Statistics	19
More ▼

Publication Type

Journal Articles	155
Reports - Research	115
Reports - Evaluative	67
Speeches/Meeting Papers	30
Reports - Descriptive	29
Numerical/Quantitative Data	19
Dissertations/Theses -…	15
Information Analyses	4
Opinion Papers	4
Tests/Questionnaires	4
Books	3
Collected Works - General	3
Book/Product Reviews	2
Guides - Classroom - Learner	1
Guides - General	1
More ▼

Education Level

Elementary Education	30
Secondary Education	28
Grade 4	22
Elementary Secondary Education	21
Intermediate Grades	20
Grade 5	19
Grade 3	18
Middle Schools	18
Grade 6	17
Junior High Schools	17
Early Childhood Education	16
Higher Education	16
Primary Education	16
Grade 7	15
Grade 8	14
Postsecondary Education	13
High Schools	10
Grade 9	6
Grade 2	5
Kindergarten	4
Grade 1	3
Grade 10	2
Grade 11	2
Adult Education	1
Grade 12	1
More ▼

Audience

Researchers

Location

Florida	6
Germany	5
New York	4
Turkey	4
Australia	3
Colorado	3
North Carolina	3
Arizona	2
Austria	2
California	2
China	2
Indiana	2
Iran	2
Italy	2
Sweden	2
Switzerland	2
Tennessee	2
Argentina	1
Brazil	1
Chile	1
Cyprus	1
Delaware	1
Hawaii	1
Hungary	1
Idaho	1
More ▼

Laws, Policies, & Programs

Individuals with Disabilities…	1
No Child Left Behind Act 2001	1

What Works Clearinghouse Rating

Showing 1 to 15 of 234 results Save | Export

Combining Mokken Scale Analysis with Rasch Measurement Theory to Explore Differences in Measurement Quality between Subgroups

Peer reviewed

Direct link

Stefanie A. Wind; Benjamin Lugu; Yurou Wang – International Journal of Testing, 2025

Mokken Scale Analysis (MSA) is a nonparametric approach that offers exploratory tools for understanding the nature of item responses while emphasizing invariance requirements. MSA is often discussed as it relates to Rasch measurement theory, which also emphasizes invariance, but uses parametric models. Researchers who have compared and combined…

Descriptors: Item Response Theory, Scaling, Surveys, Evaluation Methods

Practical Considerations in Item Calibration with Small Samples under Multistage Test Design: A Case Study. Research Report. ETS RR-24-03

Peer reviewed
PDF on ERIC

Download full text

Hongwen Guo; Matthew S. Johnson; Daniel F. McCaffrey; Lixong Gu – ETS Research Report Series, 2024

The multistage testing (MST) design has been gaining attention and popularity in educational assessments. For testing programs that have small test-taker samples, it is challenging to calibrate new items to replenish the item pool. In the current research, we used the item pools from an operational MST program to illustrate how research studies…

Descriptors: Test Items, Test Construction, Sample Size, Scaling

Using a Projection IRT Method for Vertical Scaling When Construct Shift Is Present

Peer reviewed

Direct link

Strachan, Tyler; Cho, Uk Hyun; Kim, Kyung Yong; Willse, John T.; Chen, Shyh-Huei; Ip, Edward H.; Ackerman, Terry A.; Weeks, Jonathan P. – Journal of Educational Measurement, 2021

In vertical scaling, results of tests from several different grade levels are placed on a common scale. Most vertical scaling methodologies rely heavily on the assumption that the construct being measured is unidimensional. In many testing situations, however, such an assumption could be problematic. For instance, the construct measured at one…

Descriptors: Item Response Theory, Scaling, Tests, Construct Validity

Identifying Problematic Item Characteristics with Small Samples Using Mokken Scale Analysis

Peer reviewed

Direct link

Wind, Stefanie A. – Educational and Psychological Measurement, 2022

Researchers frequently use Mokken scale analysis (MSA), which is a nonparametric approach to item response theory, when they have relatively small samples of examinees. Researchers have provided some guidance regarding the minimum sample size for applications of MSA under various conditions. However, these studies have not focused on item-level…

Descriptors: Nonparametric Statistics, Item Response Theory, Sample Size, Test Items

NEPSscaling: Plausible Value Estimation for Competence Tests Administered in the German National Educational Panel Study

Peer reviewed

Direct link

Scharl, Anna; Zink, Eva – Large-scale Assessments in Education, 2022

Educational large-scale assessments (LSAs) often provide plausible values for the administered competence tests to facilitate the estimation of population effects. This requires the specification of a background model that is appropriate for the specific research question. Because the "German National Educational Panel Study" (NEPS) is…

Descriptors: National Competency Tests, Foreign Countries, Programming Languages, Longitudinal Studies

The Eco-Generativity Scale-Short Form: A Multidimensional Item Response Theory Analysis in University Students

Peer reviewed

Direct link

Annamaria Di Fabio; Andrea Svicher – Journal of Psychoeducational Assessment, 2024

The Eco-Generativity Scale (EGS) is a recently developed 28-item scale derived from a 4-factor higher-order model (ecological generativity, social generativity, environmental identity, and agency/pathways). The aim of this study was to develop a short-scale version of the EGS to facilitate its use with university students (N = 779) who will…

Descriptors: Foreign Countries, College Students, Ecology, Likert Scales

Maintaining Score Scales over Time: A Comparison of Five Scoring Methods

Peer reviewed

Direct link

Kim, Stella Yun; Lee, Won-Chan – Applied Measurement in Education, 2023

This study evaluates various scoring methods including number-correct scoring, IRT theta scoring, and hybrid scoring in terms of scale-score stability over time. A simulation study was conducted to examine the relative performance of five scoring methods in terms of preserving the first two moments of scale scores for a population in a chain of…

Descriptors: Scoring, Comparative Analysis, Item Response Theory, Simulation

Investigating Invariant Item Ordering in Intelligence Tests: Mokken Scale Analysis of KBIT-2

Peer reviewed
PDF on ERIC

Download full text

Ozberk, Eren Halil; Unsal Ozberk, Elif Bengi; Uluc, Sait; Oktem, Ferhunde – International Journal of Assessment Tools in Education, 2021

The Kaufman Brief Intelligence Test--Second Edition (KBIT-2) is designed to measure verbal and nonverbal abilities in a wide range of individuals from 4 years 0 months to 90 years 11 months of age. This study examines both the advantages of using Mokken Scale Analysis (MSA) in intelligence tests and the hierarchical order of the items in the…

Descriptors: Intelligence Tests, Nonparametric Statistics, Test Items, Test Construction

Basic Concepts of Item Response Theory: A Nonmathematical Introduction. Research Memorandum. ETS RM-20-06

Download full text

Livingston, Samuel A. – Educational Testing Service, 2020

This booklet is a conceptual introduction to item response theory (IRT), which many large-scale testing programs use for constructing and scoring their tests. Although IRT is essentially mathematical, the approach here is nonmathematical, in order to serve as an introduction on the topic for people who want to understand why IRT is used and what…

Descriptors: Item Response Theory, Scoring, Test Items, Scaling

Making Each Point Count: Revising a Local Adaptation of the Jacobs et al.'s (1981) ESL COMPOSITION PROFILE Rubric

Peer reviewed

Direct link

Yu-Tzu Chang; Ann Tai Choe; Daniel Holden; Daniel R. Isbell – Language Testing, 2024

In this Brief Report, we describe an evaluation of and revisions to a rubric adapted from the Jacobs et al.'s (1981) ESL COMPOSITION PROFILE, with four rubric categories and 20-point rating scales, in the context of an intensive English program writing placement test. Analysis of 4 years of rating data (2016-2021, including 434 essays) using…

Descriptors: Language Tests, Rating Scales, Second Language Learning, English (Second Language)

A Scaled Threshold Model for Measuring Extreme Response Style

Peer reviewed

Direct link

Lubbe, Dirk; Schuster, Christof – Journal of Educational and Behavioral Statistics, 2020

Extreme response style is the tendency of individuals to prefer the extreme categories of a rating scale irrespective of item content. It has been shown repeatedly that individual response style differences affect the reliability and validity of item responses and should, therefore, be considered carefully. To account for extreme response style…

Descriptors: Response Style (Tests), Rating Scales, Item Response Theory, Models

Anchors Aweigh: How the Choice of Anchor Items Affects the Vertical Scaling of 3PL Data with the Rasch Model

Peer reviewed

Direct link

Waterbury, Glenn Thomas; DeMars, Christine E. – Educational Assessment, 2021

Vertical scaling is used to put tests of different difficulty onto a common metric. The Rasch model is often used to perform vertical scaling, despite its strict functional form. Few, if any, studies have examined anchor item choice when using the Rasch model to vertically scale data that do not fit the model. The purpose of this study was to…

Descriptors: Test Items, Equated Scores, Item Response Theory, Scaling

Rethinking the Exploration of Dichotomous Data: Mokken Scale Analysis versus Factorial Analysis

Peer reviewed

Direct link

Antino, Mirko; Alvarado, Jesús M.; Asún, Rodrigo A.; Bliese, Paul – Sociological Methods & Research, 2020

The need to determine the correct dimensionality of theoretical constructs and generate valid measurement instruments when underlying items are categorical has generated a significant volume of research in the social sciences. This article presents two studies contrasting different categorical exploratory techniques. The first study compares…

Descriptors: Nonparametric Statistics, Factor Analysis, Item Analysis, Robustness (Statistics)

Efficient Estimation of Mean Ability Growth Using Vertical Scaling

Peer reviewed

Direct link

Bjermo, Jonas; Miller, Frank – Applied Measurement in Education, 2021

In recent years, the interest in measuring growth in student ability in various subjects between different grades in school has increased. Therefore, good precision in the estimated growth is of importance. This paper aims to compare estimation methods and test designs when it comes to precision and bias of the estimated growth of mean ability…

Descriptors: Scaling, Ability, Computation, Test Items

Computational Thinking Assessment -- Towards More Vivid Interpretations

Peer reviewed

Direct link

Guggemos, Josef; Seufert, Sabine; Román-González, Marcos – Technology, Knowledge and Learning, 2023

Computational thinking (CT) is an important 21st-century skill. This paper aims at more useful CT assessment. Available evaluation instruments are reviewed; two generally accepted CT evaluation tools are selected for a comprehensive CT assessment: the CTt, a performance test, and the CTS, a self-assessment instrument. The sample comprises 202 high…

Descriptors: Computation, Thinking Skills, 21st Century Skills, Evaluation Methods

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 16

Applied Psychological…	20
Journal of Educational…	16
ProQuest LLC	15
Educational and Psychological…	14
Applied Measurement in…	10
ETS Research Report Series	9
Educational Measurement:…	7
Journal of Educational and…	7
Behavioral Research and…	6
Journal of Applied Measurement	5
Online Submission	5
Journal of Psychoeducational…	4
Measurement:…	4
Partnership for Assessment of…	4
International Journal of…	3
International Journal of…	3
Large-scale Assessments in…	3
New York State Education…	3
Assessment for Effective…	2
Educational Assessment	2
Educational Testing Service	2
International Journal of…	2
International Journal of…	2
Journal of Educational…	2
Language Testing	2
More ▼

Anderson, Daniel	7
Alonzo, Julie	6
Irvin, P. Shawn	6
Park, Bitnara Jasmine	6
Saven, Jessica L.	6
Tindal, Gerald	6
Kolen, Michael J.	5
Wind, Stefanie A.	5
Schulz, E. Matthew	4
Briggs, Derek C.	3
Camilli, Gregory	3
DeMars, Christine E.	3
Hambleton, Ronald K.	3
Han, Kyung T.	3
Keller, Lisa A.	3
Petscher, Yaacov	3
Sireci, Stephen G.	3
van der Ark, L. Andries	3
Abdel-fattah, Abdel-fattah A.	2
Avery, Marybell	2
Bassiri, Dina	2
Beaton, Albert E.	2
Bolt, Daniel M.	2
Capar, Nilufer K.	2
More ▼

National Assessment of…	12
Program for International…	7
ACT Assessment	5
Test of English as a Foreign…	4
Trends in International…	4
SAT (College Admission Test)	3
Iowa Tests of Educational…	2
Progress in International…	2
College Level Examination…	1
Eysenck Personality Inventory	1
Florida Comprehensive…	1
General Educational…	1
Graduate Record Examinations	1
Hollingshead Social Economic…	1
International Adult Literacy…	1
International English…	1
Iowa Tests of Basic Skills	1
Kaufman Brief Intelligence…	1
Kaufman Test of Educational…	1
Lexile Scale of Reading	1
National Adult Literacy…	1
North Carolina End of Course…	1
Raven Progressive Matrices	1
Wechsler Individual…	1
Wechsler Intelligence Scale…	1
More ▼