ERIC - Search Results

Publication Date

In 2025	1
Since 2024	2
Since 2021 (last 5 years)	15
Since 2016 (last 10 years)	40
Since 2006 (last 20 years)	98

Descriptor

Scaling	178
Test Items	178
Item Response Theory	82
Test Construction	66
Foreign Countries	36
Difficulty Level	35
Equated Scores	35
Scores	34
Item Analysis	32
Scoring	31
Mathematics Tests	26
Psychometrics	25
Statistical Analysis	24
Comparative Analysis	22
Test Reliability	22
Achievement Tests	21
Simulation	20
Evaluation Methods	19
Educational Assessment	18
Error of Measurement	18
Mathematical Models	18
Student Evaluation	18
Test Bias	18
Test Validity	18
Latent Trait Theory	17
More ▼

Publication Type

Reports - Research	100
Journal Articles	81
Reports - Evaluative	46
Speeches/Meeting Papers	32
Numerical/Quantitative Data	21
Reports - Descriptive	21
Tests/Questionnaires	7
Dissertations/Theses -…	6
Information Analyses	5
Collected Works - General	4
Guides - General	2
Guides - Non-Classroom	2
Opinion Papers	2
Reports - General	2
Collected Works - Serials	1
Guides - Classroom - Learner	1
Reference Materials -…	1
More ▼

Education Level

Elementary Education	29
Secondary Education	22
Elementary Secondary Education	16
Middle Schools	12
Higher Education	11
Intermediate Grades	11
Junior High Schools	11
Grade 4	10
Grade 6	10
Primary Education	10
Early Childhood Education	9
Grade 8	9
Postsecondary Education	9
Grade 3	8
Grade 5	8
Grade 7	7
High Schools	6
Grade 2	4
Grade 9	4
Kindergarten	3
Grade 1	2
Grade 10	1
More ▼

Audience

Researchers	8
Teachers	3
Practitioners	1

Location

Australia	8
Asia	5
Germany	5
Canada	3
Europe	3
Florida	3
Turkey	3
Austria	2
Chile	2
Denmark	2
France	2
Italy	2
Japan	2
Netherlands	2
South Korea	2
Sweden	2
United States	2
Belgium	1
China	1
Cyprus	1
Czech Republic	1
Estonia	1
Finland	1
Indonesia	1
Iran	1
More ▼

Laws, Policies, & Programs

Individuals with Disabilities…	3
Individuals with Disabilities…	1
No Child Left Behind Act 2001	1

What Works Clearinghouse Rating

Showing 1 to 15 of 178 results Save | Export

A Generalized Objective Function for Computer Adaptive Item Selection

Peer reviewed

Direct link

Harold Doran; Testsuhiro Yamada; Ted Diaz; Emre Gonulates; Vanessa Culver – Journal of Educational Measurement, 2025

Computer adaptive testing (CAT) is an increasingly common mode of test administration offering improved test security, better measurement precision, and the potential for shorter testing experiences. This article presents a new item selection algorithm based on a generalized objective function to support multiple types of testing conditions and…

Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Algorithms

Reconceptualization of Coefficient Alpha Reliability for Test Summed and Scaled Scores

Peer reviewed

Direct link

Almehrizi, Rashid S. – Educational Measurement: Issues and Practice, 2022

Coefficient alpha reliability persists as the most common reliability coefficient reported in research. The assumptions for its use are, however, not well-understood. The current paper challenges the commonly used expressions of coefficient alpha and argues that while these expressions are correct when estimating reliability for summed scores,…

Descriptors: Reliability, Scores, Scaling, Statistical Analysis

Practical Considerations in Item Calibration with Small Samples under Multistage Test Design: A Case Study. Research Report. ETS RR-24-03

Peer reviewed
PDF on ERIC

Download full text

Hongwen Guo; Matthew S. Johnson; Daniel F. McCaffrey; Lixong Gu – ETS Research Report Series, 2024

The multistage testing (MST) design has been gaining attention and popularity in educational assessments. For testing programs that have small test-taker samples, it is challenging to calibrate new items to replenish the item pool. In the current research, we used the item pools from an operational MST program to illustrate how research studies…

Descriptors: Test Items, Test Construction, Sample Size, Scaling

Identifying Problematic Item Characteristics with Small Samples Using Mokken Scale Analysis

Peer reviewed

Direct link

Wind, Stefanie A. – Educational and Psychological Measurement, 2022

Researchers frequently use Mokken scale analysis (MSA), which is a nonparametric approach to item response theory, when they have relatively small samples of examinees. Researchers have provided some guidance regarding the minimum sample size for applications of MSA under various conditions. However, these studies have not focused on item-level…

Descriptors: Nonparametric Statistics, Item Response Theory, Sample Size, Test Items

Maintaining Score Scales over Time: A Comparison of Five Scoring Methods

Peer reviewed

Direct link

Kim, Stella Yun; Lee, Won-Chan – Applied Measurement in Education, 2023

This study evaluates various scoring methods including number-correct scoring, IRT theta scoring, and hybrid scoring in terms of scale-score stability over time. A simulation study was conducted to examine the relative performance of five scoring methods in terms of preserving the first two moments of scale scores for a population in a chain of…

Descriptors: Scoring, Comparative Analysis, Item Response Theory, Simulation

Investigating Invariant Item Ordering in Intelligence Tests: Mokken Scale Analysis of KBIT-2

Peer reviewed
PDF on ERIC

Download full text

Ozberk, Eren Halil; Unsal Ozberk, Elif Bengi; Uluc, Sait; Oktem, Ferhunde – International Journal of Assessment Tools in Education, 2021

The Kaufman Brief Intelligence Test--Second Edition (KBIT-2) is designed to measure verbal and nonverbal abilities in a wide range of individuals from 4 years 0 months to 90 years 11 months of age. This study examines both the advantages of using Mokken Scale Analysis (MSA) in intelligence tests and the hierarchical order of the items in the…

Descriptors: Intelligence Tests, Nonparametric Statistics, Test Items, Test Construction

Mean Comparisons of Many Groups in the Presence of DIF: An Evaluation of Linking and Concurrent Scaling Approaches

Peer reviewed

Direct link

Robitzsch, Alexander; Lüdtke, Oliver – Journal of Educational and Behavioral Statistics, 2022

One of the primary goals of international large-scale assessments in education is the comparison of country means in student achievement. This article introduces a framework for discussing differential item functioning (DIF) for such mean comparisons. We compare three different linking methods: concurrent scaling based on full invariance,…

Descriptors: Test Bias, International Assessment, Scaling, Comparative Analysis

Basic Concepts of Item Response Theory: A Nonmathematical Introduction. Research Memorandum. ETS RM-20-06

Download full text

Livingston, Samuel A. – Educational Testing Service, 2020

This booklet is a conceptual introduction to item response theory (IRT), which many large-scale testing programs use for constructing and scoring their tests. Although IRT is essentially mathematical, the approach here is nonmathematical, in order to serve as an introduction on the topic for people who want to understand why IRT is used and what…

Descriptors: Item Response Theory, Scoring, Test Items, Scaling

Anchors Aweigh: How the Choice of Anchor Items Affects the Vertical Scaling of 3PL Data with the Rasch Model

Peer reviewed

Direct link

Waterbury, Glenn Thomas; DeMars, Christine E. – Educational Assessment, 2021

Vertical scaling is used to put tests of different difficulty onto a common metric. The Rasch model is often used to perform vertical scaling, despite its strict functional form. Few, if any, studies have examined anchor item choice when using the Rasch model to vertically scale data that do not fit the model. The purpose of this study was to…

Descriptors: Test Items, Equated Scores, Item Response Theory, Scaling

Rethinking the Exploration of Dichotomous Data: Mokken Scale Analysis versus Factorial Analysis

Peer reviewed

Direct link

Antino, Mirko; Alvarado, Jesús M.; Asún, Rodrigo A.; Bliese, Paul – Sociological Methods & Research, 2020

The need to determine the correct dimensionality of theoretical constructs and generate valid measurement instruments when underlying items are categorical has generated a significant volume of research in the social sciences. This article presents two studies contrasting different categorical exploratory techniques. The first study compares…

Descriptors: Nonparametric Statistics, Factor Analysis, Item Analysis, Robustness (Statistics)

Efficient Estimation of Mean Ability Growth Using Vertical Scaling

Peer reviewed

Direct link

Bjermo, Jonas; Miller, Frank – Applied Measurement in Education, 2021

In recent years, the interest in measuring growth in student ability in various subjects between different grades in school has increased. Therefore, good precision in the estimated growth is of importance. This paper aims to compare estimation methods and test designs when it comes to precision and bias of the estimated growth of mean ability…

Descriptors: Scaling, Ability, Computation, Test Items

Practical Significance of Item Misfit and Its Manifestations in Constructs Assessed in Large-Scale Studies

Peer reviewed

Direct link

Fährmann, Katharina; Köhler, Carmen; Hartig, Johannes; Heine, Jörg-Henrik – Large-scale Assessments in Education, 2022

When scaling psychological tests with methods of item response theory it is necessary to investigate to what extent the responses correspond to the model predictions. In addition to the statistical evaluation of item misfit, the question arises as to its practical significance. Although item removal is undesirable for several reasons, its…

Descriptors: Psychological Testing, Scaling, Test Items, Item Response Theory

The Comparison of the Dimensionality Results Provided by the Automated Item Selection Procedure and DETECT Analysis

Peer reviewed
PDF on ERIC

Download full text

Mor, Ezgi; Kula-Kartal, Seval – International Journal of Assessment Tools in Education, 2022

The dimensionality is one of the most investigated concepts in the psychological assessment, and there are many ways to determine the dimensionality of a measured construct. The Automated Item Selection Procedure (AISP) and the DETECT are non-parametric methods aiming to determine the factorial structure of a data set. In the current study,…

Descriptors: Psychological Evaluation, Nonparametric Statistics, Test Items, Item Analysis

Statistical Estimation and Inference for Large-Scale Categorical Data

Direct link

Chengcheng Li – ProQuest LLC, 2022

Categorical data become increasingly ubiquitous in the modern big data era. In this dissertation, we propose novel statistical learning and inference methods for large-scale categorical data, focusing on latent variable models and their applications to psychometrics. In psychometric assessments, the subjects' underlying aptitude often cannot be…

Descriptors: Statistical Inference, Data Analysis, Psychometrics, Raw Scores

A Mokken Scale Analysis of the Last Series of the Standard Progressive Matrices (SPM-LS)

Peer reviewed
PDF on ERIC

Download full text

Myszkowski, Nils – Journal of Intelligence, 2020

Raven's Standard Progressive Matrices (Raven 1941) is a widely used 60-item long measure of general mental ability. It was recently suggested that, for situations where taking this test is too time consuming, a shorter version, comprised of only the last series of the Standard Progressive Matrices (Myszkowski and Storme 2018) could be used, while…

Descriptors: Intelligence Tests, Psychometrics, Nonparametric Statistics, Item Response Theory

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12

Applied Psychological…	10
Educational and Psychological…	10
Behavioral Research and…	8
International Association for…	6
Applied Measurement in…	5
ETS Research Report Series	5
Journal of Educational…	5
Ministerial Council on…	5
ProQuest LLC	5
Educational Assessment	3
Journal of Educational and…	3
OECD Publishing (NJ1)	3
Educational Measurement:…	2
Educational Testing Service	2
International Journal of…	2
Journal of Educational…	2
Large-scale Assessments in…	2
Measurement in Physical…	2
Measurement:…	2
New Meridian Corporation	2
Practical Assessment,…	2
Sociological Methods &…	2
ACT, Inc.	1
AERA Online Paper Repository	1
Assessment for Effective…	1
More ▼

Anderson, Daniel	9
Tindal, Gerald	9
Alonzo, Julie	8
Irvin, P. Shawn	7
Park, Bitnara Jasmine	6
Saven, Jessica L.	6
Ban, Jae-Chun	3
Donovan, Jenny	3
Fraillon, Julian, Ed.	3
Hanson, Bradley A.	3
Harris, Deborah J.	3
Lennon, Melissa	3
Schulz, Wolfram, Ed.	3
Yi, Qing	3
Ainley, John, Ed.	2
Allen, Nancy L.	2
Almehrizi, Rashid S.	2
Avery, Marybell	2
Beaton, Albert E.	2
Canner, Jane M.	2
Capar, Nilufer K.	2
Davey, Tim	2
DeMars, Christine E.	2
Dyson, Ben	2
More ▼

Program for International…	11
National Assessment of…	7
SAT (College Admission Test)	5
Test of English as a Foreign…	3
Trends in International…	2
ACT Assessment	1
ACT Interest Inventory	1
Comprehensive Tests of Basic…	1
Florida Comprehensive…	1
Graduate Management Admission…	1
Graduate Record Examinations	1
International Adult Literacy…	1
International English…	1
Kaufman Brief Intelligence…	1
Kaufman Test of Educational…	1
North Carolina End of Course…	1
Piers Harris Childrens Self…	1
Progress in International…	1
Raven Progressive Matrices	1
Sentence Completion Test	1
Stanford Achievement Tests	1
Stanford Diagnostic Reading…	1
Stanford Early School…	1
Tennessee Self Concept Scale	1
More ▼