ERIC - Search Results

Showing 1 to 15 of 57 results

Using Item Scores and Distractors in Person-Fit Assessment

Peer reviewed

Gorney, Kylie; Wollack, James A. – Journal of Educational Measurement, 2023

In order to detect a wide range of aberrant behaviors, it can be useful to incorporate information beyond the dichotomous item scores. In this paper, we extend the l_z and l*_z person-fit statistics so that unusual behavior in item scores and unusual behavior in item distractors can be used as indicators of aberrance. Through…

Descriptors: Test Items, Scores, Goodness of Fit, Statistics

How to Obtain the Most Error-Free Estimate of Reliability? Eight Sources of Deflation in the Estimates of Reliability to Avoid

Peer reviewed

Metsämuuronen, Jari – Practical Assessment, Research & Evaluation, 2022

The reliability of a test score is usually underestimated, and the deflation may be profound: 0.40–0.60 units of reliability, or 46–71%. Eight root sources of the deflation are discussed and quantified by a simulation with 1,440 real-world datasets: (1) errors in the measurement modelling, (2) inefficiency in the estimator of reliability within…

Descriptors: Test Reliability, Scores, Test Items, Correlation

Sample Size and Item Parameter Estimation Precision When Utilizing the Masters' Partial Credit Model

Custer, Michael; Kim, Jongpil – Online Submission, 2023

This study utilizes an analysis of diminishing returns to examine the relationship between sample size and item parameter estimation precision when utilizing the Masters' Partial Credit Model for polytomous items. Item data from the standardization of the Battelle Developmental Inventory, 3rd Edition were used. Each item was scored with a…

Descriptors: Sample Size, Item Response Theory, Test Items, Computation

Examination of Differential Item Functioning in PISA through Univariate and Multivariate Matching Differential Item Functioning

Peer reviewed

Ahmet Yildirim; Nizamettin Koç – International Journal of Assessment Tools in Education, 2024

The present research aims to examine whether the questions in the Programme for International Student Assessment (PISA) 2009 reading literacy instrument display differential item functioning (DIF) among the Turkish, French, and American samples based on univariate and multivariate matching techniques before and after the total score, which is…

Descriptors: Test Items, Item Analysis, Correlation, Error of Measurement

Assessing Measurement Invariance with Dichotomous Items: The Case of Early Grade Mathematic Assessment from the Zambian Sample

Peer reviewed

Mumba, Brian; Alci, Devrim; Uzun, N. Bilge – Journal on Educational Psychology, 2022

Assessment of measurement invariance is an essential component of construct validity in psychological measurement. The procedure for assessing measurement invariance with dichotomous items partially differs from that of invariance testing with continuous items. However, many studies have focused on invariance testing with continuous items…

Descriptors: Mathematics Tests, Test Items, Foreign Countries, Error of Measurement

The Social Shapes Test as a Self-Administered, Online Measure of Social Intelligence: Two Studies with Typically Developing Adults and Adults with Autism Spectrum Disorder

Peer reviewed

Matt I. Brown; Patrick R. Heck; Christopher F. Chabris – Journal of Autism and Developmental Disorders, 2024

The Social Shapes Test (SST) is a measure of social intelligence which does not use human faces or rely on extensive verbal ability. The SST has shown promising validity among adults without autism spectrum disorder (ASD), but it is uncertain whether it is suitable for adults with ASD. We find measurement invariance between adults with (n = 229)…

Descriptors: Interpersonal Competence, Autism Spectrum Disorders, Emotional Intelligence, Verbal Ability

Measuring Language Ability of Students with Compensatory Multidimensional CAT: A Post-Hoc Simulation Study

Peer reviewed

Ozdemir, Burhanettin; Gelbal, Selahattin – Education and Information Technologies, 2022

Computerized adaptive tests (CAT) apply an adaptive process in which the items are tailored to individuals' ability scores. Multidimensional CAT (MCAT) designs differ in the item selection, ability estimation, and termination methods they use. This study aims to investigate the performance of the MCAT designs used to…

Descriptors: Scores, Computer Assisted Testing, Test Items, Language Proficiency

Impact of Item Parameter Drift on Rasch Scale Stability in Small Samples over Multiple Administrations

Peer reviewed

Kopp, Jason P.; Jones, Andrew T. – Applied Measurement in Education, 2020

Traditional psychometric guidelines suggest that at least several hundred respondents are needed to obtain accurate parameter estimates under the Rasch model. However, recent research indicates that Rasch equating results in accurate parameter estimates with sample sizes as small as 25. Item parameter drift under the Rasch model has been…

Descriptors: Item Response Theory, Psychometrics, Sample Size, Sampling

Beyond Group Comparisons: Accounting for Intersectional Sources of Bias in International Survey Measures

Peer reviewed

Rujun Xu; James Soland – International Journal of Testing, 2024

International surveys are increasingly being used to understand nonacademic outcomes like math and science motivation, and to inform education policy changes within countries. Such instruments assume that the measure works consistently across countries, ethnicities, and languages; that is, they assume measurement invariance. While studies have…

Descriptors: Surveys, Statistical Bias, Achievement Tests, Foreign Countries

A Log-Linear Modeling Approach for Differential Item Functioning Detection in Polytomously Scored Items

Peer reviewed

Yesiltas, Gonca; Paek, Insu – Educational and Psychological Measurement, 2020

A log-linear model (LLM) is a well-known statistical method to examine the relationship among categorical variables. This study investigated the performance of LLM in detecting differential item functioning (DIF) for polytomously scored items via simulations where various sample sizes, ability mean differences (impact), and DIF types were…

Descriptors: Simulation, Sample Size, Item Analysis, Scores

Bayesian Approaches to Test Score Measurement Errors in Student Growth Prediction Models

Pei-Hsuan Chiu – ProQuest LLC, 2018

Evidence of student growth is a primary outcome of interest for educational accountability systems. When three or more years of student test data are available, questions around how students grow and what their predicted growth is can be answered. Given that test scores contain measurement error, this error should be considered in growth and…

Descriptors: Bayesian Statistics, Scores, Error of Measurement, Growth Models

Evaluating a Computerized Adaptive Testing Version of a Cognitive Ability Test Using a Simulation Study

Peer reviewed

Tsaousis, Ioannis; Sideridis, Georgios D.; AlGhamdi, Hannan M. – Journal of Psychoeducational Assessment, 2021

This study evaluated the psychometric quality of a computerized adaptive testing (CAT) version of the general cognitive ability test (GCAT), using a simulation study protocol put forth by Han, K. T. (2018a). For the needs of the analysis, three different sets of items were generated, providing an item pool of 165 items. Before evaluating the…

Descriptors: Computer Assisted Testing, Adaptive Testing, Cognitive Tests, Cognitive Ability

Research on Psychometric Modeling, Analysis, and Reporting of the National Assessment of Educational Progress

Peer reviewed

Oranje, Andreas; Kolstad, Andrew – Journal of Educational and Behavioral Statistics, 2019

The design and psychometric methodology of the National Assessment of Educational Progress (NAEP) is constantly evolving to meet the changing interests and demands stemming from a rapidly shifting educational landscape. NAEP has been built on strong research foundations that include conducting extensive evaluations and comparisons before new…

Descriptors: National Competency Tests, Psychometrics, Statistical Analysis, Computation

Student Perceptions of Teaching Quality in Five Countries: A Partial Credit Model Approach to Assess Measurement Invariance

Peer reviewed

van der Lans, Rikkert M.; Maulana, Ridwan; Helms-Lorenz, Michelle; Fernández-García, Carmen-María; Chun, Seyeoung; de Jager, Thelma; Irnidayanti, Yulia; Inda-Caro, Mercedes; Lee, Okhwa; Coetzee, Thys; Fadhilah, Nurul; Jeon, Meae; Moorer, Peter – SAGE Open, 2021

This study examines measurement invariance of student perceptions of teaching quality collected in five countries: Indonesia (n students = 6,331), the Netherlands (n students = 6,738), South Africa (n students = 3,422), South Korea (n students = 6,997) and Spain (n students = 4,676). The administered questionnaire was the My Teacher Questionnaire…

Descriptors: Foreign Countries, Student Attitudes, Student Evaluation of Teacher Performance, Teacher Effectiveness

Item Parameter Drift in a Time-Varying Predictor

Peer reviewed

Lee, HyeSun – Applied Measurement in Education, 2018

The current simulation study examined the effects of Item Parameter Drift (IPD) occurring in a short scale on parameter estimates in multilevel models where scores from a scale were employed as a time-varying predictor to account for outcome scores. Five factors, including three decisions about IPD, were considered for simulation conditions. It…

Descriptors: Test Items, Hierarchical Linear Modeling, Predictor Variables, Scores
