Showing 1 to 15 of 56 results
Peer reviewed
Direct link
James D. Weese; Ronna C. Turner; Allison Ames; Xinya Liang; Brandon Crawford – Journal of Experimental Education, 2024
In this study, a standardized effect size was created for use with the SIBTEST procedure. Using this standardized effect size, a single set of heuristics was developed that is appropriate for data fitting different item response models (e.g., 2-parameter logistic, 3-parameter logistic). The standardized effect size rescales the raw beta-uni value…
Descriptors: Test Bias, Test Items, Item Response Theory, Effect Size
Chengcheng Li – ProQuest LLC, 2022
Categorical data have become increasingly ubiquitous in the modern big data era. In this dissertation, we propose novel statistical learning and inference methods for large-scale categorical data, focusing on latent variable models and their applications to psychometrics. In psychometric assessments, the subjects' underlying aptitude often cannot be…
Descriptors: Statistical Inference, Data Analysis, Psychometrics, Raw Scores
Peer reviewed
Direct link
O'Neill, Thomas R.; Gregg, Justin L.; Peabody, Michael R. – Applied Measurement in Education, 2020
This study addresses equating issues with varying sample sizes using the Rasch model by examining how sample size affects the stability of item calibrations and person ability estimates. A resampling design was used to create 9 sample size conditions (200, 100, 50, 45, 40, 35, 30, 25, and 20), each replicated 10 times. Items were recalibrated…
Descriptors: Sample Size, Equated Scores, Item Response Theory, Raw Scores
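The resampling design described in the O'Neill, Gregg, and Peabody abstract can be illustrated with a small sketch. This is a hypothetical illustration, not the authors' code: it simulates dichotomous responses, then draws subsamples at the study's nine sample-size conditions with 10 replications each, using a crude logit-of-proportion difficulty estimate as a stand-in for a full Rasch calibration.

```python
# Hypothetical sketch (not the authors' code) of a resampling design for
# item-calibration stability: 9 sample-size conditions, 10 replications each.
import math
import random

random.seed(1)

N_PERSONS, N_ITEMS = 1000, 20
# Simulate simple dichotomous responses from a Rasch-like model.
abilities = [random.gauss(0, 1) for _ in range(N_PERSONS)]
difficulties = [random.gauss(0, 1) for _ in range(N_ITEMS)]

def p_correct(theta, b):
    return 1.0 / (1.0 + math.exp(-(theta - b)))

data = [[1 if random.random() < p_correct(th, b) else 0 for b in difficulties]
        for th in abilities]

def crude_difficulty(rows, item):
    """Logit of the proportion incorrect: a rough stand-in for a Rasch calibration."""
    p = sum(r[item] for r in rows) / len(rows)
    p = min(max(p, 1e-3), 1 - 1e-3)          # guard against 0/1 proportions
    return math.log((1 - p) / p)

conditions = [200, 100, 50, 45, 40, 35, 30, 25, 20]   # sample sizes from the study
for n in conditions:
    ests = []
    for _ in range(10):                               # 10 replications per condition
        sample = random.sample(data, n)
        ests.append(crude_difficulty(sample, item=0))
    mean = sum(ests) / len(ests)
    sd = math.sqrt(sum((e - mean) ** 2 for e in ests) / (len(ests) - 1))
    print(f"n={n:>3}  SD of item-1 difficulty across replications: {sd:.3f}")
```

The spread of the replicated estimates at each condition gives a direct picture of how calibration stability degrades as the calibration sample shrinks.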
Peer reviewed
Direct link
Wu, Haiyan; Liang, Xinya; Yürekli, Hülya; Becker, Betsy Jane; Paek, Insu; Binici, Salih – Journal of Psychoeducational Assessment, 2020
The demand for diagnostic feedback has triggered extensive research on cognitive diagnostic models (CDMs), such as the deterministic input, noisy output "and" gate (DINA) model. This study explored two Q-matrix specifications with the DINA model in a statewide large-scale mathematics assessment. The first Q-matrix was developed based on…
Descriptors: Mathematics Tests, Cognitive Measurement, Models, Test Items
Peer reviewed
Direct link
Sims, Maureen E.; Cox, Troy L.; Eckstein, Grant T.; Hartshorn, K. James; Wilcox, Matthew P.; Hart, Judson M. – Educational Measurement: Issues and Practice, 2020
The purpose of this study is to explore the reliability of a potentially more practical approach to direct writing assessment in the context of ESL writing. Traditional rubric rating (RR) is a common yet resource-intensive evaluation practice when performed reliably. This study compared the traditional rubric model of ESL writing assessment and…
Descriptors: Scoring Rubrics, Item Response Theory, Second Language Learning, English (Second Language)
Peer reviewed
Direct link
Lee, Won-Chan; Kim, Stella Y.; Choi, Jiwon; Kang, Yujin – Journal of Educational Measurement, 2020
This article considers psychometric properties of composite raw scores and transformed scale scores on mixed-format tests that consist of a mixture of multiple-choice and free-response items. Test scores on several mixed-format tests are evaluated with respect to conditional and overall standard errors of measurement, score reliability, and…
Descriptors: Raw Scores, Item Response Theory, Test Format, Multiple Choice Tests
Peer reviewed
Direct link
Lenhard, Wolfgang; Lenhard, Alexandra – Educational and Psychological Measurement, 2021
The interpretation of psychometric test results is usually based on norm scores. We compared semiparametric continuous norming (SPCN) with conventional norming methods by simulating results for test scales with different item numbers and difficulties via an item response theory approach. Subsequently, we modeled the norm scores based on random…
Descriptors: Test Norms, Scores, Regression (Statistics), Test Items
Peer reviewed
Direct link
Sinharay, Sandip – Journal of Educational Measurement, 2018
The value-added method of Haberman is arguably one of the most popular methods to evaluate the quality of subscores. The method is based on the classical test theory and deems a subscore to be of added value if the subscore predicts the corresponding true subscore better than does the total score. Sinharay provided an interpretation of the added…
Descriptors: Scores, Value Added Models, Raw Scores, Item Response Theory
Peer reviewed
Direct link
Brann, Kristy L.; Boone, William J.; Splett, Joni W.; Clemons, Courtney; Bidwell, Sarah L. – Journal of Psychoeducational Assessment, 2021
Given the important role that teachers play in supporting student mental health, it is critical that teachers feel confident in their ability to fill such roles. To inform strategies intended to improve teacher confidence in supporting student mental health, a psychometrically sound tool assessing teacher school mental health self-efficacy is needed.…
Descriptors: Teacher Surveys, Test Construction, Psychometrics, Mental Health
Peer reviewed
Direct link
Wesolowski, Brian C. – Journal of Educational Measurement, 2019
The purpose of this study was to build a Random Forest supervised machine learning model in order to predict musical rater-type classifications based upon a Rasch analysis of raters' differential severity/leniency related to item use. Raw scores (N = 1,704) from 142 raters across nine high school solo and ensemble festivals (grades 9-12) were…
Descriptors: Item Response Theory, Prediction, Classification, Artificial Intelligence
Peer reviewed
PDF on ERIC
Fu, Jianbin; Feng, Yuling – ETS Research Report Series, 2018
In this study, we propose aggregating test scores with unidimensional within-test structure and multidimensional across-test structure based on a 2-level, 1-factor model. In particular, we compare 6 score aggregation methods: average of standardized test raw scores (M1), regression factor score estimate of the 1-factor model based on the…
Descriptors: Comparative Analysis, Scores, Correlation, Standardized Tests
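The first aggregation method in the Fu and Feng abstract, average of standardized test raw scores (M1), is simple enough to sketch. This is a hypothetical illustration with toy data, not the authors' code: z-score each test's raw scores, then average the z-scores within each examinee.

```python
# Hypothetical illustration (not the authors' code) of method M1: average of
# standardized (z-scored) test raw scores across tests, per examinee.
import math

def zscores(xs):
    mean = sum(xs) / len(xs)
    sd = math.sqrt(sum((x - mean) ** 2 for x in xs) / (len(xs) - 1))
    return [(x - mean) / sd for x in xs]

# raw scores: rows = examinees, columns = tests (toy data)
raw = [
    [52, 31, 77],
    [48, 35, 70],
    [60, 40, 85],
    [45, 28, 64],
]
cols = list(zip(*raw))                      # one list of raw scores per test
z_cols = [zscores(list(c)) for c in cols]   # standardize within each test
z_rows = list(zip(*z_cols))
m1 = [sum(r) / len(r) for r in z_rows]      # M1: mean of z-scores per examinee
for i, score in enumerate(m1, 1):
    print(f"examinee {i}: M1 composite = {score:+.3f}")
```

Because each test's z-scores sum to zero, the M1 composites are automatically centered at zero across examinees; the factor-score methods compared in the report weight the tests differently rather than equally.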
Peer reviewed
PDF on ERIC
Fujimoto, Ken A.; Gordon, Rachel A.; Peng, Fang; Hofer, Kerry G. – AERA Open, 2018
Classroom quality measures, such as the Early Childhood Environment Rating Scale, Revised (ECERS-R), are widely used in research, practice, and policy. Increasingly, these uses have been for purposes not originally intended, such as contributing to consequential policy decisions. The current study adds to the recent evidence of problems with the…
Descriptors: Rating Scales, Early Childhood Education, Educational Quality, Preschool Curriculum
Fujimoto, Ken A.; Gordon, Rachel A.; Peng, Fang; Hofer, Kerry G. – Grantee Submission, 2018
Classroom quality measures, such as the Early Childhood Environment Rating Scale, Revised (ECERS-R), are widely used in research, practice, and policy. Increasingly, these uses have been for purposes not originally intended, such as contributing to consequential policy decisions. The current study adds to recent evidence of problems with the…
Descriptors: Rating Scales, Educational Quality, Early Childhood Education, Preschool Curriculum
Peer reviewed
PDF on ERIC
Kim, Sooyeon; Livingston, Samuel A. – ETS Research Report Series, 2017
The purpose of this simulation study was to assess the accuracy of a classical test theory (CTT)-based procedure for estimating the alternate-forms reliability of scores on a multistage test (MST) having 3 stages. We generated item difficulty and discrimination parameters for 10 parallel, nonoverlapping forms of the complete 3-stage test and…
Descriptors: Accuracy, Test Theory, Test Reliability, Adaptive Testing
Peer reviewed
Direct link
Ho, Andrew D.; Yu, Carol C. – Educational and Psychological Measurement, 2015
Many statistical analyses benefit from the assumption that unconditional or conditional distributions are continuous and normal. More than 50 years ago in this journal, Lord and Cook chronicled departures from normality in educational tests, and Micceri similarly showed that the normality assumption is met rarely in educational and psychological…
Descriptors: Statistics, Scores, Statistical Distributions, Tests
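The kind of normality check the Ho and Yu abstract describes can be sketched briefly. This is a hypothetical illustration, not the authors' analysis: compute sample skewness and excess kurtosis for a toy number-correct score distribution; both are 0 under normality, and a binomial-like score distribution with a high proportion correct departs from that.

```python
# Hypothetical sketch: sample skewness and excess kurtosis of a toy
# test-score distribution (both are 0 for a normal distribution).
import random

random.seed(0)
# Toy number-correct scores: 40 items, 70% chance correct per item.
scores = [sum(random.random() < 0.7 for _ in range(40)) for _ in range(5000)]

n = len(scores)
mean = sum(scores) / n
m2 = sum((x - mean) ** 2 for x in scores) / n   # central moments
m3 = sum((x - mean) ** 3 for x in scores) / n
m4 = sum((x - mean) ** 4 for x in scores) / n
skew = m3 / m2 ** 1.5
ex_kurt = m4 / m2 ** 2 - 3.0
print(f"skewness = {skew:.3f}, excess kurtosis = {ex_kurt:.3f}")
```

With an easy test (high proportion correct), scores pile up near the ceiling and the distribution is left-skewed, which is exactly the sort of departure from normality the article documents in real educational data.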