ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	8
Since 2016 (last 10 years)	15
Since 2006 (last 20 years)	35

Descriptor

Raw Scores	58
Test Items	58
Comparative Analysis	20
Equated Scores	19
Item Response Theory	17
Difficulty Level	16
Item Analysis	16
Scoring	13
Test Construction	13
Sample Size	11
Scaling	11
Statistical Analysis	11
Scores	9
Test Format	9
Correlation	8
Error of Measurement	8
Foreign Countries	8
Mathematics Tests	8
Psychometrics	8
Cutting Scores	7
Goodness of Fit	7
Test Validity	7
Multiple Choice Tests	6
Academic Achievement	5
Academic Standards	5
More ▼

Publication Type

Reports - Research	34
Journal Articles	33
Reports - Evaluative	15
Speeches/Meeting Papers	12
Reports - Descriptive	7
Tests/Questionnaires	4
Numerical/Quantitative Data	3
Dissertations/Theses -…	1
Guides - Classroom - Learner	1

Education Level

Elementary Secondary Education	8
Higher Education	6
Postsecondary Education	6
Elementary Education	5
Secondary Education	3
Grade 2	2
Grade 3	2
Grade 6	2
Adult Education	1
Early Childhood Education	1
Grade 1	1
Grade 10	1
Grade 5	1
Grade 7	1
Primary Education	1
More ▼

Audience

Researchers	2
Teachers	2

Location

Australia	3
New Mexico	2
Colombia	1
Japan	1
South Africa	1
Taiwan	1

Laws, Policies, & Programs

Assessments and Surveys

SAT (College Admission Test)	2
ACT Assessment	1
Beck Depression Inventory	1
Iowa Tests of Basic Skills	1
New Jersey College Basic…	1
Preschool Language Scale	1
Program for International…	1
Raven Progressive Matrices	1
Woodcock Reading Mastery Test	1

What Works Clearinghouse Rating

Showing 1 to 15 of 58 results Save | Export

The Development of a Standardized Effect Size for the SIBTEST Procedure

Peer reviewed

Direct link

James D. Weese; Ronna C. Turner; Allison Ames; Xinya Liang; Brandon Crawford – Journal of Experimental Education, 2024

In this study a standardized effect size was created for use with the SIBTEST procedure. Using this standardized effect size, a single set of heuristics was developed that are appropriate for data fitting different item response models (e.g., 2-parameter logistic, 3-parameter logistic). The standardized effect size rescales the raw beta-uni value…

Descriptors: Test Bias, Test Items, Item Response Theory, Effect Size

Evaluating Population Invariance of Test Equating during the COVID-19 Pandemic

Peer reviewed

Direct link

Li, Dongmei; Kapoor, Shalini – Educational Measurement: Issues and Practice, 2022

Population invariance is a desirable property of test equating which might not hold when significant changes occur in the test population, such as those brought about by the COVID-19 pandemic. This research aims to investigate whether equating functions are reasonably invariant when the test population is impacted by the pandemic. Based on…

Descriptors: Test Items, Equated Scores, COVID-19, Pandemics

Effect of Statistically Matching Equating Samples for Common-Item Equating. Research Report. ETS RR-21-02

Peer reviewed
PDF on ERIC

Download full text

Lu, Ru; Kim, Sooyeon – ETS Research Report Series, 2021

This study evaluated the impact of subgroup weighting for equating through a common-item anchor. We used data from a single test form to create two research forms for which the equating relationship was known. The results showed that equating was most accurate when the new form and reference form samples were weighted to be similar to the target…

Descriptors: Equated Scores, Weighted Scores, Raw Scores, Test Items

Evaluating Different Scoring Methods for Multiple Response Items Providing Partial Credit

Peer reviewed

Direct link

Betts, Joe; Muntean, William; Kim, Doyoung; Kao, Shu-chuan – Educational and Psychological Measurement, 2022

The multiple response structure can underlie several different technology-enhanced item types. With the increased use of computer-based testing, multiple response items are becoming more common. This response type holds the potential for being scored polytomously for partial credit. However, there are several possible methods for computing raw…

Descriptors: Scoring, Test Items, Test Format, Raw Scores

Statistical Estimation and Inference for Large-Scale Categorical Data

Direct link

Chengcheng Li – ProQuest LLC, 2022

Categorical data become increasingly ubiquitous in the modern big data era. In this dissertation, we propose novel statistical learning and inference methods for large-scale categorical data, focusing on latent variable models and their applications to psychometrics. In psychometric assessments, the subjects' underlying aptitude often cannot be…

Descriptors: Statistical Inference, Data Analysis, Psychometrics, Raw Scores

Effect of Sample Size on Common Item Equating Using the Dichotomous Rasch Model

Peer reviewed

Direct link

O'Neill, Thomas R.; Gregg, Justin L.; Peabody, Michael R. – Applied Measurement in Education, 2020

This study addresses equating issues with varying sample sizes using the Rasch model by examining how sample size affects the stability of item calibrations and person ability estimates. A resampling design was used to create 9 sample size conditions (200, 100, 50, 45, 40, 35, 30, 25, and 20), each replicated 10 times. Items were recalibrated…

Descriptors: Sample Size, Equated Scores, Item Response Theory, Raw Scores

Exploring the Impact of Q-Matrix Specifications through a DINA Model in a Large-Scale Mathematics Assessment

Peer reviewed

Direct link

Wu, Haiyan; Liang, Xinya; Yürekli, Hülya; Becker, Betsy Jane; Paek, Insu; Binici, Salih – Journal of Psychoeducational Assessment, 2020

The demand for diagnostic feedback has triggered extensive research on cognitive diagnostic models (CDMs), such as the deterministic input, noisy output "and" gate (DINA) model. This study explored two Q-matrix specifications with the DINA model in a statewide large-scale mathematics assessment. The first Q-matrix was developed based on…

Descriptors: Mathematics Tests, Cognitive Measurement, Models, Test Items

Improvement of Norm Score Quality via Regression-Based Continuous Norming

Peer reviewed

Direct link

Lenhard, Wolfgang; Lenhard, Alexandra – Educational and Psychological Measurement, 2021

The interpretation of psychometric test results is usually based on norm scores. We compared semiparametric continuous norming (SPCN) with conventional norming methods by simulating results for test scales with different item numbers and difficulties via an item response theory approach. Subsequently, we modeled the norm scores based on random…

Descriptors: Test Norms, Scores, Regression (Statistics), Test Items

The Pseudo-Equivalent Groups Approach as an Alternative to Common-Item Equating. Research Report. ETS RR-18-02

Peer reviewed
PDF on ERIC

Download full text

Kim, Sooyeon; Lu, Ru – ETS Research Report Series, 2018

The purpose of this study was to evaluate the effectiveness of linking test scores by using test takers' background data to form pseudo-equivalent groups (PEG) of test takers. Using 4 operational test forms that each included 100 items and were taken by more than 30,000 test takers, we created 2 half-length research forms that had either 20…

Descriptors: Test Items, Item Banks, Difficulty Level, Comparative Analysis

Development of the School Mental Health Self-Efficacy Teacher Survey Using Rasch Analysis

Peer reviewed

Direct link

Brann, Kristy L.; Boone, William J.; Splett, Joni W.; Clemons, Courtney; Bidwell, Sarah L. – Journal of Psychoeducational Assessment, 2021

Given the important role that teachers play in supporting student mental health, it is critical teachers feel confident in their ability to fill such roles. To inform strategies intended to improve teacher confidence in supporting student mental health, a psychometrically sound tool assessing teacher school mental health self-efficacy is needed.…

Descriptors: Teacher Surveys, Test Construction, Psychometrics, Mental Health

An Investigation of Speededness as a Possible Explanation for Mode Effects on the ACT. Technical Brief. 2021-R2142

Download full text

Wang, Shichao; Li, Dongmei; Steedle, Jeffrey – ACT, Inc., 2021

Speeded tests set time limits so that few examinees can reach all items, and power tests allow most test-takers sufficient time to attempt all items. Educational achievement tests are sometimes described as "timed power tests" because the amount of time provided is intended to allow nearly all students to complete the test, yet this…

Descriptors: Timed Tests, Test Items, Achievement Tests, Testing

A Comparison of Score Aggregation Methods for Unidimensional Tests on Different Dimensions. Research Report. ETS RR-18-01

Peer reviewed
PDF on ERIC

Download full text

Fu, Jianbin; Feng, Yuling – ETS Research Report Series, 2018

In this study, we propose aggregating test scores with unidimensional within-test structure and multidimensional across-test structure based on a 2-level, 1-factor model. In particular, we compare 6 score aggregation methods: average of standardized test raw scores (M1), regression factor score estimate of the 1-factor model based on the…

Descriptors: Comparative Analysis, Scores, Correlation, Standardized Tests

A Comparison of Distractor Selection among Proficiency Levels in Reading Tests: A Focus on Summarization Processes in Japanese EFL Learners

Peer reviewed

Direct link

Terao, Takahiro; Ishii, Hidetoki – SAGE Open, 2020

This study aimed to compare selection patterns of distractors (incorrect options) according to test taker proficiency regarding Japanese students' summarization skills of an English paragraph. Participants included 414 undergraduate students, and the test comprised three summarization process types--deletion, generalization, and integration.…

Descriptors: Comparative Analysis, English (Second Language), Second Language Instruction, Second Language Learning

Use of Jackknifing to Evaluate Effects of Anchor Item Selection on Equating with the Nonequivalent Groups with Anchor Test (NEAT) Design. Research Report. ETS RR-15-10

Peer reviewed
PDF on ERIC

Download full text

Lu, Ru; Haberman, Shelby; Guo, Hongwen; Liu, Jinghua – ETS Research Report Series, 2015

In this study, we apply jackknifing to anchor items to evaluate the impact of anchor selection on equating stability. In an ideal world, the choice of anchor items should have little impact on equating results. When this ideal does not correspond to reality, selection of anchor items can strongly influence equating results. This influence does not…

Descriptors: Test Construction, Equated Scores, Test Items, Sampling

Development of the BioCalculus Assessment (BCA)

Peer reviewed

Direct link

Taylor, Robin T.; Bishop, Pamela R.; Lenhart, Suzanne; Gross, Louis J.; Sturner, Kelly – CBE - Life Sciences Education, 2020

We describe the development and initial validity assessment of the 20-item BioCalculus Assessment (BCA), with the objective of comparing undergraduate life science students' understanding of calculus concepts in different courses with alternative emphases (with and without focus on biological applications). The development process of the BCA…

Descriptors: Test Construction, Mathematics Tests, Calculus, Test Validity

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4

ETS Research Report Series	8
Educational Testing Service	3
Educational and Psychological…	3
Journal of Educational…	3
Ministerial Council on…	3
Applied Psychological…	2
International Journal of…	2
Journal of Psychoeducational…	2
New Mexico Public Education…	2
ACT, Inc.	1
Applied Measurement in…	1
Assessment	1
CBE - Life Sciences Education	1
Educational Assessment	1
Educational Measurement:…	1
International Journal of…	1
International Journal of…	1
Journal of Experimental…	1
PROFILE: Issues in Teachers'…	1
Pearson	1
Popular Measurement	1
ProQuest LLC	1
Research in Developmental…	1
SAGE Open	1
Social Indicators Research	1
More ▼

Kim, Sooyeon	3
Livingston, Samuel A.	3
Lu, Ru	3
Puhan, Gautam	3
Ackerman, Terry A.	2
Binici, Salih	2
Guo, Hongwen	2
Li, Dongmei	2
Liu, Jinghua	2
Sinharay, Sandip	2
Allison Ames	1
Almehrizi, Rashid S.	1
Anderson, A. E.	1
Arce, Alvaro J.	1
Becker, Betsy Jane	1
Becker, Kirk	1
Bell, Anita I.	1
Bene, Nancy H.	1
Betts, Joe	1
Bidwell, Sarah L.	1
Bishop, Pamela R.	1
Boone, William J.	1
Brandon Crawford	1
Brann, Kristy L.	1
More ▼