ERIC - Search Results

Publication Date

In 2025	1
Since 2024	5
Since 2021 (last 5 years)	5
Since 2016 (last 10 years)	6
Since 2006 (last 20 years)	18

Descriptor

Comparative Testing	63
Test Bias	63
Test Items	16
Test Validity	15
Racial Differences	13
Sex Differences	12
White Students	12
Higher Education	11
Item Analysis	11
Mathematics Tests	11
Foreign Countries	10
Test Construction	10
Test Format	10
Achievement Tests	9
Black Students	9
College Entrance Examinations	9
Multiple Choice Tests	9
Testing Problems	9
Educational Testing	8
High Schools	8
Scores	8
Computer Assisted Testing	7
Elementary Education	7
Elementary School Students	7
High School Students	7
More ▼

Publication Type

Reports - Research	49
Journal Articles	34
Reports - Evaluative	10
Speeches/Meeting Papers	8
Collected Works - General	1
Dissertations/Theses -…	1
Information Analyses	1
Numerical/Quantitative Data	1
Opinion Papers	1
Reports - Descriptive	1
Tests/Questionnaires	1
More ▼

Education Level

Elementary Education	6
Elementary Secondary Education	5
Higher Education	4
Postsecondary Education	4
Grade 3	2
Grade 8	2
Early Childhood Education	1
Grade 4	1
Grade 5	1
Primary Education	1
Secondary Education	1
More ▼

Audience

Researchers	3
Counselors	1
Practitioners	1

Location

South Africa	2
United States	2
Canada	1
China	1
Georgia (Atlanta)	1
Hong Kong	1
Illinois	1
Ireland	1
Israel	1
Surinam	1
Sweden	1
Tennessee	1
Thailand	1
United Kingdom (England)	1
More ▼

Laws, Policies, & Programs

Elementary and Secondary…	1
No Child Left Behind Act 2001	1

What Works Clearinghouse Rating

Showing 1 to 15 of 63 results Save | Export

Do Reported Treatment Effects Generalize to Other Measures of the Same Construct: A Specification Test

Peer reviewed

Direct link

Peter F. Halpin – Society for Research on Educational Effectiveness, 2024

Background: Meta-analyses of educational interventions have consistently documented the importance of methodological factors related to the choice of outcome measures. In particular, when interventions are evaluated using measures developed by researchers involved with the intervention or its evaluation, the effect sizes tend to be larger than…

Descriptors: College Students, College Faculty, STEM Education, Item Response Theory

Examining the Relationship between Randomization Strategies and Control Group Crossover in Higher Education Interventions. EdWorkingPaper No. 24-1083

Download full text

Catherine Mata; Katharine Meyer; Lindsay Page – Annenberg Institute for School Reform at Brown University, 2024

This article examines the risk of crossover contamination in individual-level randomization, a common concern in experimental research, in the context of a large-enrollment college course. While individual-level randomization is more efficient for assessing program effectiveness, it also increases the potential for control group students to cross…

Descriptors: Chemistry, Science Instruction, Undergraduate Students, Large Group Instruction

Hold the Bets! Should Quasi-Experiments Be Preferred to True Experiments When Causal Generalization Is the Goal?

Peer reviewed

Direct link

Andrew P. Jaciw – American Journal of Evaluation, 2025

By design, randomized experiments (XPs) rule out bias from confounded selection of participants into conditions. Quasi-experiments (QEs) are often considered second-best because they do not share this benefit. However, when results from XPs are used to generalize causal impacts, the benefit from unconfounded selection into conditions may be offset…

Descriptors: Elementary School Students, Elementary School Teachers, Generalization, Test Bias

Item Response Theory Models for Polytomous Multidimensional Forced-Choice Items to Measure Construct Differentiation

Peer reviewed

Direct link

Xuelan Qiu; Jimmy de la Torre; You-Gan Wang; Jinran Wu – Educational Measurement: Issues and Practice, 2024

Multidimensional forced-choice (MFC) items have been found to be useful to reduce response biases in personality assessments. However, conventional scoring methods for the MFC items result in ipsative data, hindering the wider applications of the MFC format. In the last decade, a number of item response theory (IRT) models have been developed,…

Descriptors: Item Response Theory, Personality Traits, Personality Measures, Personality Assessment

Inequity in the Early Years: Student Development Trajectories from Kindergarten to Grade 3. Kindergarten Readiness in Illinois Series, Part 2

Download full text

Sebastian Kiguel; Sarah Cashdollar; Meg Bates – Illinois Workforce and Education Research Collaborative, Discovery Partners Institute, 2024

In this report, we perform an analysis of kindergarten readiness in Illinois and relate it to students' third grade academic achievement. We study two cohorts of Illinois kindergarteners and follow them into third grade using data provided by the Illinois State Board of Education (ISBE). We summarize our key findings below: (1) Disparities appear…

Descriptors: School Readiness, Early Childhood Education, Test Bias, Culture Fair Tests

Multiple True-False Items: A Comparison of Scoring Algorithms

Peer reviewed

Direct link

Lahner, Felicitas-Maria; Lörwald, Andrea Carolin; Bauer, Daniel; Nouns, Zineb Miriam; Krebs, René; Guttormsen, Sissel; Fischer, Martin R.; Huwendiek, Sören – Advances in Health Sciences Education, 2018

Multiple true-false (MTF) items are a widely used supplement to the commonly used single-best answer (Type A) multiple choice format. However, an optimal scoring algorithm for MTF items has not yet been established, as existing studies yielded conflicting results. Therefore, this study analyzes two questions: What is the optimal scoring algorithm…

Descriptors: Scoring Formulas, Scoring Rubrics, Objective Tests, Multiple Choice Tests

Test Score Equating Using Discrete Anchor Items versus Passage-Based Anchor Items: A Case Study Using "SAT"® Data. Research Report. ETS RR-14-14

Peer reviewed
PDF on ERIC

Download full text

Liu, Jinghua; Zu, Jiyun; Curley, Edward; Carey, Jill – ETS Research Report Series, 2014

The purpose of this study is to investigate the impact of discrete anchor items versus passage-based anchor items on observed score equating using empirical data.This study compares an "SAT"® critical reading anchor that contains more discrete items proportionally, compared to the total tests to be equated, to another anchor that…

Descriptors: Equated Scores, Test Items, College Entrance Examinations, Comparative Analysis

Differential Item Functioning: The Consequence of Language, Curriculum, or Culture?

Direct link

Huang, Xiaoting – ProQuest LLC, 2010

In recent decades, the use of large-scale standardized international assessments has increased drastically as a way to evaluate and compare the quality of education across countries. In order to make valid international comparisons, the primary requirement is to ensure the measurement equivalence between the different language versions of these…

Descriptors: Test Bias, Comparative Testing, Foreign Countries, Measurement

International Comparisons and Sensitivity to Instruction

Peer reviewed

Direct link

Wiliam, Dylan – Assessment in Education: Principles, Policy & Practice, 2008

While international comparisons such as those provided by PISA may be meaningful in terms of overall judgements about the performance of educational systems, caution is needed in terms of more fine-grained judgements. In particular it is argued that the results of PISA to draw conclusions about the quality of instruction in different systems is…

Descriptors: Test Bias, Test Construction, Comparative Testing, Evaluation

Comparisons among Designs for Equating Mixed-Format Tests in Large-Scale Assessments

Peer reviewed

Direct link

Kim, Sooyeon; Walker, Michael E.; McHale, Frederick – Journal of Educational Measurement, 2010

In this study we examined variations of the nonequivalent groups equating design for tests containing both multiple-choice (MC) and constructed-response (CR) items to determine which design was most effective in producing equivalent scores across the two tests to be equated. Using data from a large-scale exam, this study investigated the use of…

Descriptors: Measures (Individuals), Scoring, Equated Scores, Test Bias

Score Comparability for Language Minority Students on the Content Assessments Used by Two States. Research Report. ETS RR-11-27

Download full text

Young, John W.; Holtzman, Steven; Steinberg, Jonathan – Educational Testing Service, 2011

In this research investigation of score comparability for language minority students (English language learners [ELLs] and former English language learners), we examined 3 indicators of score comparability (reliability, internal test structure, and differential item functioning) for 4th and 8th grade students who took the NCLB-mandated content…

Descriptors: Language Minorities, Second Language Learning, Grade 8, Minority Group Students

Differentials of a State Reading Assessment: Item Functioning, Distractor Functioning, and Omission Frequency for Disability Categories

Peer reviewed

Direct link

Kato, Kentaro; Moen, Ross E.; Thurlow, Martha L. – Educational Measurement: Issues and Practice, 2009

Large data sets from a state reading assessment for third and fifth graders were analyzed to examine differential item functioning (DIF), differential distractor functioning (DDF), and differential omission frequency (DOF) between students with particular categories of disabilities (speech/language impairments, learning disabilities, and emotional…

Descriptors: Learning Disabilities, Language Impairments, Behavior Disorders, Affective Behavior

Comparability of GCSE Examinations in Different Subjects: An Application of the Rasch Model

Peer reviewed

Direct link

Coe, Robert – Oxford Review of Education, 2008

The comparability of examinations in different subjects has been a controversial topic for many years and a number of criticisms have been made of statistical approaches to estimating the "difficulties" of achieving particular grades in different subjects. This paper argues that if comparability is understood in terms of a linking…

Descriptors: Test Items, Grades (Scholastic), Foreign Countries, Test Bias

Examining Differences in Examinee Performance in Paper and Pencil and Computerized Testing

Peer reviewed
PDF on ERIC

Download full text

Direct link

Puhan, Gautam; Boughton, Keith; Kim, Sooyeon – Journal of Technology, Learning, and Assessment, 2007

The study evaluated the comparability of two versions of a certification test: a paper-and-pencil test (PPT) and computer-based test (CBT). An effect size measure known as Cohen's d and differential item functioning (DIF) analyses were used as measures of comparability at the test and item levels, respectively. Results indicated that the effect…

Descriptors: Computer Assisted Testing, Effect Size, Test Bias, Mathematics Tests

Interpreter and Spanish Administration Effects on the WISC Performance on Mexican-American Children.

Peer reviewed

Swanson, Elinor N.; Deblassie, Richard R. – Journal of School Psychology, 1979

A study was conducted to ascertain whether use of an interpreter and/or a regular examiner in administering the WISC would affect test results of a group of Mexican-American children. Spanish administration of some scales of the performance test are likely to elicit optimum performance. (Author)

Descriptors: Comparative Testing, Elementary Education, Mexican Americans, Psychological Testing

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5

Educational Measurement:…	4
Educational and Psychological…	4
Journal of Technology,…	4
Journal of Educational…	3
Applied Measurement in…	2
Contemporary Educational…	2
Intelligence	2
Advances in Health Sciences…	1
American Educational Research…	1
American Journal of Evaluation	1
Annenberg Institute for…	1
Applied Psychological…	1
Assessment in Education:…	1
ERS Spectrum	1
ETS Research Report Series	1
Educational Testing Service	1
Evaluation and Program…	1
Illinois Workforce and…	1
Journal of Black Psychology	1
Journal of School Psychology	1
Oxford Review of Education	1
ProQuest LLC	1
Scandinavian Journal of…	1
School Psychology Review	1
Society for Research on…	1
More ▼

Kim, Sooyeon	2
Whitworth, Randolph H.	2
Allen, Nancy	1
Andrew P. Jaciw	1
Armstrong, Anne-Marie	1
Barclay, Allan G.	1
Bauer, Daniel	1
Bennett, Randy Elliott	1
Bolger, Niall	1
Bolt, Sara E.	1
Boughton, Keith	1
Breland, Hunter M.	1
Buhr, Dianne C.	1
Carey, Jill	1
Catherine Mata	1
Chang, Yu-Wen	1
Chipman, Susan F.	1
Chrisman, Sabine M.	1
Coe, Robert	1
Coffman, William E.	1
Cowen, Sheila	1
Crino, Michael D.	1
Curley, Edward	1
Davison, Mark L.	1
More ▼

SAT (College Admission Test)	6
Wechsler Intelligence Scale…	4
California Achievement Tests	2
General Aptitude Test Battery	2
Kaufman Assessment Battery…	2
Program for International…	2
Wechsler Adult Intelligence…	2
Advanced Placement…	1
Alabama High School…	1
Armed Services Vocational…	1
Graduate Record Examinations	1
Iowa Tests of Basic Skills	1
Marlowe Crowne Social…	1
My Class Inventory	1
Raven Progressive Matrices	1
Stanford Achievement Tests	1
Texas Assessment of Academic…	1
Wechsler Preschool and…	1
Wide Range Achievement Test	1
More ▼