Showing 1 to 15 of 46 results
Peer reviewed
Alqarni, Abdulelah Mohammed – Journal on Educational Psychology, 2019
This study compares the psychometric properties of reliability in Classical Test Theory (CTT), item information in Item Response Theory (IRT), and validation from the perspective of modern validity theory for the purpose of bringing attention to potential issues that might exist when testing organizations use both test theories in the same testing…
Descriptors: Test Theory, Item Response Theory, Test Construction, Scoring
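The contrast the Alqarni abstract draws between CTT reliability and IRT item information can be made concrete with a small numerical sketch. The snippet below is illustrative only: it uses simulated dichotomous responses and hypothetical 2PL item parameters, not anything from the study, and computes Cronbach's alpha as the CTT reliability index alongside the 2PL item information function at a few ability levels.

```python
import numpy as np

rng = np.random.default_rng(0)

# --- CTT side: Cronbach's alpha from simulated 0/1 responses (hypothetical data) ---
n_persons, n_items = 500, 10
theta = rng.normal(size=n_persons)                      # latent ability
b = np.linspace(-1.5, 1.5, n_items)                     # hypothetical difficulties
a = np.full(n_items, 1.2)                               # hypothetical discriminations
p = 1 / (1 + np.exp(-a * (theta[:, None] - b)))         # 2PL response probabilities
x = (rng.random((n_persons, n_items)) < p).astype(int)  # scored item responses

k = n_items
item_var = x.var(axis=0, ddof=1).sum()
total_var = x.sum(axis=1).var(ddof=1)
alpha = k / (k - 1) * (1 - item_var / total_var)
print(f"Cronbach's alpha (CTT reliability): {alpha:.3f}")

# --- IRT side: 2PL item information I(theta) = a^2 * P * (1 - P) ---
grid = np.array([-2.0, -1.0, 0.0, 1.0, 2.0])
for i in (0, n_items // 2, n_items - 1):
    p_i = 1 / (1 + np.exp(-a[i] * (grid - b[i])))
    info = a[i] ** 2 * p_i * (1 - p_i)
    print(f"item {i + 1}: information at theta {grid.tolist()} -> {np.round(info, 3).tolist()}")
```

The contrast this illustrates is that alpha summarizes precision in a single sample-dependent number, whereas item information varies across the ability scale, which is one source of the tension that can arise when both frameworks are applied to the same testing program.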
Peer reviewed
von Davier, Matthias; Tyack, Lillian; Khorramdel, Lale – Educational and Psychological Measurement, 2023
Automated scoring of free drawings or images as responses has yet to be used in large-scale assessments of student achievement. In this study, we propose artificial neural networks to classify these types of graphical responses from a TIMSS 2019 item. We compare the classification accuracy of convolutional and feed-forward approaches. Our…
Descriptors: Scoring, Networks, Artificial Intelligence, Elementary Secondary Education
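As a rough illustration of the comparison described in the von Davier et al. abstract (not the authors' models or the TIMSS 2019 data), the sketch below defines a small convolutional classifier and a flattened feed-forward classifier for hypothetical 64x64 grayscale drawings scored into three categories, using PyTorch.

```python
import torch
import torch.nn as nn

NUM_CLASSES = 3  # hypothetical score categories for a graphical response item

# Convolutional classifier: learns local spatial features of the drawing.
conv_model = nn.Sequential(
    nn.Conv2d(1, 8, kernel_size=3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
    nn.Conv2d(8, 16, kernel_size=3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
    nn.Flatten(),
    nn.Linear(16 * 16 * 16, NUM_CLASSES),  # 64x64 input -> 16x16 feature maps
)

# Feed-forward classifier: treats the flattened pixels as unordered features.
ff_model = nn.Sequential(
    nn.Flatten(),
    nn.Linear(64 * 64, 128), nn.ReLU(),
    nn.Linear(128, NUM_CLASSES),
)

batch = torch.randn(4, 1, 64, 64)  # four hypothetical grayscale drawings
print(conv_model(batch).shape, ff_model(batch).shape)  # both: torch.Size([4, 3])
```

In practice either network would be trained with a cross-entropy loss against human-assigned scores, and classification accuracy on held-out responses is the kind of criterion the abstract's comparison refers to.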
Peer reviewed
Kelleher, Leila K.; Beach, Tyson A. C.; Frost, David M.; Johnson, Andrew M.; Dickey, James P. – Measurement in Physical Education and Exercise Science, 2018
The scoring scheme for the functional movement screen implicitly assumes that the factor structure is consistent, stable, and congruent across different populations. To determine if this is the case, we compared principal components analyses of three samples: a healthy, general population (n = 100), a group of varsity athletes (n = 101), and a…
Descriptors: Factor Structure, Test Reliability, Screening Tests, Motion
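A hedged sketch of the kind of comparison the Kelleher et al. abstract describes: run a principal components analysis separately in two samples and quantify how similar the leading component loadings are with Tucker's congruence coefficient. The data below are simulated, and the seven-item structure is only a stand-in for the functional movement screen's component tests.

```python
import numpy as np

rng = np.random.default_rng(1)

def first_pc_loadings(data):
    """Loadings of the first principal component of the correlation matrix."""
    corr = np.corrcoef(data, rowvar=False)
    eigvals, eigvecs = np.linalg.eigh(corr)
    return eigvecs[:, -1]  # eigenvector for the largest eigenvalue

def tucker_phi(x, y):
    """Tucker's congruence coefficient between two loading vectors."""
    return float(np.abs(np.dot(x, y)) / np.sqrt(np.dot(x, x) * np.dot(y, y)))

n_items = 7  # stand-in for the screen's component tests
factor_a = rng.normal(size=(100, 1))                      # shared factor, sample A
sample_a = factor_a + 0.8 * rng.normal(size=(100, n_items))
factor_b = rng.normal(size=(101, 1))                      # shared factor, sample B
sample_b = factor_b + 0.8 * rng.normal(size=(101, n_items))

phi = tucker_phi(first_pc_loadings(sample_a), first_pc_loadings(sample_b))
print(f"Congruence of first components across samples: {phi:.3f}")
```

Congruence coefficients near 1 are conventionally read as evidence that the same component structure holds in both samples; the abstract's question is whether that holds across the general, varsity-athlete, and third samples.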
Peer reviewed
Coniam, David; Lee, Tony; Milanovic, Michael; Pike, Nigel; Zhao, Wen – Language Education & Assessment, 2022
The calibration of test materials generally involves the interaction between empirical analysis and expert judgement. This paper explores the extent to which scale familiarity might affect expert judgement as a component of test validation in the calibration process. It forms part of a larger study that investigates the alignment of the…
Descriptors: Specialists, Language Tests, Test Validity, College Faculty
Peer reviewed
Han, Jing; Koenig, Kathleen; Cui, Lili; Fritchman, Joseph; Li, Dan; Sun, Wanyi; Fu, Zhao; Bao, Lei – Physical Review Physics Education Research, 2016
In a recent study, the 30-question Force Concept Inventory (FCI) was theoretically split into two 14-question "half-length" tests (HFCIs) covering the same set of concepts and producing mean scores that can be equated to those of the original FCI. The HFCIs require less administration time and reduce test-retest issues when different…
Descriptors: Physics, Scientific Concepts, Science Instruction, College Science
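The equating step mentioned in the Han et al. abstract can be sketched with a simple linear equating of half-length test scores onto the full-test scale. The numbers below are simulated rather than FCI data, and the study's actual equating procedure may differ.

```python
import numpy as np

rng = np.random.default_rng(2)

# Simulated percentage-correct scores (hypothetical, not FCI data)
full = np.clip(rng.normal(60, 15, size=300), 0, 100)             # 30-item full test
half = np.clip(0.9 * full + rng.normal(0, 8, size=300), 0, 100)  # 14-item half test

# Linear equating: match the half-test mean and SD to the full-test scale
equated = full.mean() + full.std(ddof=1) / half.std(ddof=1) * (half - half.mean())
print(f"full-test mean {full.mean():.1f}, equated half-test mean {equated.mean():.1f}")
```

Mean equating alone would only shift the half-test scores; the linear form also matches the spread, which matters if scores on the shorter form are more or less variable than on the full test.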
Peer reviewed
Yan, Xun; Maeda, Yukiko; Lv, Jing; Ginther, April – Language Testing, 2016
Elicited imitation (EI) has been widely used to examine second language (L2) proficiency and development and was an especially popular method in the 1970s and early 1980s. However, as the field embraced more communicative approaches to both instruction and assessment, the use of EI diminished, and the construct-related validity of EI scores as a…
Descriptors: Second Language Learning, Language Proficiency, Meta Analysis, Effect Size
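Because the Yan et al. piece is a meta-analysis of elicited imitation validity evidence, a minimal random-effects pooling sketch may help show what aggregating effect sizes involves. The correlations and sample sizes below are invented for illustration, and this is the standard Fisher-z / DerSimonian-Laird recipe, not necessarily the authors' exact procedure.

```python
import numpy as np

# Hypothetical study-level correlations between EI scores and a proficiency criterion
r = np.array([0.55, 0.62, 0.48, 0.70, 0.58])
n = np.array([40, 85, 60, 120, 50])

z = np.arctanh(r)            # Fisher's z transform
v = 1.0 / (n - 3)            # sampling variance of z
w = 1.0 / v                  # fixed-effect weights

# DerSimonian-Laird estimate of between-study variance tau^2
z_fixed = np.sum(w * z) / np.sum(w)
q = np.sum(w * (z - z_fixed) ** 2)
df = len(r) - 1
c = np.sum(w) - np.sum(w ** 2) / np.sum(w)
tau2 = max(0.0, (q - df) / c)

w_star = 1.0 / (v + tau2)                    # random-effects weights
z_pooled = np.sum(w_star * z) / np.sum(w_star)
print(f"pooled correlation: {np.tanh(z_pooled):.3f}, tau^2 = {tau2:.4f}")
```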
Wagemaker, Hans, Ed. – International Association for the Evaluation of Educational Achievement, 2020
Although international large-scale assessment (ILSA) of education, pioneered by the International Association for the Evaluation of Educational Achievement, is now a well-established science, non-practitioners and many users often substantially misunderstand how large-scale assessments are conducted, what questions and challenges they are designed to…
Descriptors: International Assessment, Achievement Tests, Educational Assessment, Comparative Analysis
Mitchell, Alison M.; Truckenmiller, Adrea; Petscher, Yaacov – Communique, 2015
As part of the Race to the Top initiative, the United States Department of Education made nearly 1 billion dollars available in State Educational Technology grants with the goal of ramping up school technology. One result of this effort is that states, districts, and schools across the country are using computerized assessments to measure their…
Descriptors: Computer Assisted Testing, Educational Technology, Testing, Efficiency
Peer reviewed
Li, Hui-Chuan – International Journal of Mathematical Education in Science and Technology, 2014
This study examines students' procedural and conceptual achievement in fraction addition in England and Taiwan. A total of 1209 participants (561 British students and 648 Taiwanese students), aged 12 and 13, took part in the study. A quantitative design based on a self-designed written test was adopted…
Descriptors: Comparative Analysis, Addition, Mathematics Instruction, Foreign Countries
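For readers outside mathematics education, the procedural skill the Li study tests is ordinary common-denominator fraction addition. A worked example follows; the specific item is invented, not taken from the study's written test.

```python
from fractions import Fraction

# Invented item: 1/4 + 1/6 -> common denominator 12 -> 3/12 + 2/12 = 5/12
print(Fraction(3, 12) + Fraction(2, 12))   # 5/12, the rewritten form summed by hand
print(Fraction(1, 4) + Fraction(1, 6))     # Fraction performs the conversion: 5/12
```

Conceptual achievement, by contrast, concerns understanding why a common denominator is needed, which a correct procedural answer alone does not demonstrate.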
Peer reviewed
Hirai, Akiyo; Koizumi, Rie – Language Assessment Quarterly, 2013
In recognition of the rating scale as a crucial tool of performance assessment, this study aims to establish a rating scale suitable for a Story Retelling Speaking Test (SRST), which is a semidirect test of speaking ability in English as a foreign language for classroom use. To identify an appropriate scale, three rating scales, all of which have…
Descriptors: Test Validity, Rating Scales, Story Telling, Speech Tests
Peer reviewed
Carmichael, Jessica A.; Fraccaro, Rebecca L.; Nordstokke, David W. – Canadian Journal of School Psychology, 2014
Oral language skills are important to consider in school psychology practice, as they are directly tied to many areas of academic functioning. For example, research has demonstrated that oral language skills in early elementary school predict reading comprehension in later grades (Kendeou, van den Broek, White, & Lynch, 2009). With a…
Descriptors: Language Tests, Oral Language, Language Skills, School Psychology
Peer reviewed
Newhouse, C. Paul – Technology, Pedagogy and Education, 2015
This paper reports on the outcomes of a three-year study investigating the use of digital technologies to increase the authenticity of high-stakes summative assessment in four Western Australian senior secondary courses. The study involved 82 teachers and 1015 students and a range of digital forms of assessment using computer-based exams, digital…
Descriptors: Educational Technology, High Stakes Tests, Summative Evaluation, Secondary School Students
Peer reviewed
Sussman, Joshua; Beaujean, A. Alexander; Worrell, Frank C.; Watson, Stevie – Measurement and Evaluation in Counseling and Development, 2013
Item response models (IRMs) were used to analyze Cross Racial Identity Scale (CRIS) scores. Rasch analysis scores were compared with classical test theory (CTT) scores. The partial credit model demonstrated a high goodness of fit and correlations between Rasch and CTT scores ranged from 0.91 to 0.99. CRIS scores are supported by both methods.…
Descriptors: Item Response Theory, Test Theory, Measures (Individuals), Racial Identification
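A minimal sketch of the Rasch-versus-CTT comparison that the Sussman et al. abstract reports, here with simulated dichotomous data and a plain Rasch model rather than the partial credit model used for the CRIS: estimate person measures by maximum likelihood and correlate them with CTT sum scores.

```python
import numpy as np

rng = np.random.default_rng(3)

n_persons, n_items = 400, 12
theta = rng.normal(size=n_persons)
b = np.linspace(-2, 2, n_items)                  # hypothetical item difficulties
p = 1 / (1 + np.exp(-(theta[:, None] - b)))      # Rasch response probabilities
x = (rng.random((n_persons, n_items)) < p).astype(int)

sum_scores = x.sum(axis=1)                       # the CTT score is the raw sum

def rasch_theta(raw, difficulties, iters=50):
    """ML person estimate for a given raw score, item difficulties fixed."""
    est = 0.0
    for _ in range(iters):
        pr = 1 / (1 + np.exp(-(est - difficulties)))
        grad = raw - pr.sum()              # first derivative of the log-likelihood
        hess = -np.sum(pr * (1 - pr))      # second derivative (always negative)
        est -= grad / hess                 # Newton-Raphson update
    return est

# Extreme raw scores (0 or all correct) have no finite ML estimate; drop them here.
mask = (sum_scores > 0) & (sum_scores < n_items)
rasch_scores = np.array([rasch_theta(r, b) for r in sum_scores[mask]])
r = np.corrcoef(rasch_scores, sum_scores[mask])[0, 1]
print(f"correlation between Rasch and CTT scores: {r:.3f}")
```

Because Rasch person estimates are a monotone function of the raw score, this correlation is necessarily high; the 0.91 to 0.99 range the abstract reports for the partial credit model is analogous, though that model handles polytomous CRIS items.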
Havlin, Patricia J. – ProQuest LLC, 2013
Writing assessments have taken two primary forms in the past two decades: direct and indirect. Irrespective of type, either form needs to be anchored to classroom decision making and to predicting performance on high-stakes tests, particularly in an environment with serious consequences. In this study, 11th-grade students were…
Descriptors: Writing Evaluation, Grade 11, High School Students, Writing Assignments
Peer reviewed
Heldsinger, Sandra A.; Humphry, Stephen M. – Educational Research, 2013
Background: Many in education argue for the importance of incorporating teacher judgements in the assessment and reporting of student performance. Advocates of such an approach are cognisant, though, that obtaining a satisfactory level of consistency in teacher judgements poses a challenge. Purpose: This study investigates the extent to which the…
Descriptors: Evaluation Methods, Student Evaluation, Teacher Attitudes, Comparative Analysis