ERIC - Search Results

Publication Date

In 2025	1
Since 2024	1
Since 2021 (last 5 years)	4
Since 2016 (last 10 years)	11
Since 2006 (last 20 years)	22

Descriptor

Comparative Analysis	61
Test Validity	61
Scoring	47
Test Reliability	32
Test Construction	14
Scoring Formulas	12
Foreign Countries	11
Testing	11
Statistical Analysis	10
Higher Education	9
Language Tests	9
Correlation	8
Elementary Secondary Education	8
Test Items	8
Computer Assisted Testing	7
Item Analysis	7
Scores	7
Achievement Tests	6
English (Second Language)	6
Evaluation Methods	6
Item Response Theory	6
Response Style (Tests)	6
College Students	5
Multiple Choice Tests	5
Rating Scales	5
More ▼

Publication Type

Reports - Research	33
Journal Articles	27
Reports - Evaluative	11
Speeches/Meeting Papers	5
Reports - Descriptive	4
Information Analyses	2
Tests/Questionnaires	2
Books	1
Collected Works - General	1
Dissertations/Theses -…	1
Guides - General	1
Guides - Non-Classroom	1
Numerical/Quantitative Data	1
More ▼

Education Level

Higher Education	6
Postsecondary Education	6
Secondary Education	4
Elementary Education	3
Elementary Secondary Education	2
High Schools	2
Kindergarten	2
Grade 1	1
Grade 11	1
Grade 2	1
Grade 4	1
Intermediate Grades	1
Junior High Schools	1
Middle Schools	1
More ▼

Audience

Practitioners

Location

Australia	2
Taiwan	2
China	1
Europe	1
Iran	1
Japan	1
Kansas	1
Malawi	1
Netherlands	1
United Kingdom (England)	1
United States	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

Test of English as a Foreign…	3
Trends in International…	2
Wechsler Intelligence Scale…	2
ACT Assessment	1
College and University…	1
Defining Issues Test	1
Draw a Person Test	1
Early Childhood Longitudinal…	1
International Association for…	1
National Teacher Examinations	1
Peabody Picture Vocabulary…	1
Progress in International…	1
SAT (College Admission Test)	1
Strong Vocational Interest…	1
Test of Language Development	1
Woodcock Johnson Tests of…	1
More ▼

What Works Clearinghouse Rating

Showing 1 to 15 of 61 results Save | Export

Initial Evidence Supporting Interpretations of Scores from the Enhanced ACT Test. ACT Research. Research Report. R2425

Download full text

Jeff Allen; Ty Cruce – ACT Education Corp., 2025

This report summarizes some of the evidence supporting interpretations of scores from the enhanced ACT, focusing on reliability, concurrent validity, predictive validity, and score comparability. The authors argue that the evidence presented in this report supports the interpretation of scores from the enhanced ACT as measures of high school…

Descriptors: College Entrance Examinations, Testing, Change, Scores

A Design for Comparing CTT and IRT in Test Assembly, Scoring and Argumentation: Differences among Reliability, Information and Validation

Peer reviewed

Direct link

Alqarni, Abdulelah Mohammed – Journal on Educational Psychology, 2019

This study compares the psychometric properties of reliability in Classical Test Theory (CTT), item information in Item Response Theory (IRT), and validation from the perspective of modern validity theory for the purpose of bringing attention to potential issues that might exist when testing organizations use both test theories in the same testing…

Descriptors: Test Theory, Item Response Theory, Test Construction, Scoring

Scoring Graphical Responses in TIMSS 2019 Using Artificial Neural Networks

Peer reviewed

Direct link

von Davier, Matthias; Tyack, Lillian; Khorramdel, Lale – Educational and Psychological Measurement, 2023

Automated scoring of free drawings or images as responses has yet to be used in large-scale assessments of student achievement. In this study, we propose artificial neural networks to classify these types of graphical responses from a TIMSS 2019 item. We are comparing classification accuracy of convolutional and feed-forward approaches. Our…

Descriptors: Scoring, Networks, Artificial Intelligence, Elementary Secondary Education

TOEFL iBT Iranian Test-Takers' Oral Language Performance: A Comparison between Independent and Integrated Speaking Tasks

Peer reviewed
PDF on ERIC

Download full text

Ariamanesh, Ali A.; Barati, Hossein; Youhanaee, Manijeh – International TESOL Journal, 2022

The present study investigated the speaking module of TOEFL iBT with an emphasis on the dichotomy of independent and integrated tasks. The potential differences between the two speaking conditions were intended to be explored based on the oral performance elicited from a group of Iranian test takers. To collect the required data, a simulated…

Descriptors: Second Language Learning, English (Second Language), Language Tests, Computer Assisted Testing

Factor Structure, Stability, and Congruence in the Functional Movement Screen

Peer reviewed

Direct link

Kelleher, Leila K.; Beach, Tyson A. C.; Frost, David M.; Johnson, Andrew M.; Dickey, James P. – Measurement in Physical Education and Exercise Science, 2018

The scoring scheme for the functional movement screen implicitly assumes that the factor structure is consistent, stable, and congruent across different populations. To determine if this is the case, we compared principal components analyses of three samples: a healthy, general population (n = 100), a group of varsity athletes (n = 101), and a…

Descriptors: Factor Structure, Test Reliability, Screening Tests, Motion

The Role of Expert Judgement in Language Test Validation

Peer reviewed
PDF on ERIC

Download full text

Coniam, David; Lee, Tony; Milanovic, Michael; Pike, Nigel; Zhao, Wen – Language Education & Assessment, 2022

The calibration of test materials generally involves the interaction between empirical analysis and expert judgement. This paper explores the extent to which scale familiarity might affect expert judgement as a component of test validation in the calibration process. It forms part of a larger study that investigates the alignment of the…

Descriptors: Specialists, Language Tests, Test Validity, College Faculty

Experimental Validation of the Half-Length Force Concept Inventory

Peer reviewed

Direct link

Han, Jing; Koenig, Kathleen; Cui, Lili; Fritchman, Joseph; Li, Dan; Sun, Wanyi; Fu, Zhao; Bao, Lei – Physical Review Physics Education Research, 2016

In a recent study, the 30-question Force Concept Inventory (FCI) was theoretically split into two 14-question "half-length" tests (HFCIs) covering the same set of concepts and producing mean scores that can be equated to those of the original FCI. The HFCIs require less administration time and reduce test-retest issues when different…

Descriptors: Physics, Scientific Concepts, Science Instruction, College Science

Evidence of Middle School Science Assessment Practice from Classroom-Based Portfolios

Peer reviewed

Direct link

Kloser, Matthew; Borko, Hilda; Martinez, Jose Felipe; Stecher, Brian; Luskin, Rebecca – Science Education, 2017

Assessments are powerful tools for informing teachers and students about where student thinking stands with relation to a learning goal. Yet, few studies provide qualitative analyses of assessment practice across a unit. This study uses a framework of nine dimensions of effective assessment practice in science classrooms to compare more and less…

Descriptors: Secondary School Science, Evidence, Portfolio Assessment, Middle School Teachers

Elicited Imitation as a Measure of Second Language Proficiency: A Narrative Review and Meta-Analysis

Peer reviewed

Direct link

Yan, Xun; Maeda, Yukiko; Lv, Jing; Ginther, April – Language Testing, 2016

Elicited imitation (EI) has been widely used to examine second language (L2) proficiency and development and was an especially popular method in the 1970s and early 1980s. However, as the field embraced more communicative approaches to both instruction and assessment, the use of EI diminished, and the construct-related validity of EI scores as a…

Descriptors: Second Language Learning, Language Proficiency, Meta Analysis, Effect Size

Reliability and Validity of International Large-Scale Assessment: Understanding IEA's Comparative Studies of Student Achievement. IEA Research for Education. Volume 10

Download full text

Wagemaker, Hans, Ed. – International Association for the Evaluation of Educational Achievement, 2020

Although International Association for the Evaluation of Educational Achievement-pioneered international large-scale assessment (ILSA) of education is now a well-established science, non-practitioners and many users often substantially misunderstand how large-scale assessments are conducted, what questions and challenges they are designed to…

Descriptors: International Assessment, Achievement Tests, Educational Assessment, Comparative Analysis

Computer-Adaptive Assessments: Fundamentals and Considerations

Direct link

Mitchell, Alison M.; Truckenmiller, Adrea; Petscher, Yaacov – Communique, 2015

As part of the Race to the Top initiative, the United States Department of Education made nearly 1 billion dollars available in State Educational Technology grants with the goal of ramping up school technology. One result of this effort is that states, districts, and schools across the country are using computerized assessments to measure their…

Descriptors: Computer Assisted Testing, Educational Technology, Testing, Efficiency

A Comparative Analysis of British and Taiwanese Students' Conceptual and Procedural Knowledge of Fraction Addition

Peer reviewed

Direct link

Li, Hui-Chuan – International Journal of Mathematical Education in Science and Technology, 2014

This study examines students' procedural and conceptual achievement in fraction addition in England and Taiwan. A total of 1209 participants (561 British students and 648 Taiwanese students) at ages 12 and 13 were recruited from England and Taiwan to take part in the study. A quantitative design by means of a self-designed written test is adopted…

Descriptors: Comparative Analysis, Addition, Mathematics Instruction, Foreign Countries

Validation of Empirically Derived Rating Scales for a Story Retelling Speaking Test

Peer reviewed

Direct link

Hirai, Akiyo; Koizumi, Rie – Language Assessment Quarterly, 2013

In recognition of the rating scale as a crucial tool of performance assessment, this study aims to establish a rating scale suitable for a Story Retelling Speaking Test (SRST), which is a semidirect test of speaking ability in English as a foreign language for classroom use. To identify an appropriate scale, three rating scales, all of which have…

Descriptors: Test Validity, Rating Scales, Story Telling, Speech Tests

Test Review: "Test of Language Development-Intermediate" by D. D. Hammill and P. L. Newcomer

Peer reviewed

Direct link

Carmichael, Jessica A.; Fraccaro, Rebecca L.; Nordstokke, David W. – Canadian Journal of School Psychology, 2014

Oral language skills are important to consider in school psychology practice, as they are directly tied to many areas of academic functioning. For example, research has demonstrated that oral language skills in early elementary school predict reading comprehension in later grades (Kendeou, van den Broek, White, & Lynch, 2009). With a…

Descriptors: Language Tests, Oral Language, Language Skills, School Psychology

Using Digital Technologies to Improve the Authenticity of Performance Assessment for High-Stakes Purposes

Peer reviewed

Direct link

Newhouse, C. Paul – Technology, Pedagogy and Education, 2015

This paper reports on the outcomes of a three-year study investigating the use of digital technologies to increase the authenticity of high-stakes summative assessment in four Western Australian senior secondary courses. The study involved 82 teachers and 1015 students and a range of digital forms of assessment using computer-based exams, digital…

Descriptors: Educational Technology, High Stakes Tests, Summative Evaluation, Secondary School Students

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5

Journal of Educational…	5
Educational and Psychological…	2
Language Testing	2
ACT Education Corp.	1
Art Therapy: Journal of the…	1
Bilingual Review	1
Canadian Journal of School…	1
Communique	1
Education Policy Analysis…	1
Educational Research	1
International Association for…	1
International Journal of…	1
International TESOL Journal	1
Journal of Autism and…	1
Journal of Clinical Psychology	1
Journal of Educational…	1
Journal of School Psychology	1
Journal on Educational…	1
Language Assessment Quarterly	1
Language Education &…	1
Measurement and Evaluation in…	1
Measurement in Physical…	1
National Center for Education…	1
Physical Review Physics…	1
ProQuest LLC	1
More ▼

Hakstian, A. Ralph	2
Kansup, Wanlop	2
Weiss, David J.	2
Alqarni, Abdulelah Mohammed	1
Ariamanesh, Ali A.	1
August, Diane	1
Bao, Lei	1
Barati, Hossein	1
Beach, Tyson A. C.	1
Beaujean, A. Alexander	1
Blaker, Lisa	1
Borko, Hilda	1
Bowman, Harry L.	1
Braden, Jeffery P.	1
Carlo, Maria	1
Carmichael, Jessica A.	1
Chakwera, Elias	1
Clariana, Roy B.	1
Coniam, David	1
Cui, Lili	1
Des Brisay, Margaret	1
Dickey, James P.	1
Donlon, Thomas F.	1
Downey, Ronald G.	1
More ▼