ERIC - Search Results

Publication Date

In 2026	0
Since 2025	4
Since 2022 (last 5 years)	21
Since 2017 (last 10 years)	42
Since 2007 (last 20 years)	66

Descriptor

Test Reliability	248
Testing	248
Test Validity	140
Test Construction	76
Higher Education	44
Foreign Countries	43
Language Tests	37
Comparative Analysis	33
English (Second Language)	33
Evaluation Methods	32
Second Language Learning	31
Achievement Tests	29
Scores	29
Scoring	28
Measurement Techniques	27
Student Evaluation	27
Testing Problems	26
Language Proficiency	23
Standardized Tests	22
Test Interpretation	22
Test Items	22
Elementary Secondary Education	20
Statistical Analysis	20
College Students	19
Psychometrics	19
More ▼

Publication Type

Reports - Research	248
Journal Articles	122
Speeches/Meeting Papers	21
Tests/Questionnaires	11
Information Analyses	3
Numerical/Quantitative Data	3
Reports - Descriptive	3
Collected Works - General	2
Guides - Non-Classroom	2
Books	1
Guides - Classroom - Teacher	1
Opinion Papers	1
More ▼

Audience

Practitioners	7
Teachers	5
Researchers	3
Administrators	1
Counselors	1

Location

Canada	4
Illinois	4
Iran	4
China	3
Japan	3
Ohio	3
Pennsylvania	3
Turkey	3
United Kingdom	3
Australia	2
Bangladesh	2
Indonesia	2
Maryland	2
New Zealand	2
South Africa	2
United Kingdom (England)	2
Argentina	1
Austria	1
Brazil	1
California	1
Cyprus	1
Delaware	1
Europe	1
Finland	1
Florida	1
More ▼

Laws, Policies, & Programs

Elementary and Secondary…	2
Bilingual Education Act 1968	1
Elementary and Secondary…	1
Elementary and Secondary…	1
Elementary and Secondary…	1
No Child Left Behind Act 2001	1

What Works Clearinghouse Rating

Showing 1 to 15 of 248 results Save | Export

Modeling the Intraindividual Relation of Ability and Speed within a Test

Peer reviewed

Direct link

Augustin Mutak; Robert Krause; Esther Ulitzsch; Sören Much; Jochen Ranger; Steffi Pohl – Journal of Educational Measurement, 2024

Understanding the intraindividual relation between an individual's speed and ability in testing scenarios is essential to assure a fair assessment. Different approaches exist for estimating this relationship, that either rely on specific study designs or on specific assumptions. This paper aims to add to the toolbox of approaches for estimating…

Descriptors: Testing, Academic Ability, Time on Task, Correlation

A Theoretical Suggestion on Testing Measurement Invariance in Adapting Parametric Measurement Tools

Peer reviewed
PDF on ERIC

Download full text

Gökhan Iskifoglu – Turkish Online Journal of Educational Technology - TOJET, 2024

This research paper investigated the importance of conducting measurement invariance analysis in developing measurement tools for assessing differences between and among study variables. Most of the studies, which tended to develop an inventory to assess the existence of an attitude, behavior, belief, IQ, or an intuition in a person's…

Descriptors: Testing, Testing Problems, Error of Measurement, Attitude Measures

A Practical Comparison of Decision Consistency Estimates

Peer reviewed
PDF on ERIC

Download full text

Amanda A. Wolkowitz; Russell Smith – Practical Assessment, Research & Evaluation, 2024

A decision consistency (DC) index is an estimate of the consistency of a classification decision on an exam. More specifically, DC estimates the percentage of examinees that would have the same classification decision on an exam if they were to retake the same or a parallel form of the exam again without memory of taking the exam the first time.…

Descriptors: Testing, Test Reliability, Replication (Evaluation), Decision Making

The Sensitivity of Value-Added Estimates to Test Scoring Decisions. EdWorkingPaper No. 25-1226

Download full text

Joshua B. Gilbert; James G. Soland; Benjamin W. Domingue – Annenberg Institute for School Reform at Brown University, 2025

Value-Added Models (VAMs) are both common and controversial in education policy and accountability research. While the sensitivity of VAMs to model specification and covariate selection is well documented, the extent to which test scoring methods (e.g., mean scores vs. IRT-based scores) may affect VA estimates is less studied. We examine the…

Descriptors: Value Added Models, Tests, Testing, Scoring

Parents Can Accurately and Reliably Administer an Online Dyslexia Evaluation Tool

Peer reviewed

Direct link

Hurford, David P.; Wines, Autumn – Australian Journal of Learning Difficulties, 2022

The purpose of the present study was to examine the potential that parents could effectively administer an online dyslexia evaluation tool (ODET) to their children. To this end, four groups consisting of parents and trained staff were compared. Sixty-three children (36 females and 27 males) participated. The children in each group were assessed…

Descriptors: Test Reliability, Computer Assisted Testing, Dyslexia, Screening Tests

The Use of ChatGPT in Assessment

Peer reviewed
PDF on ERIC

Download full text

Mehmet Kanik – International Journal of Assessment Tools in Education, 2024

ChatGPT has surged interest to cause people to look for its use in different tasks. However, before allowing it to replace humans, its capabilities should be investigated. As ChatGPT has potential for use in testing and assessment, this study aims to investigate the questions generated by ChatGPT by comparing them to those written by a course…

Descriptors: Artificial Intelligence, Testing, Multiple Choice Tests, Test Construction

Examining the Relationship between Randomization Strategies and Control Group Crossover in Higher Education Interventions. EdWorkingPaper No. 24-1083

Download full text

Catherine Mata; Katharine Meyer; Lindsay Page – Annenberg Institute for School Reform at Brown University, 2024

This article examines the risk of crossover contamination in individual-level randomization, a common concern in experimental research, in the context of a large-enrollment college course. While individual-level randomization is more efficient for assessing program effectiveness, it also increases the potential for control group students to cross…

Descriptors: Chemistry, Science Instruction, Undergraduate Students, Large Group Instruction

Can the Oral Proficiency Interview -- Computer (ACTFL OPIc) Be Used Instead of the Oral Proficiency Interview (ACTFL OPI)? An Aligned Rank Transform (ART) Analysis

Peer reviewed

Direct link

Troy L. Cox; Gregory L. Thompson; Steven S. Stokes – Foreign Language Annals, 2025

This study investigated the differences between the ACTFL Oral Proficiency Interview (OPI) and the ACTFL Oral Proficiency Interview - Computer (OPIc) among Spanish learners at a U.S. university. Participants (N = 154) were randomly assigned to take both tests in a counterbalanced order to mitigate test order effects. Data were analyzed using an…

Descriptors: Oral Language, Language Proficiency, Interviews, Computer Uses in Education

Initial Evidence Supporting Interpretations of Scores from the Enhanced ACT Test. ACT Research. Research Report. R2425

Download full text

Jeff Allen; Ty Cruce – ACT Education Corp., 2025

This report summarizes some of the evidence supporting interpretations of scores from the enhanced ACT, focusing on reliability, concurrent validity, predictive validity, and score comparability. The authors argue that the evidence presented in this report supports the interpretation of scores from the enhanced ACT as measures of high school…

Descriptors: College Entrance Examinations, Testing, Change, Scores

Using Multilabel Neural Network to Score High-Dimensional Assessments for Different Use Foci: An Example with College Major Preference Assessment

Peer reviewed

Direct link

Shun-Fu Hu; Amery D. Wu; Jake Stone – Journal of Educational Measurement, 2025

Scoring high-dimensional assessments (e.g., > 15 traits) can be a challenging task. This paper introduces the multilabel neural network (MNN) as a scoring method for high-dimensional assessments. Additionally, it demonstrates how MNN can score the same test responses to maximize different performance metrics, such as accuracy, recall, or…

Descriptors: Tests, Testing, Scores, Test Construction

The Effects of Attentional Focus on Test-Retest Reliability of Jumping Tasks

Peer reviewed

Direct link

Makaruk, Hubert; Porter, Jared M.; Cieslinski, Igor – Measurement in Physical Education and Exercise Science, 2021

This study examined the test-retest reliability of the standing long jump (SLJ) and the countermovement jump (CMJ) following consistent and non-consistent attentional focus cuing instructions in physically active young adults (n = 30). The systematic error (as standardize change in mean), random error (as typical error), the Bland and Altman…

Descriptors: Attention Control, Test Reliability, Performance Tests, Physical Activities

Constructing and Validating a Code of Ethics in Testing Inventory: Investigating EFL Instructors' Perspectives

Peer reviewed

Direct link

Mansooreh Hosseinnia; Zahra Kafi – Language Testing in Asia, 2024

As testing involves various aspects of education as well as the ones who are involved like instructors, students, managers, teacher trainers, testers, and decision-makers, it comes to be highly crucial to develop ethical tests. In addition, as some methods of testing are more favored and practiced compared to others without considering the ethical…

Descriptors: Test Construction, Test Validity, Ethics, Testing

Rethinking Online Assessment Quality from Pre-Service Teachers Perspectives

Peer reviewed
PDF on ERIC

Download full text

Mücahit Öztürk – Open Praxis, 2024

This study examined the problems that pre-service teachers face in the online assessment process and their suggestions for solutions to these problems. The participants were 136 pre-service teachers who have been experiencing online assessment for a long time and who took the Foundations of Open and Distance Learning course. This research is a…

Descriptors: Foreign Countries, Preservice Teacher Education, Preservice Teachers, Distance Education

Assessment of Multiple Choice Question Exams Quality Using Graphical Methods

Peer reviewed
PDF on ERIC

Download full text

Yousuf, Mustafa S.; Miles, Katherine; Harvey, Heather; Al-Tamimi, Mohammad; Badran, Darwish – Journal of University Teaching and Learning Practice, 2022

Exams should be valid, reliable, and discriminative. Multiple informative methods are used for exam analysis. Displaying analysis results numerically, however, may not be easily comprehended. Using graphical analysis tools could be better for the perception of analysis results. Two such methods were employed: standardized x-bar control charts with…

Descriptors: Multiple Choice Tests, Testing, Test Reliability, Test Validity

Practices in Instrument Use and Development in "Chemistry Education Research and Practice" 2010-2021

Peer reviewed

Direct link

Lazenby, Katherine; Tenney, Kristin; Marcroft, Tina A.; Komperda, Regis – Chemistry Education Research and Practice, 2023

Assessment instruments that generate quantitative data on attributes (cognitive, affective, behavioral, "etc.") of participants are commonly used in the chemistry education community to draw conclusions in research studies or inform practice. Recently, articles and editorials have stressed the importance of providing evidence for the…

Descriptors: Chemistry, Periodicals, Journal Articles, Science Education

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 17

Language Testing	8
Journal of Educational…	6
Language Learning	4
Early Child Development and…	3
Regional Educational…	3
System	3
Advances in Health Sciences…	2
Advances in Language and…	2
Annenberg Institute for…	2
ESL Magazine	2
ETS Research Report Series	2
Educational and Psychological…	2
Journal of Clinical Psychology	2
Journal of Communication…	2
Journal of Experimental…	2
Journal of Learning…	2
Journal of Visual Impairment…	2
Language, Speech, and Hearing…	2
Measurement and Evaluation in…	2
Practical Assessment,…	2
ACT Education Corp.	1
Academic Medicine	1
Administration and Policy in…	1
American Annals of the Deaf	1
American Journal of Distance…	1
More ▼

Weiss, David J.	4
Gallas, Edwin J.	3
Kapes, Jerome T.	3
Ackerman, Debra J.	2
Feldt, Leonard S.	2
Fernandes, Kathleen	2
Hurford, David P.	2
Rose, Andrew M.	2
Russell, Nolan F.	2
Schrader, William B.	2
Vansickle, Timothy R.	2
Adams, R. J.	1
Ahmed, Md. Kawser	1
Ajjawi, Rola	1
Al Hajri, Fatma	1
Al-Tamimi, Mohammad	1
Alderson, J. Charles	1
Ali, Md. Maksud	1
Allen, Ted W.	1
Allen, Thomas E.	1
Alper Gülay	1
Altshuler, Joan	1
Amanda A. Wolkowitz	1
Amery D. Wu	1
More ▼

Higher Education	24
Postsecondary Education	20
Elementary Education	8
Early Childhood Education	7
Kindergarten	5
Primary Education	5
Secondary Education	5
Elementary Secondary Education	4
Junior High Schools	3
Middle Schools	3
Preschool Education	3
Adult Education	2
High Schools	2
Intermediate Grades	2
Grade 4	1
Grade 5	1
Grade 6	1
Grade 7	1
Grade 8	1
More ▼

Bayley Scales of Infant…	2
General Aptitude Test Battery	2
Minnesota Multiphasic…	2
Stanford Achievement Tests	2
State Trait Anxiety Inventory	2
ACT Assessment	1
ACTFL Oral Proficiency…	1
Adjective Check List	1
Battelle Developmental…	1
Beck Anxiety Inventory	1
Beck Depression Inventory	1
Bem Sex Role Inventory	1
California Achievement Tests	1
California Critical Thinking…	1
Career Development Inventory	1
Center for Epidemiologic…	1
Clinical Evaluation of…	1
Comprehensive Tests of Basic…	1
Defining Issues Test	1
Denver Developmental…	1
Developmental Indicators for…	1
Florida Comprehensive…	1
Gates MacGinitie Reading Tests	1
General Educational…	1
Graduate Management Admission…	1
More ▼