ERIC - Search Results

Publication Date

In 2025	1
Since 2024	9
Since 2021 (last 5 years)	26
Since 2016 (last 10 years)	62
Since 2006 (last 20 years)	176

Descriptor

Test Validity	429
Testing	429
Test Reliability	181
Language Tests	111
Test Construction	99
Foreign Countries	85
Second Language Learning	77
Standardized Tests	66
Test Interpretation	62
English (Second Language)	58
Scores	58
Test Format	57
Scoring	53
Higher Education	52
Student Evaluation	52
Elementary Secondary Education	48
Test Reviews	45
Language Proficiency	43
Evaluation Methods	38
Testing Problems	37
Screening Tests	36
Test Items	36
Intelligence Tests	35
Test Content	35
Computer Assisted Testing	32

Publication Type

Journal Articles	429
Reports - Research	172
Reports - Evaluative	89
Reports - Descriptive	83
Opinion Papers	67
Information Analyses	38
Tests/Questionnaires	12
Guides - Non-Classroom	7
Guides - Classroom - Teacher	4
Speeches/Meeting Papers	4
Reports - General	2
Book/Product Reviews	1
Guides - General	1
Historical Materials	1
Reference Materials - General	1

Education Level

Higher Education	36
Postsecondary Education	27
Elementary Secondary Education	23
Secondary Education	11
Elementary Education	5
Early Childhood Education	4
Grade 5	4
High Schools	4
Kindergarten	4
Adult Education	2
Grade 1	2
Grade 3	2
Grade 4	2
Grade 7	2
Grade 8	2
Grade 9	2
Primary Education	2
Grade 10	1
Grade 12	1
Grade 2	1
Grade 6	1
Intermediate Grades	1
Junior High Schools	1
Middle Schools	1
Preschool Education	1

Audience

Practitioners	14
Researchers	9
Teachers	8
Administrators	2

Location

Canada	12
China	9
United Kingdom	8
United Kingdom (England)	6
Japan	5
Australia	4
Malaysia	4
United States	4
Brazil	3
Iran	3
Arizona	2
Bangladesh	2
California	2
India	2
Indonesia	2
New York	2
North Carolina	2
Pennsylvania	2
South Africa	2
Sweden	2
Texas	2
Thailand	2
Africa	1
Argentina	1
Arkansas	1

Laws, Policies, & Programs

No Child Left Behind Act 2001	5
Education for All Handicapped…	1
Elementary and Secondary…	1
Lau v Nichols	1
Rehabilitation Act 1973…	1

Showing 1 to 15 of 429 results

A Theoretical Suggestion on Testing Measurement Invariance in Adapting Parametric Measurement Tools

Peer reviewed
PDF on ERIC

Gökhan Iskifoglu – Turkish Online Journal of Educational Technology - TOJET, 2024

This research paper investigated the importance of conducting measurement invariance analysis in developing measurement tools for assessing differences between and among study variables. Most of the studies, which tended to develop an inventory to assess the existence of an attitude, behavior, belief, IQ, or an intuition in a person's…

Descriptors: Testing, Testing Problems, Error of Measurement, Attitude Measures

Item Response Theory Models for Difference-in-Difference Estimates (And Whether They Are Worth the Trouble)

Peer reviewed

James Soland – Journal of Research on Educational Effectiveness, 2024

When randomized control trials are not possible, quasi-experimental methods often represent the gold standard. One quasi-experimental method is difference-in-difference (DiD), which compares changes in outcomes before and after treatment across groups to estimate a causal effect. DiD researchers often use fairly exhaustive robustness checks to…

Descriptors: Item Response Theory, Testing, Test Validity, Intervention

An Examination of Classification Accuracy in the Continuous Testing Framework

Peer reviewed

Coggeshall, Whitney Smiley – Educational Measurement: Issues and Practice, 2021

The continuous testing framework, where both successful and unsuccessful examinees have to demonstrate continued proficiency at frequent prespecified intervals, is a framework that is used in noncognitive assessment and is gaining in popularity in cognitive assessment. Despite the rigorous advantages of this framework, this paper demonstrates that…

Descriptors: Classification, Accuracy, Testing, Failure

Applying a Mixture Rasch Model-Based Approach to Standard Setting

Peer reviewed

Peabody, Michael R.; Muckle, Timothy J.; Meng, Yu – Educational Measurement: Issues and Practice, 2023

The subjective aspect of standard-setting is often criticized, yet data-driven standard-setting methods are rarely applied. Therefore, we applied a mixture Rasch model approach to setting performance standards across several testing programs of various sizes and compared the results to existing passing standards derived from traditional…

Descriptors: Item Response Theory, Standard Setting, Testing, Sampling

Test Review: Raven's 2 Progressive Matrices, Clinical Edition (Raven's 2)

Peer reviewed

McLeod, Justin W.H.; McCrimmon, Adam W. – Journal of Psychoeducational Assessment, 2021

The "Raven's 2 Progressive Matrices Clinical Edition" (Raven's 2; Raven, Rust, Chan, & Zhou, 2018), published by NCS Pearson, is an individually administered nonverbal assessment of general cognitive ability developed to measure "educative abilities," defined as the ability to think clearly and solve complex problems in…

Descriptors: Test Reviews, Intelligence Tests, Testing, Test Reliability

A Dialectic on Validity: Explanation-Focused and the Many Ways of Being Human

Peer reviewed
PDF on ERIC

Bruno D. Zumbo – International Journal of Assessment Tools in Education, 2023

In line with the journal volume's theme, this essay considers lessons from the past and visions for the future of test validity. In the first part of the essay, a description of historical trends in test validity since the early 1900s leads to the natural question of whether the discipline has progressed in its definition and description of test…

Descriptors: Test Theory, Test Validity, True Scores, Definitions

Selecting Technically Adequate Tests

Peer reviewed

Susan K. Johnsen – Gifted Child Today, 2024

The author provides a checklist for educators who are selecting technically adequate tests for identifying and referring students for gifted education services and programs. The checklist includes questions related to how the test was normed, reliability and validity studies as well as questions related to types of scores, administration, and…

Descriptors: Test Selection, Academically Gifted, Gifted Education, Test Validity

Reflecting on the Relevance of Drawing as a Tool in Eliciting Pre-Service Teachers' Preconceptions of Human Organs and Organ Systems

Peer reviewed

Ian Phil Canlas; Joyce Molino-Magtolis – Journal of Biological Education, 2024

The use of drawing as an assessment tool to reveal students' conceptions in biology, specifically on human organs and organ systems, is not new; however, there is a deficit in the literature that attempted to explore and reflect on its usefulness and relevance, specifically in eliciting students' preconceptions related thereto. Making use of a…

Descriptors: Foreign Countries, Preservice Teacher Education, Preservice Teachers, Biology

Test-Taker Engagement in AI Technology-Mediated Language Assessment

Peer reviewed

Yan Jin; Jason Fan – Language Assessment Quarterly, 2023

In language assessment, AI technology has been incorporated in task design, assessment delivery, automated scoring of performance-based tasks, score reporting, and provision of feedback. AI technology is also used for collecting and analyzing performance data in language assessment validation. Research has been conducted to investigate the…

Descriptors: Language Tests, Artificial Intelligence, Computer Assisted Testing, Test Format

Interpreting Testing and Assessment: A State-of-the-Art Review

Peer reviewed

Han, Chao – Language Testing, 2022

Over the past decade, testing and assessing spoken-language interpreting has garnered an increasing amount of attention from stakeholders in interpreter education, professional certification, and interpreting research. This is because in these fields assessment results provide a critical evidential basis for high-stakes decisions, such as the…

Descriptors: Translation, Language Tests, Testing, Evaluation Methods

Measuring Test-Taking Effort on Constructed-Response Items with Item Response Time and Number of Actions

Peer reviewed
PDF on ERIC

Militsa G. Ivanova; Michalis P. Michaelides – Practical Assessment, Research & Evaluation, 2023

Research on methods for measuring examinee engagement with constructed-response items is limited. The present study used data from the PISA 2018 Reading domain to construct and compare indicators of test-taking effort on constructed-response items: response time, number of actions, the union (combining effortless responses detected by either…

Descriptors: Achievement Tests, Foreign Countries, International Assessment, Secondary School Students

Using Multilabel Neural Network to Score High-Dimensional Assessments for Different Use Foci: An Example with College Major Preference Assessment

Peer reviewed

Shun-Fu Hu; Amery D. Wu; Jake Stone – Journal of Educational Measurement, 2025

Scoring high-dimensional assessments (e.g., > 15 traits) can be a challenging task. This paper introduces the multilabel neural network (MNN) as a scoring method for high-dimensional assessments. Additionally, it demonstrates how MNN can score the same test responses to maximize different performance metrics, such as accuracy, recall, or…

Descriptors: Tests, Testing, Scores, Test Construction

Response Process Evidence for Academic Assessments of Students with Significant Cognitive Disabilities

Peer reviewed
PDF on ERIC

Meagan Karvonen; Russell Swinburne Romine; Amy K. Clark – Practical Assessment, Research & Evaluation, 2024

This paper describes methods and findings from student cognitive labs, teacher cognitive labs, and test administration observations as evidence evaluated in a validity argument for a computer-based alternate assessment for students with significant cognitive disabilities. Validity of score interpretations and uses for alternate assessments based…

Descriptors: Students with Disabilities, Intellectual Disability, Severe Disabilities, Student Evaluation

Constructing and Validating a Code of Ethics in Testing Inventory: Investigating EFL Instructors' Perspectives

Peer reviewed

Mansooreh Hosseinnia; Zahra Kafi – Language Testing in Asia, 2024

As testing involves various aspects of education as well as the ones who are involved like instructors, students, managers, teacher trainers, testers, and decision-makers, it comes to be highly crucial to develop ethical tests. In addition, as some methods of testing are more favored and practiced compared to others without considering the ethical…

Descriptors: Test Construction, Test Validity, Ethics, Testing

Assessment of Multiple Choice Question Exams Quality Using Graphical Methods

Peer reviewed
PDF on ERIC

Yousuf, Mustafa S.; Miles, Katherine; Harvey, Heather; Al-Tamimi, Mohammad; Badran, Darwish – Journal of University Teaching and Learning Practice, 2022

Exams should be valid, reliable, and discriminative. Multiple informative methods are used for exam analysis. Displaying analysis results numerically, however, may not be easily comprehended. Using graphical analysis tools could be better for the perception of analysis results. Two such methods were employed: standardized x-bar control charts with…

Descriptors: Multiple Choice Tests, Testing, Test Reliability, Test Validity

Source

Language Testing	42
Diagnostique	28
Journal of Psychoeducational…	22
Canadian Journal of School…	7
Journal of Research in…	7
Language Assessment Quarterly	7
Educational Researcher	6
Measurement:…	6
Educational and Psychological…	5
International Journal of…	5
System	5
Alberta Journal of…	4
Assessment for Effective…	4
Canadian Modern Language…	4
ETS Research Report Series	4
Educational Studies	4
Practical Assessment,…	4
Psychology in the Schools	4
TESOL Quarterly	4
Assessment and Evaluation in…	3
Educational Measurement:…	3
Foreign Language Annals	3
Journal of Learning…	3
Journal of School Psychology	3
Journal of Special Education	3

Author

McCrimmon, Adam W.	6
Chapelle, Carol A.	3
Kane, Michael	3
Milton, Ohmer	3
Ackerman, Debra J.	2
Chalhoub-Deville, Micheline	2
Chambers, Francine	2
Cheng, Liying	2
Cziko, Gary A.	2
Davies, Alan	2
Dickens, Rachel H.	2
Dunne, Michael P.	2
Embretson, Susan E.	2
Fuchs, Lynn S.	2
Fulcher, Glenn	2
Goh, Pauline Swee Choo	2
Henning, Grant	2
Hudson, Thom	2
Klein-Braley, Christine	2
McNamara, Tim	2
Meisinger, Elizabeth B.	2
Mislevy, Robert J.	2
Modu, Christopher C.	2
Moss, Pamela A.	2

Assessments and Surveys

Wechsler Intelligence Scale…	13
Test of English as a Foreign…	7
Kaufman Assessment Battery…	5
Peabody Picture Vocabulary…	5
SAT (College Admission Test)	5
Raven Progressive Matrices	4
Wechsler Adult Intelligence…	4
Vineland Adaptive Behavior…	3
Woodcock Johnson Tests of…	3
ACTFL Oral Proficiency…	2
Advanced Placement…	2
Armed Services Vocational…	2
Battelle Developmental…	2
Bayley Scales of Infant…	2
Clinical Evaluation of…	2
Developmental Indicators for…	2
International English…	2
Kaufman Test of Educational…	2
Peabody Individual…	2
Program for International…	2
Test of English for…	2
Autism Diagnostic Observation…	1
Beery Developmental Test of…	1
Behavior Assessment System…	1
Bender Gestalt Test	1