Publication Date | Records
In 2025 | 3
Since 2024 | 10
Since 2021 (last 5 years) | 22
Since 2016 (last 10 years) | 48
Since 2006 (last 20 years) | 79
Audience | Records
Practitioners | 13
Researchers | 10
Teachers | 4
Administrators | 1
Counselors | 1
Location | Records
Canada | 7
China | 7
California | 5
Australia | 4
Illinois | 4
Iran | 4
United Kingdom | 4
Brazil | 3
Japan | 3
Malaysia | 3
Ohio | 3
Laws, Policies, & Programs | Records
No Child Left Behind Act 2001 | 2
Elementary and Secondary… | 1
Elementary and Secondary… | 1
Rehabilitation Act 1973… | 1
Sherwin E. Balbuena – Online Submission, 2024
This study introduces a new chi-square test statistic for testing the equality of response frequencies among distracters in multiple-choice tests. The formula uses the numbers of correct and wrong answers as the basis for calculating the expected response frequencies per distracter. The method was…
Descriptors: Multiple Choice Tests, Statistics, Test Validity, Testing
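A minimal, hypothetical illustration of the general idea behind such a test, checking whether wrong answers spread evenly across one item's distracters using the standard scipy chi-square routine; the counts are made up, and this is not the new statistic the study proposes:

```python
# Hypothetical example: generic chi-square test of equal distracter
# attractiveness for one multiple-choice item (not the study's new statistic).
from scipy.stats import chisquare

wrong_counts = [18, 9, 13]      # examinees choosing distracters B, C, D on a 4-option item
expected = [sum(wrong_counts) / len(wrong_counts)] * len(wrong_counts)

stat, p = chisquare(wrong_counts, f_exp=expected)
print(f"chi-square = {stat:.2f}, p = {p:.3f}")   # small p: distracters are not equally attractive
```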
Gökhan Iskifoglu – Turkish Online Journal of Educational Technology - TOJET, 2024
This research paper investigated the importance of conducting measurement invariance analysis when developing measurement tools for assessing differences between and among study variables. Most studies that develop an inventory to assess an attitude, behavior, belief, IQ, or intuition in a person's…
Descriptors: Testing, Testing Problems, Error of Measurement, Attitude Measures
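For background, measurement invariance is typically examined as a sequence of increasingly constrained multi-group factor models. The hierarchy below is the standard one and is offered only as context, not as this paper's specific analysis (group index $g$, loadings $\Lambda$, intercepts $\tau$, residual variances $\Theta$):

```latex
\begin{aligned}
&\text{Configural:} && \text{same factor structure in every group } g\\
&\text{Metric:}     && \Lambda^{(g)} = \Lambda\\
&\text{Scalar:}     && \Lambda^{(g)} = \Lambda,\ \tau^{(g)} = \tau\\
&\text{Strict:}     && \Lambda^{(g)} = \Lambda,\ \tau^{(g)} = \tau,\ \Theta^{(g)} = \Theta
\end{aligned}
```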
James Soland – Journal of Research on Educational Effectiveness, 2024
When randomized control trials are not possible, quasi-experimental methods often represent the gold standard. One quasi-experimental method is difference-in-difference (DiD), which compares changes in outcomes before and after treatment across groups to estimate a causal effect. DiD researchers often use fairly exhaustive robustness checks to…
Descriptors: Item Response Theory, Testing, Test Validity, Intervention
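As a reminder of the basic setup described in the entry above, the canonical two-group, two-period DiD estimate contrasts the pre-post change in the treated group with the pre-post change in the comparison group:

```latex
\hat{\delta}_{\mathrm{DiD}}
  = \left(\bar{Y}_{\text{treat, post}} - \bar{Y}_{\text{treat, pre}}\right)
  - \left(\bar{Y}_{\text{comp, post}} - \bar{Y}_{\text{comp, pre}}\right)
```

Under the parallel-trends assumption, the comparison group's change stands in for the change the treated group would have seen without treatment.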
Peabody, Michael R.; Muckle, Timothy J.; Meng, Yu – Educational Measurement: Issues and Practice, 2023
The subjective aspect of standard-setting is often criticized, yet data-driven standard-setting methods are rarely applied. Therefore, we applied a mixture Rasch model approach to setting performance standards across several testing programs of various sizes and compared the results to existing passing standards derived from traditional…
Descriptors: Item Response Theory, Standard Setting, Testing, Sampling
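For background, a mixture Rasch model assumes the population is a blend of latent classes, each with its own item difficulties; a generic form (not necessarily the authors' exact specification) is

```latex
P(X_{vi} = 1) \;=\; \sum_{g=1}^{G} \pi_g\,
  \frac{\exp\!\left(\theta_{vg} - \beta_{ig}\right)}
       {1 + \exp\!\left(\theta_{vg} - \beta_{ig}\right)}
```

where $\pi_g$ is the proportion of latent class $g$, $\theta_{vg}$ is person $v$'s ability within that class, and $\beta_{ig}$ is item $i$'s difficulty within that class.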
Ian Phil Canlas; Joyce Molino-Magtolis – Journal of Biological Education, 2024
The use of drawing as an assessment tool to reveal students' conceptions in biology, specifically of human organs and organ systems, is not new; however, little of the literature has explored and reflected on its usefulness and relevance in eliciting students' related preconceptions. Making use of a…
Descriptors: Foreign Countries, Preservice Teacher Education, Preservice Teachers, Biology
Jeff Allen; Jay Thomas; Stacy Dreyer; Scott Johanningmeier; Dana Murano; Ty Cruce; Xin Li; Edgar Sanchez – ACT Education Corp., 2025
This report describes the process of developing and validating the enhanced ACT. The report describes the changes made to the test content and the processes by which these design decisions were implemented. The authors describe how they shared the overall scope of the enhancements, including the initial blueprints, with external expert panels,…
Descriptors: College Entrance Examinations, Testing, Change, Test Construction
Militsa G. Ivanova; Michalis P. Michaelides – Practical Assessment, Research & Evaluation, 2023
Research on methods for measuring examinee engagement with constructed-response items is limited. The present study used data from the PISA 2018 Reading domain to construct and compare indicators of test-taking effort on constructed-response items: response time, number of actions, the union (combining effortless responses detected by either…
Descriptors: Achievement Tests, Foreign Countries, International Assessment, Secondary School Students
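A hedged sketch of how indicators like those named above can be combined: flag a constructed response as effortless when its response time falls below a threshold or its action count is very low, then take the union of the two flags. All field names and thresholds here are hypothetical, not the study's.

```python
# Hypothetical illustration: combining two test-taking-effort indicators
# for constructed-response items; thresholds are invented for the example.
responses = [
    {"id": 1, "response_time": 4.2,  "n_actions": 1},
    {"id": 2, "response_time": 95.0, "n_actions": 40},
    {"id": 3, "response_time": 2.1,  "n_actions": 12},
]

TIME_THRESHOLD = 5.0     # seconds; faster responses are treated as rapid/effortless
ACTION_THRESHOLD = 3     # fewer recorded actions than this also signals low effort

flagged_by_time = {r["id"] for r in responses if r["response_time"] < TIME_THRESHOLD}
flagged_by_actions = {r["id"] for r in responses if r["n_actions"] < ACTION_THRESHOLD}

union_flags = flagged_by_time | flagged_by_actions    # effortless by either indicator
print(union_flags)   # {1, 3}
```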
Jeff Allen; Ty Cruce – ACT Education Corp., 2025
This report summarizes some of the evidence supporting interpretations of scores from the enhanced ACT, focusing on reliability, concurrent validity, predictive validity, and score comparability. The authors argue that the evidence presented in this report supports the interpretation of scores from the enhanced ACT as measures of high school…
Descriptors: College Entrance Examinations, Testing, Change, Scores
Shun-Fu Hu; Amery D. Wu; Jake Stone – Journal of Educational Measurement, 2025
Scoring high-dimensional assessments (e.g., > 15 traits) can be a challenging task. This paper introduces the multilabel neural network (MNN) as a scoring method for high-dimensional assessments. Additionally, it demonstrates how MNN can score the same test responses to maximize different performance metrics, such as accuracy, recall, or…
Descriptors: Tests, Testing, Scores, Test Construction
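A minimal sketch of the general technique, multilabel scoring with a neural network, using scikit-learn's MLPClassifier on synthetic data as a stand-in; the architecture, data, and labels here are assumptions for illustration, not the MNN described in the paper.

```python
# Hypothetical sketch: scoring many traits at once with a multilabel neural network.
# Synthetic data stands in for item responses (X) and binary trait labels (Y).
import numpy as np
from sklearn.neural_network import MLPClassifier

rng = np.random.default_rng(0)
X = rng.integers(0, 2, size=(500, 60))    # 500 examinees, 60 dichotomous item responses
Y = rng.integers(0, 2, size=(500, 20))    # 20 binary trait labels per examinee

model = MLPClassifier(hidden_layer_sizes=(64,), max_iter=300, random_state=0)
model.fit(X, Y)                           # a 2-D binary Y is treated as a multilabel target

probs = model.predict_proba(X[:1])        # per-trait probabilities for one examinee
print(np.round(probs, 2))
```

Thresholding such per-trait probabilities differently is one generic way to favor accuracy, recall, or precision, which is the kind of metric-specific scoring the abstract mentions.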
Meagan Karvonen; Russell Swinburne Romine; Amy K. Clark – Practical Assessment, Research & Evaluation, 2024
This paper describes methods and findings from student cognitive labs, teacher cognitive labs, and test administration observations as evidence evaluated in a validity argument for a computer-based alternate assessment for students with significant cognitive disabilities. Validity of score interpretations and uses for alternate assessments based…
Descriptors: Students with Disabilities, Intellectual Disability, Severe Disabilities, Student Evaluation
Mansooreh Hosseinnia; Zahra Kafi – Language Testing in Asia, 2024
Because testing touches many aspects of education and many of the people involved in it, including instructors, students, managers, teacher trainers, testers, and decision-makers, developing ethical tests is highly important. In addition, as some methods of testing are favored and practiced more than others without considering the ethical…
Descriptors: Test Construction, Test Validity, Ethics, Testing
Yousuf, Mustafa S.; Miles, Katherine; Harvey, Heather; Al-Tamimi, Mohammad; Badran, Darwish – Journal of University Teaching and Learning Practice, 2022
Exams should be valid, reliable, and discriminative. Multiple informative methods are used for exam analysis; however, results displayed only numerically may not be easily comprehended, and graphical analysis tools can make them easier to interpret. Two such methods were employed: standardized x-bar control charts with…
Descriptors: Multiple Choice Tests, Testing, Test Reliability, Test Validity
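To make the x-bar idea concrete, here is a simplified, hypothetical sketch: plot each item's mean score and flag items whose means fall outside three-sigma control limits around the overall center line. The standardization details from the article are not reproduced here, and the data are invented.

```python
# Hypothetical sketch: simplified x-bar style control limits over item mean scores.
import numpy as np

rng = np.random.default_rng(1)
scores = rng.integers(0, 2, size=(200, 30))   # 200 students x 30 dichotomous items

item_means = scores.mean(axis=0)              # the "x-bar" value plotted for each item
center = item_means.mean()
sigma = item_means.std(ddof=1)

ucl, lcl = center + 3 * sigma, center - 3 * sigma      # three-sigma control limits
flagged = np.where((item_means > ucl) | (item_means < lcl))[0]
print(f"center={center:.2f}, UCL={ucl:.2f}, LCL={lcl:.2f}, flagged items={flagged}")
```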
Uminski, Crystal; Hubbard, Joanna K.; Couch, Brian A. – CBE - Life Sciences Education, 2023
Biology instructors use concept assessments in their courses to gauge student understanding of important disciplinary ideas. Instructors can choose to administer concept assessments based on participation (i.e., lower stakes) or the correctness of responses (i.e., higher stakes), and students can complete the assessment in an in-class or…
Descriptors: Biology, Science Tests, High Stakes Tests, Scores
Practices in Instrument Use and Development in "Chemistry Education Research and Practice" 2010-2021
Lazenby, Katherine; Tenney, Kristin; Marcroft, Tina A.; Komperda, Regis – Chemistry Education Research and Practice, 2023
Assessment instruments that generate quantitative data on attributes (cognitive, affective, behavioral, "etc.") of participants are commonly used in the chemistry education community to draw conclusions in research studies or inform practice. Recently, articles and editorials have stressed the importance of providing evidence for the…
Descriptors: Chemistry, Periodicals, Journal Articles, Science Education
Qian, Meihua; Wang, Xianyong – Journal of Creative Behavior, 2020
Creativity has been well studied in the past several decades, and numerous measures have been developed to assess creativity. However, validity evidence associated with each measure is often mixed. In particular, the social consequence aspect of validity has received little attention. This is partly due to the difficulty of testing for…
Descriptors: Item Response Theory, Testing, Creativity Tests, Creative Thinking