ERIC - Search Results

Publication Date

In 2026	3
Since 2025	656
Since 2022 (last 5 years)	3157
Since 2017 (last 10 years)	7398
Since 2007 (last 20 years)	15036

Descriptor

Test Reliability	15028
Test Validity	10265
Reliability	9757
Foreign Countries	7137
Test Construction	4821
Validity	4191
Measures (Individuals)	3876
Factor Analysis	3822
Psychometrics	3520
Interrater Reliability	3124
Correlation	3039
Evaluation Methods	2744
Statistical Analysis	2533
Higher Education	2514
Questionnaires	2471
Scores	2386
College Students	2209
Student Attitudes	2146
Comparative Analysis	1943
Factor Structure	1821
Student Evaluation	1693
Rating Scales	1621
Measurement Techniques	1561
Test Items	1526
Elementary Secondary Education	1498
More ▼

Author

Thompson, Bruce	44
Tindal, Gerald	41
Raykov, Tenko	39
Erford, Bradley T.	37
Marsh, Herbert W.	36
Feldt, Leonard S.	33
Fraser, Barry J.	33
Brennan, Robert L.	32
Alonzo, Julie	31
Matson, Johnny L.	29
Zimmerman, Donald W.	29
Epstein, Michael H.	26
Briesch, Amy M.	24
Tsai, Chin-Chung	24
Lane, Kathleen Lynne	23
Petscher, Yaacov	23
Anderson, Daniel	22
Hambleton, Ronald K.	22
Michael, William B.	22
Reckase, Mark D.	22
Huynh, Huynh	21
Livingston, Samuel A.	21
Attali, Yigal	19
Elliott, Stephen N.	19
More ▼

Publication Type

Journal Articles	19227
Reports - Research	17413
Reports - Evaluative	3328
Speeches/Meeting Papers	1858
Tests/Questionnaires	1596
Reports - Descriptive	1543
Information Analyses	957
Dissertations/Theses -…	673
Opinion Papers	645
Guides - Non-Classroom	325
Numerical/Quantitative Data	252
Books	135
Guides - Classroom - Teacher	81
Reports - General	70
Guides - General	57
Reference Materials -…	53
Collected Works - General	40
Book/Product Reviews	38
Collected Works - Serials	35
Collected Works - Proceedings	32
ERIC Publications	31
Multilingual/Bilingual…	26
Non-Print Media	22
Dissertations/Theses	21
ERIC Digests in Full Text	20
More ▼

Education Level

Higher Education	4720
Postsecondary Education	3734
Secondary Education	2267
Elementary Education	2194
High Schools	1083
Middle Schools	1030
Elementary Secondary Education	875
Early Childhood Education	873
Junior High Schools	713
Primary Education	427
Intermediate Grades	400
Preschool Education	384
Grade 5	341
Grade 8	325
Grade 4	318
Grade 6	299
Grade 7	279
Grade 3	270
Kindergarten	267
Adult Education	211
Grade 1	202
Grade 2	173
Grade 9	154
Grade 10	140
Grade 11	108
More ▼

Audience

Researchers	709
Practitioners	451
Teachers	208
Administrators	122
Policymakers	66
Counselors	42
Students	38
Parents	11
Community	7
Support Staff	6
Media Staff	5
More ▼

Location

Turkey	1326
Australia	436
Canada	379
China	368
United States	271
United Kingdom	256
Indonesia	251
Taiwan	234
Netherlands	223
Spain	216
California	214
Germany	196
United Kingdom (England)	192
Malaysia	170
Hong Kong	161
Florida	159
Iran	156
Nigeria	149
South Korea	135
Texas	134
India	127
New York	119
Pennsylvania	114
South Africa	109
Japan	106
More ▼

What Works Clearinghouse Rating

Meets WWC Standards without Reservations	8
Meets WWC Standards with or without Reservations	9
Does not meet standards	6

Showing 31 to 45 of 27,088 results Save | Export

Synthesizing Validity and Reliability Evidence for the Draw-A-Scientist Test

Peer reviewed
PDF on ERIC

Download full text

Julia Brochey-Taylor; Joseph A. Taylor – Educational Research and Reviews, 2024

The purpose of this synthesis study was to assess the reliability and validity of the Draw-A-Scientist Test (DAST) and its variations across multiple studies, aiming to understand limitations and propose modifications for future application within and beyond the science domain. Given the existence of multiple DAST versions, this study quantified…

Descriptors: Cognitive Tests, Freehand Drawing, Personality Measures, Projective Measures

Measuring Intentional Communication in Infants at Elevated Likelihood of Autism: Validity, Reliability, and Responsiveness of a Novel Coding Scale

Peer reviewed

Direct link

Elizabeth Choi-Tucci; John Sideris; Cristin Holland; Grace T. Baranek; Linda R. Watson – Journal of Speech, Language, and Hearing Research, 2025

Purpose: Intentional communication acts, or purposefully directed vocalizations and gestures, are particularly difficult for infants at elevated likelihood for eventual diagnosis of autism. The ability to measure and track intentional communication in infancy thus has the potential to aid early identification and intervention efforts. This study…

Descriptors: Infants, Autism Spectrum Disorders, Caregiver Child Relationship, Nonverbal Communication

Creativity Assessment over Time: Examining the Reliability of CAT Ratings

Peer reviewed

Direct link

Barth, Philipp; Stadtmann, Georg – Journal of Creative Behavior, 2021

The "consensual assessment technique" (CAT) is a reliable and valid method to measure (product) creativity and often considered "the" gold standard of creativity assessment. The reliability measure traditionally applied in CAT studies--inter-rater reliability--cannot capture time-sampling error, which is a particular relevant…

Descriptors: Creativity, Creativity Tests, Test Reliability, Interrater Reliability

Different Methods for Assessing Preservice Teachers' Instruction: Why Measures Matter

Peer reviewed

Direct link

Arielle Boguslav; Julie Cohen – Journal of Teacher Education, 2024

Teacher preparation programs are increasingly expected to use data on preservice teacher (PST) skills to drive program improvement and provide targeted supports. Observational ratings are especially vital, but also prone to measurement issues. Scores may be influenced by factors unrelated to PSTs' instructional skills, including rater standards.…

Descriptors: Preservice Teachers, Measures (Individuals), Evaluation Problems, Teaching Skills

A Review of Automatic Item Generation Techniques Leveraging Large Language Models

Peer reviewed
PDF on ERIC

Download full text

Bin Tan; Nour Armoush; Elisabetta Mazzullo; Okan Bulut; Mark J. Gierl – International Journal of Assessment Tools in Education, 2025

This study reviews existing research on the use of large language models (LLMs) for automatic item generation (AIG). We performed a comprehensive literature search across seven research databases, selected studies based on predefined criteria, and summarized 60 relevant studies that employed LLMs in the AIG process. We identified the most commonly…

Descriptors: Artificial Intelligence, Test Items, Automation, Test Format

Undergraduate Students' Career Resources: Validation of the Italian Version of the Career Resources Questionnaire

Peer reviewed

Direct link

Francesco Pace; Giulia Sciotto – International Journal for Educational and Vocational Guidance, 2025

In recent years, to better face university paths, the first approaches to the labor market, and then the actual university-to-work transition, university students are asked to have broader skills, such as the ability to network, to be involved in career-related issues, and to explore the characteristics of occupations as much as personal ones.…

Descriptors: Undergraduate Students, Questionnaires, Foreign Countries, Test Reliability

Mental Toughness of Physical Education Teachers: Validation of a New Questionnaire

Peer reviewed

Direct link

Sima Zach; Noa Fishler-Barum; Itamar Shidlov – Physical Educator, 2025

The purpose of the study was to develop the Teachers' Mental Toughness Questionnaire (TMTQ). The questionnaire was developed in six stages: item generation, content validity, exploratory factor analysis, reliability tests, convergent validity tests, and discriminant validity. The factor analysis indicates that it measures six factors: team,…

Descriptors: Test Construction, Test Validity, Test Reliability, Psychometrics

Automated Scoring in Learning Progression-Based Assessment: A Comparison of Researcher and Machine Interpretations

Peer reviewed

Direct link

Hui Jin; Cynthia Lima; Limin Wang – Educational Measurement: Issues and Practice, 2025

Although AI transformer models have demonstrated notable capability in automated scoring, it is difficult to examine how and why these models fall short in scoring some responses. This study investigated how transformer models' language processing and quantification processes can be leveraged to enhance the accuracy of automated scoring. Automated…

Descriptors: Automation, Scoring, Artificial Intelligence, Accuracy

Toward Sufficient Statistical Power in Algorithmic Bias Assessment: A Test for ABROCA

Peer reviewed
PDF on ERIC

Download full text

Conrad Borchers – International Educational Data Mining Society, 2025

Algorithmic bias is a pressing concern in educational data mining (EDM), as it risks amplifying inequities in learning outcomes. The Area Between ROC Curves (ABROCA) metric is frequently used to measure discrepancies in model performance across demographic groups to quantify overall model fairness. However, its skewed distribution--especially when…

Descriptors: Algorithms, Bias, Statistics, Simulation

Authenticating Truth in a Post-Truth Climate

Peer reviewed
PDF on ERIC

Download full text

Clarence Joldersma – Philosophical Studies in Education, 2025

In this paper, the author will develop a more comprehensive notion of truth, one that goes beyond the epistemological correspondence theory, and the author will argue for the importance of authentication as a crucial extension of truth, especially in a posttruth climate. Hannah Arendt observes, "facts need testimony to be remembered and…

Descriptors: Educational Philosophy, Educational Theories, Epistemology, Educational Practices

A Systematic Review of the International Assessment Literacy Measures in Higher Education (2013-2023)

Peer reviewed
PDF on ERIC

Download full text

Beyza Aksu Dunya; Mehmet Can Demir; Stefanie Wind – Research & Practice in Assessment, 2025

This paper aims to synthesize measures of assessment literacy in higher education by forging a connection between two research domains: educational assessment and psychometrics. It begins with a systematic review of assessment literacy measures within the context of higher education published within the last ten years. AL measures, including tests…

Descriptors: Assessment Literacy, Higher Education, Measures (Individuals), Reliability

Classification Consistency and Accuracy Indices for Simple Structure MIRT Model

Peer reviewed

Direct link

Huan Liu; Won-Chan Lee – Journal of Educational Measurement, 2025

This study investigates the estimation of classification consistency and accuracy indices for composite summed and theta scores within the SS-MIRT framework, using five popular approaches, including the Lee, Rudner, Guo, Bayesian EAP, and Bayesian MCMC approaches. The procedures are illustrated through analysis of two real datasets and further…

Descriptors: Classification, Reliability, Accuracy, Item Response Theory

IRT Scoring and Recursion for Estimating Reliability and Other Accuracy Indices

Peer reviewed

Direct link

Tim Moses; YoungKoung Kim – Journal of Educational Measurement, 2025

This study considers the estimation of marginal reliability and conditional accuracy measures using a generalized recursion procedure with several IRT-based ability and score estimators. The estimators include MLE, TCC, and EAP abilities, and corresponding test scores obtained with different weightings of the item scores. We consider reliability…

Descriptors: Item Response Theory, Scoring, Reliability, Accuracy

Constructing a Roadmap to Measure the Quality of Business Assessments Aimed at Curriculum Management

Peer reviewed

Direct link

Silva, Thanuci; Santos, Regiane dos; Mallet, Débora – Journal of Education for Business, 2023

Assuring the quality of education is a concern of learning institutions. To do so, it is necessary to have assertive learning management, with consistent data on students' outcomes. This research provides associate deans and researchers, a roadmap with which to gather evidence to improve the quality of open-ended assessments. Based on statistical…

Descriptors: Student Evaluation, Evaluation Methods, Business Education, Higher Education

Validity and Reliability of Child-Friendly School Policy Evaluation Instruments in Primary Schools: Confirmatory Factor Analysis

Peer reviewed
PDF on ERIC

Download full text

Riana Nurhayati; Suranto Aw; Siti Irene Astuti Dwiningrum; Mami Hajaroh; Herwin Herwin – International Journal of Educational Methodology, 2024

Evaluation of child-friendly school (CFS) policies is essential to determine the achievements of school efforts in reducing violence cases. This research aims to proving the reliability and validity of CFS policy evaluation instruments in elementary schools with different locations. This investigation uses the Context Input Process Product (CIPP)…

Descriptors: Validity, Reliability, School Policy, Program Evaluation

« Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 1806

Educational and Psychological…	810
ProQuest LLC	659
Journal of Psychoeducational…	399
Online Submission	327
Journal of Educational…	252
Journal of Autism and…	232
Psychology in the Schools	232
Measurement and Evaluation in…	230
Grantee Submission	183
Psychological Assessment	180
Journal of Speech, Language,…	174
Measurement in Physical…	170
Applied Psychological…	149
Assessment for Effective…	138
International Journal of…	134
Journal of Consulting and…	131
Educational Research and…	130
Assessment & Evaluation in…	124
Language Testing	120
Psychometrika	120
Research on Social Work…	120
Educational Sciences: Theory…	119
Applied Measurement in…	111
International Journal of…	110
ETS Research Report Series	106
More ▼

No Child Left Behind Act 2001	136
Individuals with Disabilities…	44
Race to the Top	27
Elementary and Secondary…	20
Every Student Succeeds Act…	20
Elementary and Secondary…	16
Individuals with Disabilities…	11
American Recovery and…	10
Rehabilitation Act 1973…	8
Americans with Disabilities…	5
Elementary and Secondary…	5
Education Consolidation…	4
Education for All Handicapped…	4
Head Start	4
Individuals with Disabilities…	4
Adoption and Safe Families…	2
Child Abuse Prevention and…	2
Comprehensive Employment and…	2
Education Amendments 1974	2
Education of the Handicapped…	2
Elementary and Secondary…	2
Individuals with Disabilities…	2
Individuals with Disabilities…	2
Kentucky Education Reform Act…	2
Title IX Education Amendments…	2
More ▼

General Aptitude Test Battery	463
Wechsler Intelligence Scale…	176
Peabody Picture Vocabulary…	88
SAT (College Admission Test)	86
Test of English as a Foreign…	82
Wechsler Adult Intelligence…	74
Strengths and Difficulties…	66
Program for International…	62
Child Behavior Checklist	59
National Assessment of…	56
ACT Assessment	52
Minnesota Multiphasic…	52
Stanford Achievement Tests	52
Beck Depression Inventory	50
Autism Diagnostic Observation…	47
Stanford Binet Intelligence…	45
Woodcock Johnson Tests of…	45
Motivated Strategies for…	43
Raven Progressive Matrices	43
Behavior Assessment System…	42
Graduate Record Examinations	41
Iowa Tests of Basic Skills	41
Marlowe Crowne Social…	41
Vineland Adaptive Behavior…	39
Kaufman Assessment Battery…	38
More ▼