ERIC - Search Results

Publication Date

In 2026	3
Since 2025	675
Since 2022 (last 5 years)	3176
Since 2017 (last 10 years)	7417
Since 2007 (last 20 years)	15055

Descriptor

Test Reliability	15043
Test Validity	10279
Reliability	9761
Foreign Countries	7144
Test Construction	4825
Validity	4191
Measures (Individuals)	3877
Factor Analysis	3825
Psychometrics	3526
Interrater Reliability	3124
Correlation	3040
Evaluation Methods	2746
Statistical Analysis	2533
Higher Education	2515
Questionnaires	2473
Scores	2386
College Students	2211
Student Attitudes	2148
Comparative Analysis	1943
Factor Structure	1822
Student Evaluation	1695
Rating Scales	1623
Measurement Techniques	1562
Test Items	1528
Construct Validity	1498
More ▼

Author

Thompson, Bruce	44
Tindal, Gerald	41
Raykov, Tenko	39
Erford, Bradley T.	37
Marsh, Herbert W.	36
Feldt, Leonard S.	33
Fraser, Barry J.	33
Brennan, Robert L.	32
Alonzo, Julie	31
Matson, Johnny L.	29
Zimmerman, Donald W.	29
Epstein, Michael H.	26
Briesch, Amy M.	24
Tsai, Chin-Chung	24
Lane, Kathleen Lynne	23
Petscher, Yaacov	23
Anderson, Daniel	22
Hambleton, Ronald K.	22
Michael, William B.	22
Reckase, Mark D.	22
Huynh, Huynh	21
Livingston, Samuel A.	21
Attali, Yigal	19
Elliott, Stephen N.	19
More ▼

Publication Type

Journal Articles	19242
Reports - Research	17430
Reports - Evaluative	3328
Speeches/Meeting Papers	1861
Tests/Questionnaires	1598
Reports - Descriptive	1544
Information Analyses	958
Dissertations/Theses -…	673
Opinion Papers	645
Guides - Non-Classroom	325
Numerical/Quantitative Data	252
Books	135
Guides - Classroom - Teacher	81
Reports - General	70
Guides - General	57
Reference Materials -…	53
Collected Works - General	40
Book/Product Reviews	38
Collected Works - Serials	35
Collected Works - Proceedings	32
ERIC Publications	31
Multilingual/Bilingual…	26
Non-Print Media	22
Dissertations/Theses	21
ERIC Digests in Full Text	20
More ▼

Education Level

Higher Education	4726
Postsecondary Education	3740
Secondary Education	2273
Elementary Education	2197
High Schools	1085
Middle Schools	1033
Elementary Secondary Education	876
Early Childhood Education	874
Junior High Schools	715
Primary Education	427
Intermediate Grades	401
Preschool Education	385
Grade 5	342
Grade 8	325
Grade 4	318
Grade 6	299
Grade 7	279
Grade 3	270
Kindergarten	267
Adult Education	211
Grade 1	202
Grade 2	173
Grade 9	154
Grade 10	140
Grade 11	109
More ▼

Audience

Researchers	709
Practitioners	451
Teachers	208
Administrators	122
Policymakers	66
Counselors	42
Students	38
Parents	11
Community	7
Support Staff	6
Media Staff	5
More ▼

Location

Turkey	1328
Australia	436
Canada	379
China	368
United States	271
United Kingdom	256
Indonesia	253
Taiwan	234
Netherlands	223
Spain	217
California	215
Germany	197
United Kingdom (England)	192
Malaysia	170
Hong Kong	161
Florida	159
Iran	156
Nigeria	149
South Korea	135
Texas	134
India	127
New York	119
Pennsylvania	114
South Africa	109
Japan	106
More ▼

What Works Clearinghouse Rating

Meets WWC Standards without Reservations	8
Meets WWC Standards with or without Reservations	9
Does not meet standards	6

Showing 31 to 45 of 27,107 results Save | Export

Exploring Ranking Consistency of Generative AI in MOOC Platform Evaluation: A Non-Parametric Approach

Peer reviewed
PDF on ERIC

Download full text

Victor K. Y. Chan – International Association for Development of the Information Society, 2025

This paper extends a prior study on the consistency of generative Artificial Intelligence (AI) models in evaluating Massive Open Online Course (MOOC) platforms. While the original work focused on the consistency of direct numerical scores, this research investigates the consistency of the rankings derived from those scores. When evaluating…

Descriptors: Artificial Intelligence, MOOCs, Reliability, Evaluation Methods

Synthesizing Validity and Reliability Evidence for the Draw-A-Scientist Test

Peer reviewed
PDF on ERIC

Download full text

Julia Brochey-Taylor; Joseph A. Taylor – Educational Research and Reviews, 2024

The purpose of this synthesis study was to assess the reliability and validity of the Draw-A-Scientist Test (DAST) and its variations across multiple studies, aiming to understand limitations and propose modifications for future application within and beyond the science domain. Given the existence of multiple DAST versions, this study quantified…

Descriptors: Cognitive Tests, Freehand Drawing, Personality Measures, Projective Measures

Measuring Intentional Communication in Infants at Elevated Likelihood of Autism: Validity, Reliability, and Responsiveness of a Novel Coding Scale

Peer reviewed

Direct link

Elizabeth Choi-Tucci; John Sideris; Cristin Holland; Grace T. Baranek; Linda R. Watson – Journal of Speech, Language, and Hearing Research, 2025

Purpose: Intentional communication acts, or purposefully directed vocalizations and gestures, are particularly difficult for infants at elevated likelihood for eventual diagnosis of autism. The ability to measure and track intentional communication in infancy thus has the potential to aid early identification and intervention efforts. This study…

Descriptors: Infants, Autism Spectrum Disorders, Caregiver Child Relationship, Nonverbal Communication

Creativity Assessment over Time: Examining the Reliability of CAT Ratings

Peer reviewed

Direct link

Barth, Philipp; Stadtmann, Georg – Journal of Creative Behavior, 2021

The "consensual assessment technique" (CAT) is a reliable and valid method to measure (product) creativity and often considered "the" gold standard of creativity assessment. The reliability measure traditionally applied in CAT studies--inter-rater reliability--cannot capture time-sampling error, which is a particular relevant…

Descriptors: Creativity, Creativity Tests, Test Reliability, Interrater Reliability

Different Methods for Assessing Preservice Teachers' Instruction: Why Measures Matter

Peer reviewed

Direct link

Arielle Boguslav; Julie Cohen – Journal of Teacher Education, 2024

Teacher preparation programs are increasingly expected to use data on preservice teacher (PST) skills to drive program improvement and provide targeted supports. Observational ratings are especially vital, but also prone to measurement issues. Scores may be influenced by factors unrelated to PSTs' instructional skills, including rater standards.…

Descriptors: Preservice Teachers, Measures (Individuals), Evaluation Problems, Teaching Skills

A Review of Automatic Item Generation Techniques Leveraging Large Language Models

Peer reviewed
PDF on ERIC

Download full text

Bin Tan; Nour Armoush; Elisabetta Mazzullo; Okan Bulut; Mark J. Gierl – International Journal of Assessment Tools in Education, 2025

This study reviews existing research on the use of large language models (LLMs) for automatic item generation (AIG). We performed a comprehensive literature search across seven research databases, selected studies based on predefined criteria, and summarized 60 relevant studies that employed LLMs in the AIG process. We identified the most commonly…

Descriptors: Artificial Intelligence, Test Items, Automation, Test Format

Undergraduate Students' Career Resources: Validation of the Italian Version of the Career Resources Questionnaire

Peer reviewed

Direct link

Francesco Pace; Giulia Sciotto – International Journal for Educational and Vocational Guidance, 2025

In recent years, to better face university paths, the first approaches to the labor market, and then the actual university-to-work transition, university students are asked to have broader skills, such as the ability to network, to be involved in career-related issues, and to explore the characteristics of occupations as much as personal ones.…

Descriptors: Undergraduate Students, Questionnaires, Foreign Countries, Test Reliability

Mental Toughness of Physical Education Teachers: Validation of a New Questionnaire

Peer reviewed

Direct link

Sima Zach; Noa Fishler-Barum; Itamar Shidlov – Physical Educator, 2025

The purpose of the study was to develop the Teachers' Mental Toughness Questionnaire (TMTQ). The questionnaire was developed in six stages: item generation, content validity, exploratory factor analysis, reliability tests, convergent validity tests, and discriminant validity. The factor analysis indicates that it measures six factors: team,…

Descriptors: Test Construction, Test Validity, Test Reliability, Psychometrics

Automated Scoring in Learning Progression-Based Assessment: A Comparison of Researcher and Machine Interpretations

Peer reviewed

Direct link

Hui Jin; Cynthia Lima; Limin Wang – Educational Measurement: Issues and Practice, 2025

Although AI transformer models have demonstrated notable capability in automated scoring, it is difficult to examine how and why these models fall short in scoring some responses. This study investigated how transformer models' language processing and quantification processes can be leveraged to enhance the accuracy of automated scoring. Automated…

Descriptors: Automation, Scoring, Artificial Intelligence, Accuracy

Toward Sufficient Statistical Power in Algorithmic Bias Assessment: A Test for ABROCA

Peer reviewed
PDF on ERIC

Download full text

Conrad Borchers – International Educational Data Mining Society, 2025

Algorithmic bias is a pressing concern in educational data mining (EDM), as it risks amplifying inequities in learning outcomes. The Area Between ROC Curves (ABROCA) metric is frequently used to measure discrepancies in model performance across demographic groups to quantify overall model fairness. However, its skewed distribution--especially when…

Descriptors: Algorithms, Bias, Statistics, Simulation

Authenticating Truth in a Post-Truth Climate

Peer reviewed
PDF on ERIC

Download full text

Clarence Joldersma – Philosophical Studies in Education, 2025

In this paper, the author will develop a more comprehensive notion of truth, one that goes beyond the epistemological correspondence theory, and the author will argue for the importance of authentication as a crucial extension of truth, especially in a posttruth climate. Hannah Arendt observes, "facts need testimony to be remembered and…

Descriptors: Educational Philosophy, Educational Theories, Epistemology, Educational Practices

A Systematic Review of the International Assessment Literacy Measures in Higher Education (2013-2023)

Peer reviewed
PDF on ERIC

Download full text

Beyza Aksu Dunya; Mehmet Can Demir; Stefanie Wind – Research & Practice in Assessment, 2025

This paper aims to synthesize measures of assessment literacy in higher education by forging a connection between two research domains: educational assessment and psychometrics. It begins with a systematic review of assessment literacy measures within the context of higher education published within the last ten years. AL measures, including tests…

Descriptors: Assessment Literacy, Higher Education, Measures (Individuals), Reliability

Classification Consistency and Accuracy Indices for Simple Structure MIRT Model

Peer reviewed

Direct link

Huan Liu; Won-Chan Lee – Journal of Educational Measurement, 2025

This study investigates the estimation of classification consistency and accuracy indices for composite summed and theta scores within the SS-MIRT framework, using five popular approaches, including the Lee, Rudner, Guo, Bayesian EAP, and Bayesian MCMC approaches. The procedures are illustrated through analysis of two real datasets and further…

Descriptors: Classification, Reliability, Accuracy, Item Response Theory

IRT Scoring and Recursion for Estimating Reliability and Other Accuracy Indices

Peer reviewed

Direct link

Tim Moses; YoungKoung Kim – Journal of Educational Measurement, 2025

This study considers the estimation of marginal reliability and conditional accuracy measures using a generalized recursion procedure with several IRT-based ability and score estimators. The estimators include MLE, TCC, and EAP abilities, and corresponding test scores obtained with different weightings of the item scores. We consider reliability…

Descriptors: Item Response Theory, Scoring, Reliability, Accuracy

Constructing a Roadmap to Measure the Quality of Business Assessments Aimed at Curriculum Management

Peer reviewed

Direct link

Silva, Thanuci; Santos, Regiane dos; Mallet, Débora – Journal of Education for Business, 2023

Assuring the quality of education is a concern of learning institutions. To do so, it is necessary to have assertive learning management, with consistent data on students' outcomes. This research provides associate deans and researchers, a roadmap with which to gather evidence to improve the quality of open-ended assessments. Based on statistical…

Descriptors: Student Evaluation, Evaluation Methods, Business Education, Higher Education

« Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 1808

Educational and Psychological…	810
ProQuest LLC	659
Journal of Psychoeducational…	399
Online Submission	327
Journal of Educational…	252
Journal of Autism and…	233
Psychology in the Schools	232
Measurement and Evaluation in…	230
Grantee Submission	184
Psychological Assessment	180
Journal of Speech, Language,…	174
Measurement in Physical…	170
Applied Psychological…	149
Assessment for Effective…	138
International Journal of…	134
Journal of Consulting and…	131
Educational Research and…	130
Assessment & Evaluation in…	124
Language Testing	120
Psychometrika	120
Research on Social Work…	120
Educational Sciences: Theory…	119
Applied Measurement in…	111
International Journal of…	110
ETS Research Report Series	106
More ▼

No Child Left Behind Act 2001	136
Individuals with Disabilities…	44
Race to the Top	27
Elementary and Secondary…	20
Every Student Succeeds Act…	20
Elementary and Secondary…	16
Individuals with Disabilities…	11
American Recovery and…	10
Rehabilitation Act 1973…	8
Americans with Disabilities…	5
Elementary and Secondary…	5
Head Start	5
Education Consolidation…	4
Education for All Handicapped…	4
Individuals with Disabilities…	4
Adoption and Safe Families…	2
Child Abuse Prevention and…	2
Comprehensive Employment and…	2
Education Amendments 1974	2
Education of the Handicapped…	2
Elementary and Secondary…	2
Individuals with Disabilities…	2
Individuals with Disabilities…	2
Kentucky Education Reform Act…	2
Title IX Education Amendments…	2
More ▼

General Aptitude Test Battery	463
Wechsler Intelligence Scale…	176
Peabody Picture Vocabulary…	88
SAT (College Admission Test)	86
Test of English as a Foreign…	82
Wechsler Adult Intelligence…	74
Strengths and Difficulties…	66
Program for International…	62
Child Behavior Checklist	59
National Assessment of…	56
ACT Assessment	52
Minnesota Multiphasic…	52
Stanford Achievement Tests	52
Beck Depression Inventory	50
Autism Diagnostic Observation…	47
Stanford Binet Intelligence…	45
Woodcock Johnson Tests of…	45
Motivated Strategies for…	43
Raven Progressive Matrices	43
Behavior Assessment System…	42
Graduate Record Examinations	41
Iowa Tests of Basic Skills	41
Marlowe Crowne Social…	41
Vineland Adaptive Behavior…	39
Kaufman Assessment Battery…	38
More ▼