ERIC - Search Results

Publication Date

In 2025	285
Since 2024	1149
Since 2021 (last 5 years)	3719
Since 2016 (last 10 years)	7918
Since 2006 (last 20 years)	15095

Descriptor

Test Reliability	14751
Test Validity	10028
Reliability	9655
Foreign Countries	6903
Test Construction	4695
Validity	4150
Measures (Individuals)	3801
Factor Analysis	3768
Psychometrics	3447
Interrater Reliability	3093
Correlation	3027
Evaluation Methods	2724
Statistical Analysis	2528
Higher Education	2495
Questionnaires	2433
Scores	2350
College Students	2177
Student Attitudes	2105
Comparative Analysis	1938
Factor Structure	1779
Student Evaluation	1669
Rating Scales	1602
Measurement Techniques	1554
Test Items	1487
Elementary Secondary Education	1486
More ▼

Author

Thompson, Bruce	44
Tindal, Gerald	41
Raykov, Tenko	39
Erford, Bradley T.	37
Marsh, Herbert W.	36
Feldt, Leonard S.	33
Fraser, Barry J.	33
Brennan, Robert L.	32
Alonzo, Julie	31
Matson, Johnny L.	29
Zimmerman, Donald W.	29
Epstein, Michael H.	26
Briesch, Amy M.	24
Tsai, Chin-Chung	24
Lane, Kathleen Lynne	23
Petscher, Yaacov	23
Anderson, Daniel	22
Hambleton, Ronald K.	22
Michael, William B.	22
Reckase, Mark D.	22
Huynh, Huynh	21
Livingston, Samuel A.	21
Attali, Yigal	19
Elliott, Stephen N.	19
More ▼

Publication Type

Journal Articles	18852
Reports - Research	17047
Reports - Evaluative	3318
Speeches/Meeting Papers	1852
Tests/Questionnaires	1560
Reports - Descriptive	1534
Information Analyses	933
Dissertations/Theses -…	673
Opinion Papers	645
Guides - Non-Classroom	324
Numerical/Quantitative Data	250
Books	131
Guides - Classroom - Teacher	80
Reports - General	70
Guides - General	57
Reference Materials -…	53
Collected Works - General	40
Book/Product Reviews	38
Collected Works - Serials	35
Collected Works - Proceedings	32
ERIC Publications	31
Multilingual/Bilingual…	26
Dissertations/Theses	21
ERIC Digests in Full Text	20
Non-Print Media	20
More ▼

Education Level

Higher Education	4596
Postsecondary Education	3610
Secondary Education	2196
Elementary Education	2137
High Schools	1050
Middle Schools	1004
Elementary Secondary Education	860
Early Childhood Education	852
Junior High Schools	696
Primary Education	412
Intermediate Grades	385
Preschool Education	379
Grade 5	333
Grade 8	325
Grade 4	309
Grade 6	295
Grade 7	277
Grade 3	266
Kindergarten	261
Adult Education	206
Grade 1	198
Grade 2	168
Grade 9	153
Grade 10	139
Grade 11	105
More ▼

Audience

Researchers	705
Practitioners	449
Teachers	206
Administrators	122
Policymakers	66
Counselors	42
Students	37
Parents	11
Community	7
Media Staff	5
Support Staff	5
More ▼

Location

Turkey	1274
Australia	432
Canada	375
China	346
United States	268
United Kingdom	250
Taiwan	227
Indonesia	223
Netherlands	218
California	212
Spain	210
Germany	189
United Kingdom (England)	189
Malaysia	164
Florida	159
Hong Kong	159
Iran	151
Nigeria	148
Texas	134
South Korea	127
India	119
New York	118
Pennsylvania	112
South Africa	107
Greece	103
More ▼

What Works Clearinghouse Rating

Meets WWC Standards without Reservations	8
Meets WWC Standards with or without Reservations	9
Does not meet standards	6

Showing 76 to 90 of 26,686 results Save | Export

Using Systematic Social Observations to Measure Crime Prevention through Environmental Design and Disorder: In-situ Observations, Photographs, and Google Street View Imagery

Peer reviewed

Direct link

Sas, Marlies; Snaphaan, Thom; Pauwels, Lieven J. R.; Ponnet, Koen; Hardyns, Wim – Field Methods, 2023

This study focuses on the use of systematic social observations (SSO) to measure crime prevention through environmental design (CPTED) and disorder. To improve knowledge about measurement issues in small area research, SSO is conducted by means of three different methods: in-situ, photographs, and Google Street View (GSV) imagery. By evaluating…

Descriptors: Crime Prevention, Measurement Techniques, Photography, Observation

Development and Validation of an Integrated STEM Teacher Classroom Observation Protocol

Peer reviewed

Direct link

Zhou, Shuqi; Merzdorf, Hillary E.; Douglas, Kerrie A.; Moore, Tamara J. – Journal of Pre-College Engineering Education Research, 2023

This study aimed to develop a K-12 classroom observation protocol to assess K-12 teachers' implementation of science, technology, engineering, and mathematics (STEM) integration. The intended purpose of the observation protocol is for researchers to examine how K-12 teachers implement the STEM integrated curriculum. Based on research on STEM…

Descriptors: Test Construction, Test Validity, STEM Education, Classroom Observation Techniques

Using Bayesian Generalized Structural Equation Modeling to Analyze Latent Agreement

Direct link

McCluskey, Sydne – ProQuest LLC, 2023

Rater comparison analysis is commonly necessary in the social sciences. Conventional approaches to the problem generally focus on calculation of agreement statistics, which provide useful but incomplete information about rater agreement. Importantly, one-number agreement statistics give no indication regarding the nature of disagreements, nor do…

Descriptors: Bayesian Statistics, Structural Equation Models, Interrater Reliability, Beliefs

An Exploration of "Real Time" Assessments as a Means to Better Understand Preceptors' Judgments of Student Performance

Peer reviewed

Direct link

Luu, Kimberly; Sidhu, Ravi; Chadha, Neil K.; Eva, Kevin W. – Advances in Health Sciences Education, 2023

Clinical supervisors are known to assess trainee performance idiosyncratically, causing concern about the validity of their ratings. The literature on this issue relies heavily on retrospective collection of decisions, resulting in the risk of inaccurate information regarding what actually drives raters' perceptions. Capturing in-the-moment…

Descriptors: Clinical Experience, Practicum Supervision, Student Evaluation, Evaluation Methods

Reliability and Validity of Representational Mind-Mindedness in Mothers of Infants

Peer reviewed

Direct link

Egmose, Ida; Skou, Mia; Madsen, Eva Back; Stuart, Anne Christine; Krogh, Marianne Thode; Haase, Tina Wahl; Vaever, Mette Skovgaard – European Journal of Developmental Psychology, 2023

Mind-mindedness (MM) refers to the parent's ability to treat the child as an individual with a mind of his or her own. Studies have found representational and interactional MM to predict child development, but more research is needed on the validity of representational MM in parents of infants. Therefore, we examine the reliability and validity of…

Descriptors: Individualism, Mothers, Infants, Foreign Countries

An Experimental Study of Standard Setting Methods for Diagnostic Profiles

Direct link

Feldberg, Zachary R. – ProQuest LLC, 2023

Cognitive diagnostic models (CDMs) provide pedagogically relevant information in the form of a student profile of multiple binary categorizations of students into mastery or nonmastery statuses on latent traits called attributes. Federal educational accountability requires accountability measures to designate students into one of at least three…

Descriptors: Accountability, Standards, Cutting Scores, Models

"Rater Training" Re-Imagined for Work-Based Assessment in Medical Education

Peer reviewed

Direct link

Tavares, Walter; Kinnear, Benjamin; Schumacher, Daniel J.; Forte, Milena – Advances in Health Sciences Education, 2023

In this perspective, the authors critically examine "rater training" as it has been conceptualized and used in medical education. By "rater training," they mean the educational events intended to "improve" rater performance and contributions during assessment events. Historically, rater training programs have focused…

Descriptors: Medical Education, Interrater Reliability, Evaluation Methods, Training

A Perceptual Outcome Measure of Velopharyngeal Function Based on the Cleft Audit Protocol for Speech--Augmented (CAPS-A VPC-Sum): Validation through a Speech Osteotomy Study

Peer reviewed

Direct link

Pereira, Valerie J.; Tuomainen, Jyrki; Lee, Kathy Y. S.; Tong, Michael C. F.; Sell, Debbie A. – International Journal of Language & Communication Disorders, 2021

Background: The status of the velopharyngeal mechanism can be inferred from perceptual ratings of specified speech parameters. Several studies have proposed the measure of an overall velopharyngeal composite score based on these perceptual ratings and have reported good validity. The Cleft Audit Protocol for Speech--Augmented (CAPS-A) is a…

Descriptors: Congenital Impairments, Speech Tests, Outcome Measures, Test Validity

Visualizing Agreement: Bland-Altman Plots as a Supplement to Inter-Rater Reliability Indices

Peer reviewed

Direct link

Brogan L. Barr; Virginia V. W. McIntosh; Eileen F. Britt; Jennifer Jordan; Janet D. Carter – Measurement: Interdisciplinary Research and Perspectives, 2024

Even when raters demonstrate agreement in the use of a measure, limited score variability or violation of often-ignored statistical assumptions can result in lower reliability estimates than intuitively expected. This article uses data drawn from two randomized controlled trials of schema therapy and cognitive behavioral therapy for the treatment…

Descriptors: Evaluators, Interrater Reliability, Reliability, Measurement Techniques

Psychometric Synthesis of the Drug Abuse Screening Test (DAST) Versions

Peer reviewed

Direct link

Erin Johnson; Samantha Barstack; Yikai Xu; Hannah Wise; Bradley T. Erford; Catharina Chang; David Delmonico – Measurement and Evaluation in Counseling and Development, 2025

Problem Statement: Among individuals aged 12 years or older, 14.3% (40.0 million) reporting the use of an illicit drug in the previous year. Given the prevalence of drug abuse, it is increasingly important to determine effective screening practices, treatment procedures, and best practices among various subpopulations to identify drug use-related…

Descriptors: Drug Abuse, Screening Tests, Psychometrics, Synthesis

Superficially Plausible Outputs from a Black Box: Problematising GenAI Tools for Analysing Qualitative SoTL Data

Peer reviewed
PDF on ERIC

Download full text

Mirjam Sophia Glessmer; Rachel Forsyth – Teaching & Learning Inquiry, 2025

Generative AI tools (GenAI) are increasingly used for academic tasks, including qualitative data analysis for the Scholarship of Teaching and Learning (SoTL). In our practice as academic developers, we are frequently asked for advice on whether this use for GenAI is reliable, valid, and ethical. Since this is a new field, we have not been able to…

Descriptors: Artificial Intelligence, Research Methodology, Data Analysis, Scholarship

Examining the Psychometric Impact of Targeted and Random Double-Scoring in Mixed-Format Assessments

Peer reviewed

Direct link

Yangmeng Xu; Stefanie A. Wind – Educational Measurement: Issues and Practice, 2025

Double-scoring constructed-response items is a common but costly practice in mixed-format assessments. This study explored the impacts of Targeted Double-Scoring (TDS) and random double-scoring procedures on the quality of psychometric outcomes, including student achievement estimates, person fit, and student classifications under various…

Descriptors: Academic Achievement, Psychometrics, Scoring, Evaluation Methods

The Behavior Problem Inventory--Short Form: Psychometric Properties in a Spanish Sample of Intellectual Disabilities

Peer reviewed

Direct link

Juliana Reyes-Martin; David Simó-Pinatella; Ana Andrés – Journal of Applied Research in Intellectual Disabilities, 2025

Background: Behavioural problems in individuals with intellectual disabilities have a negative impact on them. Limited assessment measures exist in Spain. This study aimed to validate the Behavior Problems Inventory--Short Form (BPI-S) in the Spanish population by examining its psychometric properties and factorial structures. Method: This study…

Descriptors: Foreign Countries, Behavior Problems, Students with Disabilities, Intellectual Disability

GPT-4 in Education: Evaluating Aptness, Reliability, and Loss of Coherence in Solving Calculus Problems and Grading Submissions

Peer reviewed

Direct link

Alberto Gandolfi – International Journal of Artificial Intelligence in Education, 2025

In this paper, we initially investigate the capabilities of GPT-3 5 and GPT-4 in solving college-level calculus problems, an essential segment of mathematics that remains under-explored so far. Although improving upon earlier versions, GPT-4 attains approximately 65% accuracy for standard problems and decreases to 20% for competition-like…

Descriptors: Artificial Intelligence, Reliability, Problem Solving, Mathematics Skills

Examining the Wording Effect: What Are We Measuring?

Peer reviewed

Direct link

Abdullah Faruk Kiliç; Meltem Acar Güvendir; Gül Güler; Tugay Kaçak – Measurement: Interdisciplinary Research and Perspectives, 2025

In this study, the extent to wording effects impact structure and factor loadings, internal consistency and measurement invariance was outlined. The modified form, which includes items that semantically reversed, explains %21.5 more variance than the original form. Also, reversed items' factor loadings are higher. As a result of CFA, indexes…

Descriptors: Test Items, Factor Structure, Test Reliability, Semantics

« Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 1780

Educational and Psychological…	810
ProQuest LLC	659
Journal of Psychoeducational…	388
Online Submission	326
Journal of Educational…	246
Measurement and Evaluation in…	230
Journal of Autism and…	226
Psychology in the Schools	212
Grantee Submission	183
Psychological Assessment	180
Journal of Speech, Language,…	173
Measurement in Physical…	165
Applied Psychological…	149
Assessment for Effective…	135
Journal of Consulting and…	131
Educational Research and…	130
Psychometrika	120
Research on Social Work…	120
Assessment & Evaluation in…	119
Educational Sciences: Theory…	119
Language Testing	118
International Journal of…	117
Applied Measurement in…	111
ETS Research Report Series	105
Assessment	100
More ▼

No Child Left Behind Act 2001	136
Individuals with Disabilities…	44
Race to the Top	27
Elementary and Secondary…	19
Every Student Succeeds Act…	19
Elementary and Secondary…	15
Individuals with Disabilities…	11
American Recovery and…	10
Rehabilitation Act 1973…	8
Americans with Disabilities…	5
Elementary and Secondary…	5
Education Consolidation…	4
Education for All Handicapped…	4
Head Start	4
Individuals with Disabilities…	4
Adoption and Safe Families…	2
Child Abuse Prevention and…	2
Comprehensive Employment and…	2
Education Amendments 1974	2
Education of the Handicapped…	2
Elementary and Secondary…	2
Individuals with Disabilities…	2
Individuals with Disabilities…	2
Kentucky Education Reform Act…	2
Title IX Education Amendments…	2
More ▼

General Aptitude Test Battery	463
Wechsler Intelligence Scale…	175
Peabody Picture Vocabulary…	88
SAT (College Admission Test)	85
Test of English as a Foreign…	79
Wechsler Adult Intelligence…	74
Strengths and Difficulties…	62
Program for International…	59
Child Behavior Checklist	58
National Assessment of…	56
Minnesota Multiphasic…	52
Stanford Achievement Tests	52
ACT Assessment	49
Beck Depression Inventory	48
Autism Diagnostic Observation…	45
Stanford Binet Intelligence…	45
Woodcock Johnson Tests of…	45
Motivated Strategies for…	43
Raven Progressive Matrices	43
Behavior Assessment System…	42
Graduate Record Examinations	41
Iowa Tests of Basic Skills	41
Marlowe Crowne Social…	41
Kaufman Assessment Battery…	38
Vineland Adaptive Behavior…	37
More ▼