ERIC - Search Results

Publication Date

In 2026	3
Since 2025	675
Since 2022 (last 5 years)	3176
Since 2017 (last 10 years)	7417
Since 2007 (last 20 years)	15055

Descriptor

Test Reliability	15043
Test Validity	10279
Reliability	9761
Foreign Countries	7144
Test Construction	4825
Validity	4191
Measures (Individuals)	3877
Factor Analysis	3825
Psychometrics	3526
Interrater Reliability	3124
Correlation	3040
Evaluation Methods	2746
Statistical Analysis	2533
Higher Education	2515
Questionnaires	2473
Scores	2386
College Students	2211
Student Attitudes	2148
Comparative Analysis	1943
Factor Structure	1822
Student Evaluation	1695
Rating Scales	1623
Measurement Techniques	1562
Test Items	1528
Construct Validity	1498
More ▼

Author

Thompson, Bruce	44
Tindal, Gerald	41
Raykov, Tenko	39
Erford, Bradley T.	37
Marsh, Herbert W.	36
Feldt, Leonard S.	33
Fraser, Barry J.	33
Brennan, Robert L.	32
Alonzo, Julie	31
Matson, Johnny L.	29
Zimmerman, Donald W.	29
Epstein, Michael H.	26
Briesch, Amy M.	24
Tsai, Chin-Chung	24
Lane, Kathleen Lynne	23
Petscher, Yaacov	23
Anderson, Daniel	22
Hambleton, Ronald K.	22
Michael, William B.	22
Reckase, Mark D.	22
Huynh, Huynh	21
Livingston, Samuel A.	21
Attali, Yigal	19
Elliott, Stephen N.	19
More ▼

Publication Type

Journal Articles	19242
Reports - Research	17430
Reports - Evaluative	3328
Speeches/Meeting Papers	1861
Tests/Questionnaires	1598
Reports - Descriptive	1544
Information Analyses	958
Dissertations/Theses -…	673
Opinion Papers	645
Guides - Non-Classroom	325
Numerical/Quantitative Data	252
Books	135
Guides - Classroom - Teacher	81
Reports - General	70
Guides - General	57
Reference Materials -…	53
Collected Works - General	40
Book/Product Reviews	38
Collected Works - Serials	35
Collected Works - Proceedings	32
ERIC Publications	31
Multilingual/Bilingual…	26
Non-Print Media	22
Dissertations/Theses	21
ERIC Digests in Full Text	20
More ▼

Education Level

Higher Education	4726
Postsecondary Education	3740
Secondary Education	2273
Elementary Education	2197
High Schools	1085
Middle Schools	1033
Elementary Secondary Education	876
Early Childhood Education	874
Junior High Schools	715
Primary Education	427
Intermediate Grades	401
Preschool Education	385
Grade 5	342
Grade 8	325
Grade 4	318
Grade 6	299
Grade 7	279
Grade 3	270
Kindergarten	267
Adult Education	211
Grade 1	202
Grade 2	173
Grade 9	154
Grade 10	140
Grade 11	109
More ▼

Audience

Researchers	709
Practitioners	451
Teachers	208
Administrators	122
Policymakers	66
Counselors	42
Students	38
Parents	11
Community	7
Support Staff	6
Media Staff	5
More ▼

Location

Turkey	1328
Australia	436
Canada	379
China	368
United States	271
United Kingdom	256
Indonesia	253
Taiwan	234
Netherlands	223
Spain	217
California	215
Germany	197
United Kingdom (England)	192
Malaysia	170
Hong Kong	161
Florida	159
Iran	156
Nigeria	149
South Korea	135
Texas	134
India	127
New York	119
Pennsylvania	114
South Africa	109
Japan	106
More ▼

What Works Clearinghouse Rating

Meets WWC Standards without Reservations	8
Meets WWC Standards with or without Reservations	9
Does not meet standards	6

Showing 1 to 15 of 27,107 results Save | Export

Technical Adequacy-Reliability

Peer reviewed

Direct link

Susan K. Johnsen – Gifted Child Today, 2025

The author provides information about reliability and areas that educators should examine in determining if an assessment is consistent and trustworthy for use, and how it should be interpreted in making decisions about students. Reliability areas that are discussed in the column include internal consistency, test-retest or stability, inter-scorer…

Descriptors: Test Reliability, Academically Gifted, Student Evaluation, Error of Measurement

Test-Retest and Inter-Rater Reliability for Selected Outcomes from a Wearable 3D Inertial Sensor over Different Stable and Unstable Postural Conditions: A Validation Study

Peer reviewed

Direct link

Samuel D'Emanuele; Francesca Nardello; Fabrizio Garau; Diego Campaci; Federico Schena; Cantor Tarperi – Measurement in Physical Education and Exercise Science, 2025

The agreement between a wearable inertial sensor (GYKO, G) and the force platform (P) was assessed by evaluating "test-retest" and "inter-rater reliability." Thirty-eight subjects were enrolled; the selected indices of balance were investigated over foot positions and (un)stable conditions. Intraclass correlation coefficient…

Descriptors: Human Posture, Measurement Equipment, Interrater Reliability, Measurement Techniques

Brief Research Report: Effects of Sampling Error and Categorization on Estimation of Measure of Sampling Adequacy

Peer reviewed

Direct link

Hsin-Yun Lee; You-Lin Chen; Li-Jen Weng – Journal of Experimental Education, 2024

The second version of Kaiser's Measure of Sampling Adequacy (MSA[subscript 2]) has been widely applied to assess the factorability of data in psychological research. The MSA[subscript 2] is developed in the population and little is known about its behavior in finite samples. If estimated MSA[subscript 2]s are biased due to sampling errors,…

Descriptors: Error of Measurement, Reliability, Sampling, Statistical Bias

Validity and Reliability of the Stuttering Severity Instrument--Fourth Edition for School-Aged Children and Adult Arabic-Speaking People Who Stutter

Peer reviewed

Direct link

Mazin T. Alqhazo; Tha’er Al-Kadi; Firas S. Alfwaress – Language, Speech, and Hearing Services in Schools, 2025

Purpose: The Stuttering Severity Instrument--Fourth Edition (SSI-4) is unavailable in Arabic language. The purpose of the current research is to translate the SSI-4 (Riley, 2009) into Arabic and to discuss its validity, as well as its intrajudge and interjudge reliability. Method: Archived videos of 28 school-aged children who stutter ranged in…

Descriptors: Arabic, Translation, Test Validity, Test Reliability

Self-Assessment Survey: Evaluation of a Revised Measure Assessing Positive Behavioral Interventions and Supports

Peer reviewed

Direct link

Angus Kittelman; Sara Izzard; Kent McIntosh; Kelsey R. Morris; Timothy J. Lewis – Assessment for Effective Intervention, 2024

The purpose of this study was to evaluate the psychometric properties of the Self-Assessment Survey (SAS) 4.0, an updated measure assessing implementation fidelity of positive behavioral interventions and supports (PBIS). A total of 627 school personnel from 33 schools in six U.S. states completed the SAS 4.0 during the 2021-2022 school year. We…

Descriptors: Positive Behavior Supports, Teachers, Self Evaluation (Individuals), Test Reliability

How Consistent Are Humans When Grading Programming Assignments?

Peer reviewed

Direct link

Marcus Messer; Neil C. C. Brown; Michael Kölling; Miaojing Shi – ACM Transactions on Computing Education, 2025

Providing consistent summative assessment to students is important, as the grades they are awarded affect their progression through university and future career prospects. While small cohorts are typically assessed by a single assessor, such as the module/class leader, larger cohorts are often assessed by multiple assessors, typically teaching…

Descriptors: Foreign Countries, Grading, Interrater Reliability, Teaching Assistants

Grading Exams Using Large Language Models: A Comparison between Human and AI Grading of Exams in Higher Education Using ChatGPT

Peer reviewed

Direct link

Jonas Flodén – British Educational Research Journal, 2025

This study compares how the generative AI (GenAI) large language model (LLM) ChatGPT performs in grading university exams compared to human teachers. Aspects investigated include consistency, large discrepancies and length of answer. Implications for higher education, including the role of teachers and ethics, are also discussed. Three…

Descriptors: College Faculty, Artificial Intelligence, Comparative Testing, Scoring

Reporting and Measuring English School Qualifications: A Case Study of General Certificate of Secondary Education Results in Survey and Linked Administrative Data in the UK Millennium Cohort Study

Peer reviewed

Direct link

Sarah Stopforth; Roxanne Connelly; Vernon Gayle – Cambridge Journal of Education, 2025

Data on educational qualifications is essential in many research domains. The UK Millennium Cohort Study collected self-reported General Certificate of Secondary Education (GCSE) data in sweep 7 (cohort members aged 17). GCSE data from the National Pupil Database (NPD) has been linked to the MCS. This study investigates the consistency of these…

Descriptors: Foreign Countries, Adolescents, Case Studies, Secondary Education

Reliability of Ratings of an English Language Arts Curriculum with the Curriculum Evaluation Guidelines

Peer reviewed

Direct link

Matthew K. Burns; Heba Z. Abdelnaby; Jonie B. Welland; Katherine A. Graves; Kari Kurto – Assessment for Effective Intervention, 2024

The current study examined the reliability of The Reading League Curriculum-Evaluation Guidelines (CEGs), which were developed to help school-based teams rate the presence of red flags when considering adopting specific literacy curricula. Coders (n = 30) independently used the CEGs to evaluate a free online English language arts curriculum. The…

Descriptors: English Curriculum, English Instruction, Language Arts, Curriculum Evaluation

Reliable Assessment of Pain Behaviour in Adults with Profound Intellectual and Multiple Disabilities: The Development of an Instruction Protocol

Peer reviewed

Direct link

Enninga, Annemieke; Waninge, Aly; Post, Wendy J.; van der Putten, Annette A. J. – Journal of Applied Research in Intellectual Disabilities, 2023

Background: Persons with profound intellectual and multiple disabilities (PIMD) are vulnerable when it comes to experiencing pain. Reliable assessment of pain-related behaviour in these persons is difficult. "Aim" To determine how pain items can be reliably scored in adults with PIMD. Methods: We developed an instruction protocol for the…

Descriptors: Test Reliability, Pain, Behavior, Adults

Which Blueberries Are Better Value? The Development and Validation of the Functional Numeracy Assessment for Adults with Aphasia

Peer reviewed

Direct link

Ichikowitz, Kerri; Bruce, Carolyn; Meitanis, Vanessa; Cheung, Kelly; Kim, Yekyung; Talbourdet, Esther; Newton, Caroline – International Journal of Language & Communication Disorders, 2023

Background: People with aphasia (PWA) can experience functional numeracy difficulties, that is, problems understanding or using numbers in everyday life, which can have numerous negative impacts on their daily lives. There is growing interest in designing functional numeracy interventions for PWA; however, there are limited suitable assessments…

Descriptors: Test Construction, Test Validity, Numeracy, Adults

Using Automated Procedures to Score Educational Essays Written in Three Languages

Peer reviewed

Direct link

Tahereh Firoozi; Hamid Mohammadi; Mark J. Gierl – Journal of Educational Measurement, 2025

The purpose of this study is to describe and evaluate a multilingual automated essay scoring (AES) system for grading essays in three languages. Two different sentence embedding models were evaluated within the AES system, multilingual BERT (mBERT) and language-agnostic BERT sentence embedding (LaBSE). German, Italian, and Czech essays were…

Descriptors: College Students, Slavic Languages, German, Italian

Engaging Classroom Observation: A Brief Measure of Active Learning in the College Classroom

Peer reviewed

Direct link

Chase Young; Benjamin Mitchell-Yellin; George Kevin Randall – Active Learning in Higher Education, 2025

The purpose of this study was to develop a valid, reliable, and brief measure of active learning in college classrooms that is cheap and easy to complete and yields results that faculty can easily use to inform their development as instructors. Initial construct and face validity was achieved by modifying existing instruments and creating a draft…

Descriptors: College Faculty, College Students, Active Learning, Classroom Observation Techniques

Validity and Intrarater Reliability of the Fysiometer--Measuring Eccentric Knee Flexor Force during the Nordic Hamstring Exercise

Peer reviewed

Direct link

Morten Pallisgaard Støve; Mathias Kringelholt Kristensen; Jonas Nielsen; Lea Dyhrberg Madsen – Measurement in Physical Education and Exercise Science, 2025

Between limb strength, asymmetry is a leading risk factor for hamstring strain re-injury. However, few accurate testing methodologies are available in clinical settings. This study examined the validity and reliability of eccentric knee flexor torque measured with a novel Nordic Hamstring Device. Twenty-seven healthy participants were assessed in…

Descriptors: Validity, Reliability, Human Body, Foreign Countries

Inter-Evaluator Reliability of Sagittal and Rotational Spinal Measurements from 3D Ultrasound Imaging of Healthy Females in Standing with Varying Arm Positions

Peer reviewed

Direct link

Aislinn Ganci; Miran Qazizada; Brianna Fehr; Ana Vucenovic; Edmond Lou; Eric Parent – Measurement in Physical Education and Exercise Science, 2024

Spinal alignment can be assessed without radiation using three-dimensional ultrasound imaging (3DUS). Reliable measurements could inform the ideal arm position for scoliosis radiographs. This study determined the inter-evaluator reliability of axial vertebral rotation (AVR) measurements and sagittal curve angles in healthy females from 3DUS spinal…

Descriptors: Foreign Countries, Young Adults, Adults, Adolescents

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 1808

Educational and Psychological…	810
ProQuest LLC	659
Journal of Psychoeducational…	399
Online Submission	327
Journal of Educational…	252
Journal of Autism and…	233
Psychology in the Schools	232
Measurement and Evaluation in…	230
Grantee Submission	184
Psychological Assessment	180
Journal of Speech, Language,…	174
Measurement in Physical…	170
Applied Psychological…	149
Assessment for Effective…	138
International Journal of…	134
Journal of Consulting and…	131
Educational Research and…	130
Assessment & Evaluation in…	124
Language Testing	120
Psychometrika	120
Research on Social Work…	120
Educational Sciences: Theory…	119
Applied Measurement in…	111
International Journal of…	110
ETS Research Report Series	106
More ▼

No Child Left Behind Act 2001	136
Individuals with Disabilities…	44
Race to the Top	27
Elementary and Secondary…	20
Every Student Succeeds Act…	20
Elementary and Secondary…	16
Individuals with Disabilities…	11
American Recovery and…	10
Rehabilitation Act 1973…	8
Americans with Disabilities…	5
Elementary and Secondary…	5
Head Start	5
Education Consolidation…	4
Education for All Handicapped…	4
Individuals with Disabilities…	4
Adoption and Safe Families…	2
Child Abuse Prevention and…	2
Comprehensive Employment and…	2
Education Amendments 1974	2
Education of the Handicapped…	2
Elementary and Secondary…	2
Individuals with Disabilities…	2
Individuals with Disabilities…	2
Kentucky Education Reform Act…	2
Title IX Education Amendments…	2
More ▼

General Aptitude Test Battery	463
Wechsler Intelligence Scale…	176
Peabody Picture Vocabulary…	88
SAT (College Admission Test)	86
Test of English as a Foreign…	82
Wechsler Adult Intelligence…	74
Strengths and Difficulties…	66
Program for International…	62
Child Behavior Checklist	59
National Assessment of…	56
ACT Assessment	52
Minnesota Multiphasic…	52
Stanford Achievement Tests	52
Beck Depression Inventory	50
Autism Diagnostic Observation…	47
Stanford Binet Intelligence…	45
Woodcock Johnson Tests of…	45
Motivated Strategies for…	43
Raven Progressive Matrices	43
Behavior Assessment System…	42
Graduate Record Examinations	41
Iowa Tests of Basic Skills	41
Marlowe Crowne Social…	41
Vineland Adaptive Behavior…	39
Kaufman Assessment Battery…	38
More ▼