ERIC - Search Results

Publication Date

In 2025	6
Since 2024	9
Since 2021 (last 5 years)	25
Since 2016 (last 10 years)	62
Since 2006 (last 20 years)	101

Descriptor

Test Reliability	383
Tests	383
Test Validity	237
Test Construction	98
Testing	66
Statistical Analysis	47
Evaluation Methods	45
Foreign Countries	43
Higher Education	39
Scores	38
Correlation	35
Student Evaluation	35
Measurement Techniques	34
Scoring	34
Test Interpretation	34
Elementary School Students	32
Item Analysis	31
Test Results	31
Factor Analysis	30
Academic Achievement	29
Evaluation	27
Elementary Secondary Education	26
College Students	25
Student Attitudes	25
Test Items	25
More ▼

Education Level

Higher Education	29
Postsecondary Education	24
Elementary Education	20
Secondary Education	16
Junior High Schools	9
Middle Schools	9
Elementary Secondary Education	6
Grade 8	6
Grade 3	4
Grade 5	4
Grade 7	4
High Schools	3
Early Childhood Education	2
Grade 4	2
Grade 6	2
Kindergarten	2
Grade 2	1
Grade 9	1
Intermediate Grades	1
Preschool Education	1
Primary Education	1
More ▼

Audience

Practitioners	12
Administrators	10
Teachers	5
Researchers	4
Policymakers	2
Students	1

Location

Turkey	9
New York	6
United Kingdom	5
United Kingdom (England)	5
Australia	4
Germany	4
Canada	3
Netherlands	3
Taiwan	3
Brazil	2
Florida	2
Jordan	2
New Jersey	2
Pennsylvania	2
United Kingdom (Great Britain)	2
United Kingdom (Scotland)	2
United States	2
Asia	1
Austria	1
Belgium	1
California	1
California (Berkeley)	1
Chile	1
Colombia	1
Connecticut	1
More ▼

Laws, Policies, & Programs

Elementary and Secondary…

What Works Clearinghouse Rating

Showing 1 to 15 of 383 results Save | Export

Peer reviewed

Direct link

Daniel González-Devesa; José Carlos Diz-Gómez; Miguel Adriano Sanchez-Lastra; Aroa Otero Rodríguez; Carlos Ayán-Pérez – Measurement in Physical Education and Exercise Science, 2025

The aim of this study is to examine the available scientific evidence on the reliability and criterion validity of 6-minute run walk field-based test when administered to children and adolescents. Systematic searches were performed in three electronic databases (MEDLINE/PubMed, SPORTDiscuss and Scopus) from their inception until February 2024,…

Descriptors: Child Health, Health Related Fitness, Literature Reviews, Meta Analysis

The Test of Mastication and Swallowing Solids and the Timed Water Swallow Test: Reliability, Associations, Age and Gender Effects, and Normative Data

Peer reviewed

Direct link

Sella-Weiss, Oshrat – International Journal of Language & Communication Disorders, 2023

Background: Quantitative measures can increase precision in describing swallowing function, improve interrater and test-retest reliability, and advance clinical decision-making. The Test of Mastication and Swallowing Solids (TOMASS) and the Timed Water Swallow Test (TWST) are functional tests for swallowing that provide quantitative results. Aims:…

Descriptors: Human Body, Motor Reactions, Tests, Test Reliability

The Sensitivity of Value-Added Estimates to Test Scoring Decisions. EdWorkingPaper No. 25-1226

Download full text

Joshua B. Gilbert; James G. Soland; Benjamin W. Domingue – Annenberg Institute for School Reform at Brown University, 2025

Value-Added Models (VAMs) are both common and controversial in education policy and accountability research. While the sensitivity of VAMs to model specification and covariate selection is well documented, the extent to which test scoring methods (e.g., mean scores vs. IRT-based scores) may affect VA estimates is less studied. We examine the…

Descriptors: Value Added Models, Tests, Testing, Scoring

How Valid and Reliable Are Teachers' Assessments of Gifted Students?

Peer reviewed
PDF on ERIC

Download full text

Sümeyye Arkan; Sema Tan – International Journal of Assessment Tools in Education, 2025

Teachers' perceptions, attitudes, and opinions about students, curricula, or evaluation methods contribute to the development of students' talents. Thus, researchers often collect data from teachers to identify gifted students, determine educational practices to meet the students' needs and assess gifted education programs. Researchers often…

Descriptors: Talent Identification, Academically Gifted, Evaluation Methods, Measurement Techniques

The Importance of Thinking Multivariately When Setting Subscale Cutoff Scores

Peer reviewed

Direct link

Kroc, Edward; Olvera Astivia, Oscar L. – Educational and Psychological Measurement, 2022

Setting cutoff scores is one of the most common practices when using scales to aid in classification purposes. This process is usually done univariately where each optimal cutoff value is decided sequentially, subscale by subscale. While it is widely known that this process necessarily reduces the probability of "passing" such a test,…

Descriptors: Multivariate Analysis, Cutting Scores, Classification, Measurement

Validity and Test-Retest Reliability of a Smartphone App for Measuring Rising Time, Velocity, Power, and Inter-Limb Asymmetry during Single-Leg Sit-to-Stand Test in Female-Trained Athletes

Peer reviewed

Direct link

Yücel Makaraci; Kazim Nas; Kerem Gündüz; Abdullah Uysal; Samuel T. Orange; Juan D. Ruiz-Cárdenas – Measurement in Physical Education and Exercise Science, 2024

The aim was to determine the validity and test-retest reliability of the Sit to Stand App variables (rising time, vertical velocity, and power) for measuring single-leg sit-to-stand (STS) test compared to those derived from ground reaction force data. Twenty-seven female athletes performed the single-leg STS test over three consecutive sessions…

Descriptors: Computer Simulation, Measurement Techniques, Athletics, Physical Fitness

Evaluating the Evaluators: A Comparative Study of AI and Teacher Assessments in Higher Education

Peer reviewed
PDF on ERIC

Download full text

Tugra Karademir Coskun; Ayfer Alper – Digital Education Review, 2024

This study aims to examine the potential differences between teacher evaluations and artificial intelligence (AI) tool-based assessment systems in university examinations. The research has evaluated a wide spectrum of exams including numerical and verbal course exams, exams with different assessment styles (project, test exam, traditional exam),…

Descriptors: Artificial Intelligence, Visual Aids, Video Technology, Tests

A Note on the Use of Categorical Subscores

Peer reviewed

Direct link

Kylie Gorney; Sandip Sinharay – Journal of Educational Measurement, 2025

Although there exists an extensive amount of research on subscores and their properties, limited research has been conducted on categorical subscores and their interpretations. In this paper, we focus on the claim of Feinberg and von Davier that categorical subscores are useful for remediation and instructional purposes. We investigate this claim…

Descriptors: Tests, Scores, Test Interpretation, Alternative Assessment

Assessments Play an Important Role in Serving Students. What's Next: Policy Recommendations from the George W. Bush Institute

Download full text

Anne Wicks; Robin Berkley – George W. Bush Institute, 2025

Assessments are one of the most important--and often misunderstood--elements of education. In most cases, tests are administered by the state as well as by districts and schools. Assessments at each of these levels have distinct purposes, yield different information, and are part of a powerful, coordinated approach to improving student outcomes.…

Descriptors: Student Evaluation, Testing, Tests, Standardized Tests

Item Response Theory Modeling of the Verb Naming Test

Peer reviewed

Direct link

Fergadiotis, Gerasimos; Casilio, Marianne; Dickey, Michael Walsh; Steel, Stacey; Nicholson, Hannele; Fleegle, Mikala; Swiderski, Alexander; Hula, William D. – Journal of Speech, Language, and Hearing Research, 2023

Purpose: Item response theory (IRT) is a modern psychometric framework with several advantageous properties as compared with classical test theory. IRT has been successfully used to model performance on anomia tests in individuals with aphasia; however, all efforts to date have focused on noun production accuracy. The purpose of this study is to…

Descriptors: Item Response Theory, Psychometrics, Verbs, Naming

Exploring Psychometric Properties and Determinants of PLAAFP Quality Scores

Direct link

Christopher M. Claude – ProQuest LLC, 2024

This dissertation comprises three complementary studies that aim to advance the understanding and practice of Individualized Education Programs (IEP) and Present Levels of Academic Achievement and Functional Performance (PLAAFP) development in special education. In the first study, we systematically reviewed empirical research measuring IEP…

Descriptors: Individualized Education Programs, Academic Achievement, Special Education, Measurement

Using Multilabel Neural Network to Score High-Dimensional Assessments for Different Use Foci: An Example with College Major Preference Assessment

Peer reviewed

Direct link

Shun-Fu Hu; Amery D. Wu; Jake Stone – Journal of Educational Measurement, 2025

Scoring high-dimensional assessments (e.g., > 15 traits) can be a challenging task. This paper introduces the multilabel neural network (MNN) as a scoring method for high-dimensional assessments. Additionally, it demonstrates how MNN can score the same test responses to maximize different performance metrics, such as accuracy, recall, or…

Descriptors: Tests, Testing, Scores, Test Construction

"Classroometrics": The Validity, Reliability, and Fairness of Classroom Music Assessments

Peer reviewed

Direct link

Wesolowski, Brian C. – Music Educators Journal, 2020

Validity, reliability, and fairness are three prominent indicators for evaluating the quality of assessment processes. Each of the indicators is most often written about and applied in the context of large-scale assessment. As a result, the technical properties of these indicators make them limited in both their practicality and relevance for…

Descriptors: Music Education, Test Validity, Test Reliability, Student Evaluation

The Ethical Implications of Collecting Data in Educational Settings: Discussion on the Technology and Engineering Attitude Scale (TEAS) and Its Psychometric Validation for Assessing a Pre-Engineering Design Program

Peer reviewed

Direct link

Miranda, Constanza; Goñi, Julian; Pickenpack, Astrid; Sotomayor, Trinidad – International Journal of Technology and Design Education, 2022

K-12 Engineering Education has placed a lot of attention on students' attitudes or predispositions towards science and technology. However, most assessment methods are focused on STEM as a whole or only on technology. In this article, we will discuss the instrument called Technology and Engineering Attitude Scale (TEAS) which focuses on attitudes…

Descriptors: Elementary Secondary Education, Engineering Education, Test Validity, Foreign Countries

The Effect of Chance Success on Equalization Error in Test Equation Based on Classical Test Theory

Peer reviewed
PDF on ERIC

Download full text

Koçak, Duygu – International Journal of Progressive Education, 2020

The aim of this study was to determine the effect of chance success on test equalization. For this purpose, artificially generated 500 and 1000 sample size data sets were synchronized using linear equalization and equal percentage equalization methods. In the data which were produced as a simulative, a total of four cases were created with no…

Descriptors: Test Theory, Equated Scores, Error of Measurement, Sample Size

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 26

Educational and Psychological…	12
Journal of Educational…	6
Research Quarterly for…	6
Measurement in Physical…	5
New York State Education…	5
ProQuest LLC	5
International Journal of…	4
Journal of Research in…	4
Measurement and Evaluation in…	4
Psychometrika	4
Educational Measurement:…	3
Journal of Economic Education	3
Psychological Reports	3
Psychology in the Schools	3
Advances in Health Sciences…	2
Amer J Ment Deficiency	2
Applied Psychological…	2
Educ Psychol Meas	2
Educational Forum	2
European Physical Education…	2
International Journal of…	2
International Journal of…	2
Journal for Research in…	2
Journal of Applied Research…	2
Journal of Educational…	2
More ▼

Weiss, David J.	4
Skoczylas, Rudolph V.	3
Adkins, Dorothy C.	2
Atilgan, Hakan	2
Ebel, Robert L.	2
Evenhuis, Heleen M.	2
Gillmore, Gerald M.	2
Ginther, Joan R.	2
Greenberger, Ellen	2
Guthrie, P. D.	2
Göçer, Ali	2
Hoepfner, Ralph	2
Koos, Eugenia M.	2
Kristof, Walter	2
Linn, Robert L.	2
Lord, Frederic M.	2
Miles, David T.	2
Petrosko, Joseph M.	2
Silverstein, A. B.	2
Walstad, William B.	2
Abad, Francisco J.	1
Abdullah Uysal	1
Adams, David R.	1
Admiraal, Wilfried	1
More ▼

Reports - Research	143
Journal Articles	120
Tests/Questionnaires	19
Guides - Non-Classroom	14
Reports - Evaluative	14
Reports - Descriptive	13
Guides - General	10
Speeches/Meeting Papers	10
Information Analyses	9
Dissertations/Theses -…	5
Opinion Papers	5
Reference Materials -…	5
Collected Works - Serials	2
Dissertations/Theses	2
Book/Product Reviews	1
Books	1
Collected Works - Proceedings	1
ERIC Digests in Full Text	1
ERIC Publications	1
Guides - Classroom - Teacher	1
Numerical/Quantitative Data	1
Reference Materials -…	1
More ▼

Peabody Picture Vocabulary…	4
Bayley Scales of Infant…	2
General Aptitude Test Battery	2
Marlowe Crowne Social…	2
Self Directed Search	2
State Trait Anxiety Inventory	2
Strong Vocational Interest…	2
Wide Range Achievement Test	2
California Achievement Tests	1
College Level Examination…	1
Cornell Critical Thinking Test	1
Defining Issues Test	1
Differential Aptitude Test	1
Dynamic Indicators of Basic…	1
Early Childhood Longitudinal…	1
Graduate Record Examinations	1
Illinois Test of…	1
Kaufman Assessment Battery…	1
Kaufman Test of Educational…	1
Law School Admission Test	1
Minnesota Importance…	1
Minnesota Tests of Creative…	1
National Assessment of…	1
Peabody Developmental Motor…	1
Personal Orientation Inventory	1
More ▼