Showing 1 to 15 of 412 results
Peer reviewed
Direct link
Susan K. Johnsen – Gifted Child Today, 2025
The author describes reliability and the areas educators should examine to determine whether an assessment is consistent and trustworthy for use, and how reliability evidence should be interpreted when making decisions about students. Reliability areas discussed in the column include internal consistency, test-retest or stability, inter-scorer…
Descriptors: Test Reliability, Academically Gifted, Student Evaluation, Error of Measurement
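The indices the column names have standard sample estimators; as a quick reference, a minimal numpy sketch of the first two (function names are ours, not the column's):

```python
import numpy as np

def cronbach_alpha(items):
    """Internal consistency: items is an (n_respondents, n_items) score matrix."""
    items = np.asarray(items, dtype=float)
    k = items.shape[1]
    item_vars = items.var(axis=0, ddof=1).sum()
    total_var = items.sum(axis=1).var(ddof=1)
    return (k / (k - 1)) * (1 - item_vars / total_var)

def test_retest(scores_t1, scores_t2):
    """Stability: Pearson correlation between two administrations."""
    return float(np.corrcoef(scores_t1, scores_t2)[0, 1])
```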
Peer reviewed
Direct link
Hsin-Yun Lee; You-Lin Chen; Li-Jen Weng – Journal of Experimental Education, 2024
The second version of Kaiser's Measure of Sampling Adequacy (MSA₂) has been widely applied to assess the factorability of data in psychological research. MSA₂ is defined at the population level, and little is known about its behavior in finite samples. If estimated MSA₂s are biased due to sampling errors,…
Descriptors: Error of Measurement, Reliability, Sampling, Statistical Bias
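For readers unfamiliar with the index, here is a plain-sample sketch of an overall KMO-style measure of sampling adequacy. Whether this corresponds exactly to the article's MSA₂ is our assumption; the abstract's point is precisely that plugging sample correlations into a population-defined index invites bias:

```python
import numpy as np

def msa_overall(data):
    """Overall measure of sampling adequacy: off-diagonal squared correlations
    relative to squared correlations plus squared partial correlations."""
    r = np.corrcoef(data, rowvar=False)       # zero-order correlations
    r_inv = np.linalg.inv(r)
    d = np.sqrt(np.outer(np.diag(r_inv), np.diag(r_inv)))
    partial = -r_inv / d                      # partial correlations
    off = ~np.eye(r.shape[0], dtype=bool)
    r2 = (r[off] ** 2).sum()
    p2 = (partial[off] ** 2).sum()
    return r2 / (r2 + p2)
```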
Peer reviewed
Direct link
Kathryn J. Greenslade; Julia K. Bushell; Emily F. Dillon; Amy E. Ramage – International Journal of Language & Communication Disorders, 2025
Background: Pragmatic communication difficulties encompass many distinct behaviours, including the use of vague and/or insufficient language, a common characteristic following traumatic brain injury (TBI) that negatively impacts psychosocial outcomes. Existing assessments evaluate pragmatic communication broadly, often with only one or two items…
Descriptors: Neurological Impairments, Head Injuries, Language Impairments, Language Tests
Peer reviewed
Direct link
Ole J. Kemi – Advances in Physiology Education, 2025
Students are assessed by coursework and/or exams, all of which are marked by assessors (markers). Student and marker performances are then subject to end-of-session handling and analysis by a board of examiners. This occurs annually and is the basis for evaluating not only students but also the wider learning and teaching efficiency of an academic institution.…
Descriptors: Undergraduate Students, Evaluation Methods, Evaluation Criteria, Academic Standards
Peer reviewed
PDF on ERIC: Download full text
Eser, Mehmet Taha; Aksu, Gökhan – International Journal of Curriculum and Instruction, 2022
Agreement between raters is examined within the scope of the concept of "inter-rater reliability". Although there are clear definitions of inter-rater agreement and inter-rater reliability, there is no clear information about the conditions under which agreement and reliability methods are appropriate to…
Descriptors: Generalizability Theory, Interrater Reliability, Evaluation Methods, Test Theory
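The agreement-versus-reliability distinction the abstract draws is easy to see in code: two raters can be perfectly consistent yet never agree exactly. A hedged sketch (Cohen's kappa is one common agreement index; the paper itself works in a generalizability-theory framework):

```python
import numpy as np

def cohens_kappa(r1, r2):
    """Chance-corrected agreement between two raters on nominal codes."""
    r1, r2 = np.asarray(r1), np.asarray(r2)
    cats = np.union1d(r1, r2)
    po = np.mean(r1 == r2)                               # observed agreement
    pe = sum(np.mean(r1 == c) * np.mean(r2 == c) for c in cats)
    return (po - pe) / (1 - pe)

# A rater who is uniformly one point stricter is perfectly consistent
# (r = 1.0) yet never agrees exactly, so kappa is at or below zero.
a = np.array([3, 4, 2, 5, 4])
b = a - 1
print(np.corrcoef(a, b)[0, 1], np.mean(a == b), cohens_kappa(a, b))
```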
Peer reviewed
Direct link
Tenko Raykov; George A. Marcoulides; Natalja Menold – Applied Measurement in Education, 2024
We discuss an application of Bayesian factor analysis for estimation of the optimal linear combination and associated maximal reliability of a multi-component measuring instrument. The described procedure yields point and credibility interval estimates of this reliability coefficient, which are readily obtained in educational and behavioral…
Descriptors: Bayesian Statistics, Test Reliability, Error of Measurement, Measurement Equipment
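The quantity being estimated has a closed-form point expression under a one-factor model; the article's contribution is the Bayesian machinery that adds credibility intervals. A sketch of the classical point formula (coefficient H), with names of our own choosing:

```python
import numpy as np

def maximal_reliability(loadings, error_vars):
    """Point estimate of maximal reliability (coefficient H) and the
    optimal composite weights under a fitted one-factor model."""
    lam = np.asarray(loadings, dtype=float)
    theta = np.asarray(error_vars, dtype=float)
    s = np.sum(lam ** 2 / theta)
    h = s / (1 + s)                             # maximal reliability
    w = (lam / theta) / np.sum(lam / theta)     # optimal weights, normalized
    return h, w

h, w = maximal_reliability([0.8, 0.7, 0.6], [0.36, 0.51, 0.64])
```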
Peer reviewed
Direct link
Augustin Mutak; Robert Krause; Esther Ulitzsch; Sören Much; Jochen Ranger; Steffi Pohl – Journal of Educational Measurement, 2024
Understanding the intraindividual relation between an individual's speed and ability in testing scenarios is essential to ensure a fair assessment. Different approaches exist for estimating this relationship, relying either on specific study designs or on specific assumptions. This paper aims to add to the toolbox of approaches for estimating…
Descriptors: Testing, Academic Ability, Time on Task, Correlation
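One reason different approaches can disagree is that observed proxies attenuate the latent speed-ability correlation. A toy simulation, with all values assumed rather than taken from the paper:

```python
import numpy as np

rng = np.random.default_rng(0)
n = 2000
rho = -0.4                                    # true speed-ability correlation (assumed)
cov = [[1.0, rho], [rho, 1.0]]
theta, tau = rng.multivariate_normal([0.0, 0.0], cov, size=n).T
score = theta + rng.normal(0.0, 0.5, n)       # noisy ability proxy
speed = tau + rng.normal(0.0, 0.5, n)         # noisy speed proxy (e.g., -mean log RT)
print(np.corrcoef(score, speed)[0, 1])        # attenuated relative to rho
```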
Denis Dumas; Selcuk Acar; Kelly Berthiaume; Peter Organisciak; David Eby; Katalin Grajzel; Theadora Vlaamster; Michele Newman; Melanie Carrera – Grantee Submission, 2023
Open-ended verbal creativity assessments are commonly administered in psychological research and in educational practice to elementary-aged children. Children's responses are then typically rated by teams of judges who are trained to identify original ideas, hopefully with a degree of inter-rater agreement. Even in cases where the judges are…
Descriptors: Elementary School Students, Grade 3, Grade 4, Grade 5
Peer reviewed
PDF on ERIC: Download full text
Gökhan Iskifoglu – Turkish Online Journal of Educational Technology - TOJET, 2024
This research paper investigated the importance of conducting measurement invariance analysis when developing measurement tools to assess differences between and among study variables. Most studies that develop an inventory to assess an attitude, behavior, belief, IQ, or intuition in a person's…
Descriptors: Testing, Testing Problems, Error of Measurement, Attitude Measures
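Formal invariance testing compares nested multi-group CFA models (configural, then equal loadings, then equal intercepts). As a rough intuition pump only, here is an informal check that loading patterns look similar across two simulated groups:

```python
import numpy as np

def first_factor_loadings(X):
    """Crude one-factor loadings from the leading eigenvector of the
    correlation matrix (a stand-in for a proper CFA)."""
    r = np.corrcoef(X, rowvar=False)
    vals, vecs = np.linalg.eigh(r)
    pc = vecs[:, -1] * np.sqrt(vals[-1])
    return pc if pc.sum() >= 0 else -pc       # fix arbitrary sign

# Informal metric-invariance check: do loading patterns match across groups?
rng = np.random.default_rng(1)
lam = np.array([[0.8, 0.7, 0.6]])
group_a = rng.normal(size=(300, 1)) @ lam + rng.normal(0, 0.6, (300, 3))
group_b = rng.normal(size=(300, 1)) @ lam + rng.normal(0, 0.6, (300, 3))
print(first_factor_loadings(group_a).round(2))
print(first_factor_loadings(group_b).round(2))
```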
Peer reviewed
Direct link
Hwanggyu Lim; Danqi Zhu; Edison M. Choe; Kyung T. Han – Journal of Educational Measurement, 2024
This study presents a generalized version of the residual differential item functioning (RDIF) detection framework in item response theory, named GRDIF, to analyze differential item functioning (DIF) in multiple groups. The GRDIF framework retains the advantages of the original RDIF framework, such as computational efficiency and ease of…
Descriptors: Item Response Theory, Test Bias, Test Reliability, Test Construction
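As intuition for "residual" DIF (not the GRDIF statistics themselves, which are IRT-model-based with known asymptotics), a stratified rest-score sketch that extends to any number of groups:

```python
import numpy as np

def residual_dif(resp, item, group, n_strata=10):
    """Mean observed-minus-expected score on one item, per group,
    matching test takers on their rest score (total minus the studied item)."""
    resp = np.asarray(resp, dtype=float)
    group = np.asarray(group)
    rest = resp.sum(axis=1) - resp[:, item]
    resid = np.empty(resp.shape[0])
    for stratum in np.array_split(np.argsort(rest, kind="stable"), n_strata):
        resid[stratum] = resp[stratum, item] - resp[stratum, item].mean()
    return {g: resid[group == g].mean() for g in np.unique(group)}
```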
Peer reviewed
Direct link
M. Van Harskamp; S. De Maeyer; W. Sass; P. Van Petegem; J. Boeve-de Pauw – Environmental Education Research, 2025
There is a need for valid and reliable instruments to assess learning outcomes in education for sustainable development (ESD). Measurement invariance (MI) needs to be established before results of these instruments can be validly compared between groups. Despite its importance, establishing MI is an often overlooked validation step. To provide an…
Descriptors: Measurement, Sustainable Development, Error of Measurement, Questionnaires
Peer reviewed
PDF on ERIC: Download full text
Yanxuan Qu; Sandip Sinharay – ETS Research Report Series, 2024
The goal of this paper is to find better ways to estimate the internal consistency reliability of scores on tests with a specific design that is often encountered in practice: constructed-response items clustered into sections that are not parallel or tau-equivalent, where one of the sections has only one item. To estimate the…
Descriptors: Test Reliability, Essay Tests, Construct Validity, Error of Measurement
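The design described maps onto stratified alpha, where a single-item section is exactly the stumbling block: its reliability is unknown, and a naive choice (like the 0.0 below) distorts the estimate. A sketch of the standard formula, not the paper's proposed estimator:

```python
import numpy as np

def stratified_alpha(sections):
    """Stratified alpha for a total score summed over sections.
    sections: list of (n_persons, k_i) item-score arrays."""
    total_var = np.hstack(sections).sum(axis=1).var(ddof=1)
    penalty = 0.0
    for s in sections:
        s = np.asarray(s, dtype=float)
        k = s.shape[1]
        sec_var = s.sum(axis=1).var(ddof=1)
        if k > 1:
            alpha = (k / (k - 1)) * (1 - s.var(axis=0, ddof=1).sum() / sec_var)
        else:
            # single-item section: its reliability is unknown; this naive 0.0
            # is the very difficulty the paper proposes better estimates for
            alpha = 0.0
        penalty += sec_var * (1 - alpha)
    return 1.0 - penalty / total_var
```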
Peer reviewed
Direct link
Hung-Yu Huang – Educational and Psychological Measurement, 2025
The use of discrete categorical formats to assess psychological traits has a long-standing tradition that is deeply embedded in item response theory models. The increasing prevalence and endorsement of computer- or web-based testing has led to greater focus on continuous response formats, which offer numerous advantages in both respondent…
Descriptors: Response Style (Tests), Psychological Characteristics, Item Response Theory, Test Reliability
Joshua B. Gilbert; James G. Soland; Benjamin W. Domingue – Annenberg Institute for School Reform at Brown University, 2025
Value-Added Models (VAMs) are both common and controversial in education policy and accountability research. While the sensitivity of VAMs to model specification and covariate selection is well documented, the extent to which test scoring methods (e.g., mean scores vs. IRT-based scores) may affect VA estimates is less studied. We examine the…
Descriptors: Value Added Models, Tests, Testing, Scoring
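A toy version of the comparison being studied: compute school-level value-added under two scorings of the same responses and see whether the rankings agree. The regression and names below are illustrative assumptions, not the paper's specification:

```python
import numpy as np

def value_added(post, prior, school):
    """Crude VAM: per-school mean residual from a prior-score regression."""
    post, prior = np.asarray(post, float), np.asarray(prior, float)
    school = np.asarray(school)
    X = np.column_stack([np.ones_like(prior), prior])
    beta, *_ = np.linalg.lstsq(X, post, rcond=None)
    resid = post - X @ beta
    return {s: resid[school == s].mean() for s in np.unique(school)}

# The paper's question, roughly: do the rankings from
#   value_added(mean_scores, prior_mean, school)  and
#   value_added(irt_scores, prior_irt, school)
# agree? (mean_scores / irt_scores are hypothetical arrays, not the paper's data.)
```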
Peer reviewed
Direct link
Li, Minzi; Zhang, Xian – Language Testing, 2021
This meta-analysis explores the correlation between self-assessment (SA) and language performance. Sixty-seven studies with 97 independent samples involving more than 68,500 participants were included in our analysis. It was found that the overall correlation between SA and language performance was 0.466 (p < 0.01). Moderator analysis was…
Descriptors: Meta Analysis, Self Evaluation (Individuals), Likert Scales, Research Reports
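Correlation meta-analyses typically pool via Fisher's z; the snippet does not say whether Li and Zhang used a fixed- or random-effects model, so the fixed-effect sketch below shows only the pooling mechanics:

```python
import numpy as np

def pooled_r(rs, ns):
    """Fixed-effect pooling of correlations via Fisher's z transform."""
    z = np.arctanh(np.asarray(rs, dtype=float))    # Fisher transform
    w = np.asarray(ns, dtype=float) - 3.0          # inverse-variance weights
    return float(np.tanh((w * z).sum() / w.sum()))
```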