ERIC - Search Results

Publication Date

In 2025	15
Since 2024	44
Since 2021 (last 5 years)	81
Since 2016 (last 10 years)	138
Since 2006 (last 20 years)	197

Descriptor

Error of Measurement	410
Test Reliability	410
Test Validity	123
Scores	78
Foreign Countries	64
Test Construction	61
Mathematical Models	54
Statistical Analysis	54
Test Items	53
Correlation	49
Item Response Theory	47
Psychometrics	47
Item Analysis	44
Measurement Techniques	43
Test Theory	43
Test Interpretation	40
Testing Problems	39
Evaluation Methods	37
True Scores	37
Scoring	36
Factor Analysis	34
Comparative Analysis	30
Criterion Referenced Tests	30
Interrater Reliability	29
Measurement	26
More ▼

Publication Type

Reports - Research	256
Journal Articles	247
Reports - Evaluative	54
Speeches/Meeting Papers	46
Reports - Descriptive	31
Numerical/Quantitative Data	15
Opinion Papers	10
Tests/Questionnaires	7
Dissertations/Theses -…	6
Guides - Non-Classroom	5
Information Analyses	5
Collected Works - General	2
Guides - General	2
Reports - General	2
Collected Works - Serials	1
Non-Print Media	1
Reference Materials -…	1
More ▼

Education Level

Elementary Education	31
Secondary Education	31
Higher Education	29
Postsecondary Education	26
Elementary Secondary Education	21
Middle Schools	16
High Schools	13
Junior High Schools	13
Grade 3	11
Grade 4	11
Grade 5	11
Early Childhood Education	9
Intermediate Grades	9
Primary Education	9
Grade 8	7
Grade 7	6
Grade 6	5
Grade 10	3
Grade 9	3
Grade 11	2
Grade 12	2
Kindergarten	2
Adult Education	1
More ▼

Audience

Researchers	11
Administrators	2
Counselors	1
Practitioners	1
Teachers	1

Location

Canada	7
Germany	6
United Kingdom (England)	6
Australia	5
Netherlands	5
New York	5
Spain	5
Indonesia	4
Turkey	4
Florida	3
Denmark	2
Italy	2
Malaysia	2
New Jersey	2
New Mexico	2
New Zealand	2
Norway	2
South Africa	2
South Korea	2
United Kingdom	2
Virginia	2
Belgium	1
California	1
China	1
Ecuador	1
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001	2
Every Student Succeeds Act…	1
Race to the Top	1

What Works Clearinghouse Rating

Test Reliability X

Showing 1 to 15 of 410 results Save | Export

Technical Adequacy-Reliability

Peer reviewed

Direct link

Susan K. Johnsen – Gifted Child Today, 2025

The author provides information about reliability and areas that educators should examine in determining if an assessment is consistent and trustworthy for use, and how it should be interpreted in making decisions about students. Reliability areas that are discussed in the column include internal consistency, test-retest or stability, inter-scorer…

Descriptors: Test Reliability, Academically Gifted, Student Evaluation, Error of Measurement

Bayesian Maximal Reliability Evaluation Using Latent Variable Modeling

Peer reviewed

Direct link

Tenko Raykov; George A. Marcoulides; Natalja Menold – Applied Measurement in Education, 2024

We discuss an application of Bayesian factor analysis for estimation of the optimal linear combination and associated maximal reliability of a multi-component measuring instrument. The described procedure yields point and credibility interval estimates of this reliability coefficient, which are readily obtained in educational and behavioral…

Descriptors: Bayesian Statistics, Test Reliability, Error of Measurement, Measurement Equipment

Brief Research Report: Effects of Sampling Error and Categorization on Estimation of Measure of Sampling Adequacy

Peer reviewed

Direct link

Hsin-Yun Lee; You-Lin Chen; Li-Jen Weng – Journal of Experimental Education, 2024

The second version of Kaiser's Measure of Sampling Adequacy (MSA[subscript 2]) has been widely applied to assess the factorability of data in psychological research. The MSA[subscript 2] is developed in the population and little is known about its behavior in finite samples. If estimated MSA[subscript 2]s are biased due to sampling errors,…

Descriptors: Error of Measurement, Reliability, Sampling, Statistical Bias

Modeling the Intraindividual Relation of Ability and Speed within a Test

Peer reviewed

Direct link

Augustin Mutak; Robert Krause; Esther Ulitzsch; Sören Much; Jochen Ranger; Steffi Pohl – Journal of Educational Measurement, 2024

Understanding the intraindividual relation between an individual's speed and ability in testing scenarios is essential to assure a fair assessment. Different approaches exist for estimating this relationship, that either rely on specific study designs or on specific assumptions. This paper aims to add to the toolbox of approaches for estimating…

Descriptors: Testing, Academic Ability, Time on Task, Correlation

A Theoretical Suggestion on Testing Measurement Invariance in Adapting Parametric Measurement Tools

Peer reviewed
PDF on ERIC

Download full text

Gökhan Iskifoglu – Turkish Online Journal of Educational Technology - TOJET, 2024

This research paper investigated the importance of conducting measurement invariance analysis in developing measurement tools for assessing differences between and among study variables. Most of the studies, which tended to develop an inventory to assess the existence of an attitude, behavior, belief, IQ, or an intuition in a person's…

Descriptors: Testing, Testing Problems, Error of Measurement, Attitude Measures

Detecting Differential Item Functioning among Multiple Groups Using IRT Residual DIF Framework

Peer reviewed

Direct link

Hwanggyu Lim; Danqi Zhu; Edison M. Choe; Kyung T. Han – Journal of Educational Measurement, 2024

This study presents a generalized version of the residual differential item functioning (RDIF) detection framework in item response theory, named GRDIF, to analyze differential item functioning (DIF) in multiple groups. The GRDIF framework retains the advantages of the original RDIF framework, such as computational efficiency and ease of…

Descriptors: Item Response Theory, Test Bias, Test Reliability, Test Construction

Measurement Invariance of the Action Competence in Sustainable Development Questionnaire: Can We Compare between Groups?

Peer reviewed

Direct link

M. Van Harskamp; S. De Maeyer; W. Sass; P. Van Petegem; J. Boeve-de Pauw – Environmental Education Research, 2025

There is a need for valid and reliable instruments to assess learning outcomes in education for sustainable development (ESD). Measurement invariance (MI) needs to be established before results of these instruments can be validly compared between groups. Despite its importance, establishing MI is an often overlooked validation step. To provide an…

Descriptors: Measurement, Sustainable Development, Error of Measurement, Questionnaires

Estimating Reliability for Tests with One Constructed-Response Item in a Section. Research Report. ETS RR-24-07

Peer reviewed
PDF on ERIC

Download full text

Yanxuan Qu; Sandip Sinharay – ETS Research Report Series, 2024

The goal of this paper is to find better ways to estimate the internal consistency reliability of scores on tests with a specific type of design that are often encountered in practice: tests with constructed-response items clustered into sections that are not parallel or tau-equivalent, and one of the sections has only one item. To estimate the…

Descriptors: Test Reliability, Essay Tests, Construct Validity, Error of Measurement

Exploring the Influence of Response Styles on Continuous Scale Assessments: Insights from a Novel Modeling Approach

Peer reviewed

Direct link

Hung-Yu Huang – Educational and Psychological Measurement, 2025

The use of discrete categorical formats to assess psychological traits has a long-standing tradition that is deeply embedded in item response theory models. The increasing prevalence and endorsement of computer- or web-based testing has led to greater focus on continuous response formats, which offer numerous advantages in both respondent…

Descriptors: Response Style (Tests), Psychological Characteristics, Item Response Theory, Test Reliability

The Sensitivity of Value-Added Estimates to Test Scoring Decisions. EdWorkingPaper No. 25-1226

Download full text

Joshua B. Gilbert; James G. Soland; Benjamin W. Domingue – Annenberg Institute for School Reform at Brown University, 2025

Value-Added Models (VAMs) are both common and controversial in education policy and accountability research. While the sensitivity of VAMs to model specification and covariate selection is well documented, the extent to which test scoring methods (e.g., mean scores vs. IRT-based scores) may affect VA estimates is less studied. We examine the…

Descriptors: Value Added Models, Tests, Testing, Scoring

Evaluating Test-Retest Reliability and Measurement Error of the Lumbar Flexion Relaxation Ratio within and between Days

Peer reviewed

Direct link

Samuel J. Howarth; Erinn McCreath Frangakis; Steven Hirsch; Diana De Carvalho – Measurement in Physical Education and Exercise Science, 2024

The flexion relaxation ratio (FRR) of the lumbar extensor muscles is often assessed in experimental and clinical studies. This study evaluated within- and between-session test--retest reliability and measurement error for different FRR formulations. Participants completed two identical data collection sessions 1-week apart. Spine flexion and…

Descriptors: Exercise Physiology, Human Body, Pretests Posttests, Error of Measurement

Investigating Structural Model Fit Evaluation

Peer reviewed

Direct link

Xijuan Zhang; Hao Wu – Structural Equation Modeling: A Multidisciplinary Journal, 2024

A full structural equation model (SEM) typically consists of both a measurement model (describing relationships between latent variables and observed scale items) and a structural model (describing relationships among latent variables). However, often researchers are primarily interested in testing hypotheses related to the structural model while…

Descriptors: Structural Equation Models, Goodness of Fit, Robustness (Statistics), Factor Structure

Do Different Devices Perform Equally Well with Different Numbers of Scale Points and Response Formats? A Test of Measurement Invariance and Reliability

Peer reviewed

Direct link

Natalja Menold; Vera Toepoel – Sociological Methods & Research, 2024

Research on mixed devices in web surveys is in its infancy. Using a randomized experiment, we investigated device effects (desktop PC, tablet and mobile phone) for six response formats and four different numbers of scale points. N = 5,077 members of an online access panel participated in the experiment. An exact test of measurement invariance and…

Descriptors: Online Surveys, Handheld Devices, Telecommunications, Test Reliability

Linking Errors Introduced by Rapid Guessing Responses When Employing Multigroup Concurrent IRT Scaling

Direct link

Jiayi Deng – ProQuest LLC, 2024

Test score comparability in international large-scale assessments (LSA) is of utmost importance in measuring the effectiveness of education systems and understanding the impact of education on economic growth. To effectively compare test scores on an international scale, score linking is widely used to convert raw scores from different linguistic…

Descriptors: Item Response Theory, Scoring Rubrics, Scoring, Error of Measurement

Evaluating the Discrepancy between Scale Reliability and Cronbach's Coefficient Alpha Using Latent Variable Modeling

Peer reviewed

Direct link

Raykov, Tenko; Marcoulides, George A. – Measurement: Interdisciplinary Research and Perspectives, 2023

This article outlines a readily applicable procedure for point and interval estimation of the population discrepancy between reliability and the popular Cronbach's coefficient alpha for unidimensional multi-component measuring instruments with uncorrelated errors, which are widely used in behavioral and social research. The method is developed…

Descriptors: Measurement, Test Reliability, Measurement Techniques, Error of Measurement

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 28

Educational and Psychological…	33
Journal of Educational…	21
Psychometrika	12
Grantee Submission	11
Applied Psychological…	10
Journal of Experimental…	9
Educational Measurement:…	7
Journal of Psychoeducational…	7
Measurement in Physical…	6
ProQuest LLC	6
Applied Measurement in…	5
International Journal of…	5
Journal of Educational…	5
Measurement and Evaluation in…	5
New York State Education…	5
Practical Assessment,…	5
Sociological Methods &…	5
Structural Equation Modeling:…	5
ETS Research Report Series	4
Measurement:…	4
International Journal of…	3
International Journal of…	3
Journal of Consulting and…	3
Research Papers in Education	3
ACT, Inc.	2
More ▼

Zimmerman, Donald W.	10
Huynh, Huynh	7
Brennan, Robert L.	6
Livingston, Samuel A.	6
Williams, Richard H.	6
Feldt, Leonard S.	4
Whitely, Susan E.	4
Cureton, Edward E.	3
Haladyna, Tom	3
Harris, Chester W.	3
Kane, Michael T.	3
Rentz, R. Robert	3
Saunders, Joseph C.	3
Schoen, Robert C.	3
Subkoviak, Michael J.	3
Thompson, Bruce	3
Yang, Xiaotong	3
Anna-Maria Fall	2
Axelrod, Bradley N.	2
Bashaw, W. L.	2
Benton, Stephen L.	2
Beula M. Magimairaj	2
Blaker, Lisa	2
Bridgeman, Brent	2
More ▼

Wechsler Adult Intelligence…	6
ACT Assessment	4
Program for International…	4
Wechsler Intelligence Scale…	4
Early Childhood Longitudinal…	3
General Educational…	3
Iowa Tests of Basic Skills	3
Beck Depression Inventory	2
Comprehensive Tests of Basic…	2
National Assessment of…	2
Advanced Placement…	1
Alabama High School…	1
Armed Forces Qualification…	1
British Household Panel Survey	1
California Achievement Tests	1
Cognitive Abilities Test	1
College Level Academic Skills…	1
Conners Rating Scales	1
Cornell Critical Thinking Test	1
Dimensions of Self Concept	1
Expressive One Word Picture…	1
Florida Comprehensive…	1
General Social Survey	1
Iowa Tests of Educational…	1
Kaufman Assessment Battery…	1
More ▼