ERIC - Search Results

Showing 1 to 15 of 50 results

The Tall Order of Teaching Measurement Reliability: Introducing Classical Test Theory through Observations of Human Height

Peer reviewed

Richards, Adam S. – Communication Teacher, 2021

Course: Communication Research Methods. Objectives: This activity provides students with an experiential introduction to measurement theory and the methods for assessing measurement reliability. First, multiple measurements of a person's height are interpreted according to classical test theory. Second, the measurement of human height is used as…

Descriptors: Body Height, Measurement, Communication Research, Test Theory

A Closed-Form Alternative for Estimating ω Reliability under Unidimensionality

Peer reviewed

Hancock, Gregory R.; An, Ji – Measurement: Interdisciplinary Research and Perspectives, 2020

As an alternative to Cronbach's α for estimating scale reliability, McDonald's ω has attracted increased attention within the methodological community for its less stringent measurement assumptions. Notwithstanding, ω is still seldom used by practitioners, likely due to its unavailability in popular software packages (e.g., SPSS)…

Descriptors: Evaluation, Alternative Assessment, Reliability, Test Reliability

Modifying Spearman's Attenuation Equation to Yield Partial Corrections for Measurement Error--With Application to Sample Size Calculations

Peer reviewed

Nicewander, W. Alan – Educational and Psychological Measurement, 2018

Spearman's correction for attenuation (measurement error) corrects a correlation coefficient for measurement errors in either-or-both of two variables, and follows from the assumptions of classical test theory. Spearman's equation removes all measurement error from a correlation coefficient which translates into "increasing the reliability of…

Descriptors: Error of Measurement, Correlation, Sample Size, Computation

The Importance of the Assumption of Uncorrelated Errors in Psychometric Theory

Peer reviewed

Raykov, Tenko; Marcoulides, George A.; Patelis, Thanos – Educational and Psychological Measurement, 2015

A critical discussion of the assumption of uncorrelated errors in classical psychometric theory and its applications is provided. It is pointed out that this assumption is essential for a number of fundamental results and underlies the concept of parallel tests, the Spearman-Brown's prophecy and the correction for attenuation formulas as well as…

Descriptors: Psychometrics, Correlation, Validity, Reliability

Using Rasch Measurement to Validate the Instrument of Students' Understanding of Models in Science (SUMS)

Peer reviewed

Wei, Silin; Liu, Xiufeng; Jia, Yuane – International Journal of Science and Mathematics Education, 2014

Scientific models and modeling play an important role in science, and students' understanding of scientific models is essential for their understanding of scientific concepts. The measurement instrument of "Students' Understanding of Models in Science" (SUMS), developed by Treagust, Chittleborough & Mamiala ("International…

Descriptors: Foreign Countries, High School Students, Measures (Individuals), Models

Maximum Likelihood Item Easiness Models for Test Theory without an Answer Key

Peer reviewed

France, Stephen L.; Batchelder, William H. – Educational and Psychological Measurement, 2015

Cultural consensus theory (CCT) is a data aggregation technique with many applications in the social and behavioral sciences. We describe the intuition and theory behind a set of CCT models for continuous type data using maximum likelihood inference methodology. We describe how bias parameters can be incorporated into these models. We introduce…

Descriptors: Maximum Likelihood Statistics, Test Items, Difficulty Level, Test Theory

Measurement Error Correction Formula for Cluster-Level Group Differences in Cluster Randomized and Observational Studies

Peer reviewed

Cho, Sun-Joo; Preacher, Kristopher J. – Educational and Psychological Measurement, 2016

Multilevel modeling (MLM) is frequently used to detect cluster-level group differences in cluster randomized trial and observational studies. Group differences on the outcomes (posttest scores) are detected by controlling for the covariate (pretest scores) as a proxy variable for unobserved factors that predict future attributes. The pretest and…

Descriptors: Error of Measurement, Error Correction, Multivariate Analysis, Hierarchical Linear Modeling

The Oceanography Concept Inventory: A Semicustomizable Assessment for Measuring Student Understanding of Oceanography

Peer reviewed

Arthurs, Leilani; Hsia, Jennifer F.; Schweinle, William – Journal of Geoscience Education, 2015

We developed and evaluated an Oceanography Concept Inventory (OCI), which used a mixed-methods approach to test student achievement of 11 learning goals for an introductory-level oceanography course. The OCI was designed with expert input, grounded in research on student (mis)conceptions, written with minimal jargon, tested on 464 students, and…

Descriptors: Oceanography, Mixed Methods Research, Academic Achievement, Introductory Courses

The Reliability and Precision of Total Scores and IRT Estimates as a Function of Polytomous IRT Parameters and Latent Trait Distribution

Peer reviewed

Culpepper, Steven Andrew – Applied Psychological Measurement, 2013

A classic topic in the fields of psychometrics and measurement has been the impact of the number of scale categories on test score reliability. This study builds on previous research by further articulating the relationship between item response theory (IRT) and classical test theory (CTT). Equations are presented for comparing the reliability and…

Descriptors: Item Response Theory, Reliability, Scores, Error of Measurement

Generalizability Theory as a Unifying Framework of Measurement Reliability in Adolescent Research

Peer reviewed

Fan, Xitao; Sun, Shaojing – Journal of Early Adolescence, 2014

In adolescence research, the treatment of measurement reliability is often fragmented, and it is not always clear how different reliability coefficients are related. We show that generalizability theory (G-theory) is a comprehensive framework of measurement reliability, encompassing all other reliability methods (e.g., Pearson "r,"…

Descriptors: Generalizability Theory, Measurement, Reliability, Correlation

Measuring Graph Comprehension, Critique, and Construction in Science

Peer reviewed

Lai, Kevin; Cabrera, Julio; Vitale, Jonathan M.; Madhok, Jacquie; Tinker, Robert; Linn, Marcia C. – Journal of Science Education and Technology, 2016

Interpreting and creating graphs plays a critical role in scientific practice. The K-12 Next Generation Science Standards call for students to use graphs for scientific modeling, reasoning, and communication. To measure progress on this dimension, we need valid and reliable measures of graph understanding in science. In this research, we designed…

Descriptors: Middle School Students, Secondary School Science, Science Instruction, Graphs

Our Students Suffer from Both Lack of Knowledge and Consistency: A PPT (Potential Performance Theory) Analysis of Test-Taking

Rice, Stephen; Geels, Kasha; Trafimow, David; Hackett, Holly – Online Submission, 2011

Test scores are used to assess one's general knowledge of a specific area. Although strategies to improve test performance have been previously identified, the consistency with which one uses these strategies has not been analyzed in such a way that allows assessment of how much consistency affects overall performance. Participants completed one…

Descriptors: Performance, Test Theory, Reliability, Knowledge Level

Taking the Error Term of the Factor Model into Account: The Factor Score Predictor Interval

Peer reviewed

Beauducel, Andre – Applied Psychological Measurement, 2013

The problem of factor score indeterminacy implies that the factor and the error scores cannot be completely disentangled in the factor model. It is therefore proposed to compute Harman's factor score predictor that contains an additive combination of factor and error variance. This additive combination is discussed in the framework of classical…

Descriptors: Factor Analysis, Predictor Variables, Reliability, Error of Measurement

A Psychometric Evaluation of the Digital Logic Concept Inventory

Peer reviewed

Herman, Geoffrey L.; Zilles, Craig; Loui, Michael C. – Computer Science Education, 2014

Concept inventories hold tremendous promise for promoting the rigorous evaluation of teaching methods that might remedy common student misconceptions and promote deep learning. The measurements from concept inventories can be trusted only if the concept inventories are evaluated both by expert feedback and statistical scrutiny (psychometric…

Descriptors: Psychometrics, Concept Formation, Measures (Individuals), Teaching Methods

How Often Do Subscores Have Added Value? Results from Operational and Simulated Data

Peer reviewed

Sinharay, Sandip – Journal of Educational Measurement, 2010

Recently, there has been an increasing level of interest in subscores for their potential diagnostic value. Haberman suggested a method based on classical test theory to determine whether subscores have added value over total scores. In this article I first provide a rich collection of results regarding when subscores were found to have added…

Descriptors: Scores, Test Theory, Simulation, Reliability
