ERIC - Search Results

Showing 1 to 15 of 16 results

The Impact of Measurement Model Misspecification on Coefficient Omega Estimates of Composite Reliability

Peer reviewed

Stephanie M. Bell; R. Philip Chalmers; David B. Flora – Educational and Psychological Measurement, 2024

Coefficient omega indices are model-based composite reliability estimates that have become increasingly popular. A coefficient omega index estimates how reliably an observed composite score measures a target construct as represented by a factor in a factor-analysis model; as such, the accuracy of omega estimates is likely to depend on correct…

Descriptors: Influences, Models, Measurement Techniques, Reliability

Linear and Nonlinear Indices of Score Accuracy and Item Effectiveness for Measures That Contain Locally Dependent Items

Peer reviewed

Pere J. Ferrando; David Navarro-González; Fabia Morales-Vives – Educational and Psychological Measurement, 2025

The problem of local item dependencies (LIDs) is very common in personality and attitude measures, particularly in those that measure narrow-bandwidth dimensions. At the structural level, these dependencies can be modeled by using extended factor analytic (FA) solutions that include correlated residuals. However, the effects that LIDs have on the…

Descriptors: Scores, Accuracy, Evaluation Methods, Factor Analysis

Psychometric Evaluation of Perceived Internship PUA Scale: Using Rasch Analysis

Peer reviewed

Yanchao Yang; Wangze Li; Sijia Xue; Wenxue Huang; Shijie Guo – European Journal of Education, 2025

In response to the prevalence of perceived internship Pick-up Artist (PUA) behaviours and the lack of appropriate measurement tools, the purpose of this study was to develop and validate a new self-designed questionnaire, the Perceived Internship PUA Scale (PIPUAS), to assess college student interns' perceptions of internship PUA behaviours. The…

Descriptors: Measurement Techniques, Incidence, Internship Programs, Validity

A Comparison of Procedures for Estimating Person Reliability Parameters in the Graded Response Model

Peer reviewed

LaHuis, David M.; Bryant-Lees, Kinsey B.; Hakoyama, Shotaro; Barnes, Tyler; Wiemann, Andrea – Journal of Educational Measurement, 2018

Person reliability parameters (PRPs) model temporary changes in individuals' attribute level perceptions when responding to self-report items (higher levels of PRPs represent less fluctuation). PRPs could be useful in measuring careless responding and traitedness. However, it is unclear how well current procedures for estimating PRPs can recover…

Descriptors: Comparative Analysis, Reliability, Error of Measurement, Measurement Techniques

Development and Validation of the Family Feedback on Child Welfare Services (FF-CWS)

Peer reviewed

Ayala-Nunes, Lara; Jiménez, Lucía; Hidalgo, Victoria; Dekovic, Maja; Jesus, Saul – Research on Social Work Practice, 2018

Objective: The measurement of Family Feedback on Child Welfare Services (FF-CWS) is gaining prominence as an efficacy indicator and is coherent with concerns about family-centered practice and empowerment. The aim of this study was to develop and validate an instrument that would overcome the scarcity of psychometrically sound measures in this…

Descriptors: Feedback (Response), Error of Measurement, Validity, Child Welfare

Evaluating Procedures for Reducing Measurement Error in Math Curriculum-Based Measurement Probes

Peer reviewed

Methe, Scott A.; Briesch, Amy M.; Hulac, David – Assessment for Effective Intervention, 2015

At present, it is unclear whether math curriculum-based measurement (M-CBM) procedures provide a dependable measure of student progress in math computation because support for its technical properties is based largely upon a body of correlational research. Recent investigations into the dependability of M-CBM scores have found that evaluating…

Descriptors: Measurement Techniques, Error of Measurement, Mathematics Curriculum, Curriculum Based Assessment

An Investigation of Measurement Invariance of the Key Stage 2 National Curriculum Science Sampling Test in England

Peer reviewed

He, Qingping; Anwyll, Steve; Glanville, Matthew; Opposs, Dennis – Research Papers in Education, 2014

Since 2010, the whole national cohort Key Stage 2 (KS2) National Curriculum test in science in England has been replaced with a sampling test taken by pupils at the age of 11 from a nationally representative sample of schools annually. The study reported in this paper compares the performance of different subgroups of the samples (classified by…

Descriptors: National Curriculum, Sampling, Foreign Countries, Factor Analysis

Quality Control Charts in Large-Scale Assessment Programs

Peer reviewed

Schafer, William D.; Coverdale, Bradley J.; Luxenberg, Harlan; Jin, Ying – Practical Assessment, Research & Evaluation, 2011

There are relatively few examples of quantitative approaches to quality control in educational assessment and accountability contexts. Among the several techniques that are used in other fields, Shewhart charts have been found in a few instances to be applicable in educational settings. This paper describes Shewhart charts and gives examples of how…

Descriptors: Charts, Quality Control, Educational Assessment, Statistical Analysis

An Investigation of "Honesty Check" Items in Higher Education Course Evaluations

Peer reviewed

Bradley, Kelly D.; Royal, Kenneth D.; Bradley, James W. – Journal of College Teaching & Learning, 2008

The reliability and validity of course evaluations in higher education are often assumed. The typical Likert-type surveys utilized when students evaluate the course and instructor often overlook measurement issues, or deal with them in an ineffective manner. Given the importance that is placed on higher education course evaluations, with results…

Descriptors: Higher Education, Course Evaluation, Reliability, Validity

On the Consistency of Individual Classification Using Short Scales

Peer reviewed

Emons, Wilco H. M.; Sijtsma, Klaas; Meijer, Rob R. – Psychological Methods, 2007

Short tests containing at most 15 items are used in clinical and health psychology, medicine, and psychiatry for making decisions about patients. Because short tests have large measurement error, the authors ask whether they are reliable enough for classifying patients into a treatment and a nontreatment group. For a given certainty level,…

Descriptors: Psychiatry, Patients, Error of Measurement, Test Length

A Confirmatory Analysis of Item Reliability Trends (CAIRT): Differentiating True Score and Error Variance in the Analysis of Item Context Effects

Peer reviewed

Hartig, Johannes; Holzel, Britta; Moosbrugger, Helfried – Multivariate Behavioral Research, 2007

Numerous studies have shown increasing item reliabilities as an effect of the item position in personality scales. Traditionally, these context effects are analyzed based on item-total correlations. This approach neglects that trends in item reliabilities can be caused either by an increase in true score variance or by a decrease in error…

Descriptors: True Scores, Error of Measurement, Structural Equation Models, Simulation

Understanding Internal Consistency Reliability Estimates: A Conceptual Primer on Coefficient Alpha.

Peer reviewed

Henson, Robin K. – Measurement and Evaluation in Counseling and Development, 2001

Although often ignored, reliability is critical when interpreting study effects and test results. Accordingly, this article focuses on the most commonly used estimate of reliability, internal consistency coefficients, with emphasis on coefficient alpha. An interpretive framework is provided for applied researchers and others seeking a conceptual…

Descriptors: Error of Measurement, Item Analysis, Reliability, Research Methodology

The Effect of Sequential Dependence on the Sampling Distributions of KR-20, KR-21, and Split-Halves Reliabilities.

Sullins, Walter L. – 1971

Five hundred dichotomously scored response patterns were generated with sequentially independent (SI) items and 500 with sequentially dependent (SD) items for each of thirty-six combinations of sampling parameters (i.e., three test lengths, three sample sizes, and four item difficulty distributions). KR-20, KR-21, and Split-Half (S-H) reliabilities were…

Descriptors: Comparative Analysis, Correlation, Error of Measurement, Item Analysis

Item Characteristic Curve Parameters: Effects of Sample Size on Linear Equating.

Ree, Malcom James; Jensen, Harald E. – 1980

By means of computer simulation of test responses, the reliability of item analysis data and the accuracy of equating were examined for hypothetical samples of 250, 500, 1000, and 2000 subjects for two tests with 20 equating items plus 60 additional items on the same scale. Birnbaum's three-parameter logistic model was used for the simulation. The…

Descriptors: Computer Assisted Testing, Equated Scores, Error of Measurement, Item Analysis

Evidence on the Quality of Several Approximations for Commonly Used Measurement Statistics.

McMorris, Robert F. – 1971

The extent of error likely to occur with each of several approximations for the standard deviation, internal consistency reliability, and the standard error of measurement is analyzed. Approximations were compared with exact statistics obtained on 85 different classroom tests constructed and administered by professors in a variety of fields. Means…

Descriptors: Data Analysis, Error of Measurement, Evaluation Methods, Item Analysis
