Showing 1 to 15 of 24 results
Peer reviewed
PDF on ERIC
Seyma Erbay Mermer – Pegem Journal of Education and Instruction, 2024
This study aims to compare item and student parameters of dichotomously scored multidimensional constructs estimated based on unidimensional and multidimensional Item Response Theory (IRT) under different conditions of sample size, interdimensional correlation and number of dimensions. This research, conducted with simulations, is of a basic…
Descriptors: Item Response Theory, Correlation, Error of Measurement, Comparative Analysis
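Simulation studies like this one start by generating dichotomous response data from an IRT model. As a minimal illustrative sketch (not the authors' code, which is multidimensional), the two-parameter logistic (2PL) model below simulates 0/1 responses for a single latent dimension; the item parameters are made-up values:

```python
import math
import random

def p_correct(theta, a, b):
    """2PL IRT model: probability of a correct response for ability theta,
    item discrimination a, and item difficulty b."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))

def simulate_responses(thetas, items, rng):
    """Generate a dichotomous (0/1) response matrix: one row per examinee."""
    return [[1 if rng.random() < p_correct(t, a, b) else 0 for a, b in items]
            for t in thetas]

rng = random.Random(42)
thetas = [rng.gauss(0.0, 1.0) for _ in range(500)]   # simulated abilities
items = [(1.2, -0.5), (0.8, 0.0), (1.5, 0.7)]        # hypothetical (a, b) pairs
data = simulate_responses(thetas, items, rng)
```

The multidimensional case replaces the single theta with a vector of correlated abilities, which is where the study's manipulated factors (interdimensional correlation, number of dimensions) come in.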
Peer reviewed
Direct link
Yuanfang Liu; Mark H. C. Lai; Ben Kelcey – Structural Equation Modeling: A Multidisciplinary Journal, 2024
Measurement invariance holds when a latent construct is measured in the same way across different levels of background variables (continuous or categorical) while controlling for the true value of that construct. Using Monte Carlo simulation, this paper compares the multiple indicators, multiple causes (MIMIC) model and MIMIC-interaction to a…
Descriptors: Classification, Accuracy, Error of Measurement, Correlation
Peer reviewed
Direct link
Pere J. Ferrando; David Navarro-González; Fabia Morales-Vives – Educational and Psychological Measurement, 2025
The problem of local item dependencies (LIDs) is very common in personality and attitude measures, particularly in those that measure narrow-bandwidth dimensions. At the structural level, these dependencies can be modeled by using extended factor analytic (FA) solutions that include correlated residuals. However, the effects that LIDs have on the…
Descriptors: Scores, Accuracy, Evaluation Methods, Factor Analysis
Peer reviewed
Direct link
Kristin Porter; Luke Miratrix; Kristen Hunter – Society for Research on Educational Effectiveness, 2021
Background: Researchers are often interested in testing the effectiveness of an intervention on multiple outcomes, for multiple subgroups, at multiple points in time, or across multiple treatment groups. The resulting multiplicity of statistical hypothesis tests can lead to spurious findings of effects. Multiple testing procedures (MTPs)…
Descriptors: Statistical Analysis, Hypothesis Testing, Computer Software, Randomized Controlled Trials
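Two standard MTPs in this literature are Bonferroni (family-wise error control) and Benjamini-Hochberg (false discovery rate control). A minimal sketch of both, with purely illustrative p-values:

```python
def bonferroni(pvals, alpha=0.05):
    """Reject H0_i when p_i <= alpha / m (controls family-wise error rate)."""
    m = len(pvals)
    return [p <= alpha / m for p in pvals]

def benjamini_hochberg(pvals, alpha=0.05):
    """Benjamini-Hochberg step-up procedure (controls false discovery rate):
    reject the hypotheses with the k smallest p-values, where k is the
    largest rank with p_(k) <= k * alpha / m."""
    m = len(pvals)
    order = sorted(range(m), key=lambda i: pvals[i])
    k_max = 0
    for rank, i in enumerate(order, start=1):
        if pvals[i] <= rank * alpha / m:
            k_max = rank
    reject = [False] * m
    for rank, i in enumerate(order, start=1):
        if rank <= k_max:
            reject[i] = True
    return reject

pvals = [0.001, 0.02, 0.04, 0.30]   # hypothetical per-test p-values
```

Because BH controls a less stringent error rate, it typically rejects at least as many hypotheses as Bonferroni, which is why the choice of MTP matters for power.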
Bramley, Tom – Research Matters, 2020
The aim of this study was to compare, by simulation, the accuracy of mapping a cut-score from one test to another by expert judgement (using the Angoff method) versus the accuracy of a small-sample equating method (chained linear equating). As expected, the standard-setting method resulted in more accurate equating when we assumed a higher level…
Descriptors: Cutting Scores, Standard Setting (Scoring), Equated Scores, Accuracy
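Chained linear equating, the small-sample method compared here, can be sketched as two mean/standard-deviation links chained through an anchor test. This is an illustrative simplification with hypothetical score vectors, not the study's implementation:

```python
import statistics

def linear_equate(from_scores, to_scores):
    """Linear equating: map the 'from' score scale onto the 'to' scale by
    matching means and standard deviations."""
    m_f, m_t = statistics.mean(from_scores), statistics.mean(to_scores)
    s_f, s_t = statistics.pstdev(from_scores), statistics.pstdev(to_scores)
    return lambda x: m_t + (s_t / s_f) * (x - m_f)

def chained_linear_equate(x_scores, anchor_in_x_group, anchor_in_y_group, y_scores):
    """Chain Form X -> anchor (estimated in the group taking X) with
    anchor -> Form Y (estimated in the group taking Y)."""
    x_to_anchor = linear_equate(x_scores, anchor_in_x_group)
    anchor_to_y = linear_equate(anchor_in_y_group, y_scores)
    return lambda x: anchor_to_y(x_to_anchor(x))
```

A cut-score on Form X is then mapped to Form Y by passing it through the chained function, which is the operation whose small-sample accuracy the study benchmarks against Angoff judgements.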
Peer reviewed
Direct link
Gorgun, Guher; Bulut, Okan – Educational and Psychological Measurement, 2021
In low-stakes assessments, some students may not reach the end of the test and leave some items unanswered due to various reasons (e.g., lack of test-taking motivation, poor time management, and test speededness). Not-reached items are often treated as incorrect or not-administered in the scoring process. However, when the proportion of…
Descriptors: Scoring, Test Items, Response Style (Tests), Mathematics Tests
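The two treatments of not-reached items mentioned above (score as incorrect vs. treat as not administered) can be contrasted in a toy proportion-correct scorer. Here `None` marks a not-reached item; the encoding and function are assumptions for illustration:

```python
def proportion_correct(responses, not_reached="incorrect"):
    """Score a 0/1 response vector where None marks a not-reached item.

    not_reached="incorrect":        None counts as 0; denominator = all items
    not_reached="not_administered": None items are dropped from the denominator
    """
    if not_reached == "incorrect":
        return sum(r or 0 for r in responses) / len(responses)
    answered = [r for r in responses if r is not None]
    return sum(answered) / len(answered)

resp = [1, 1, 0, None, None]   # hypothetical examinee: last two items not reached
```

The gap between the two scores grows with the proportion of not-reached items, which is why that proportion is a key factor in studies like this one.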
Peer reviewed
Direct link
Aksu Dunya, Beyza – International Journal of Testing, 2018
This study was conducted to analyze potential item parameter drift (IPD) impact on person ability estimates and classification accuracy when drift affects an examinee subgroup. Using a series of simulations, three factors were manipulated: (a) percentage of IPD items in the CAT exam, (b) percentage of examinees affected by IPD, and (c) item pool…
Descriptors: Adaptive Testing, Classification, Accuracy, Computer Assisted Testing
Peer reviewed
Direct link
Finch, W. Holmes; Shim, Sungok Serena – Educational and Psychological Measurement, 2018
Collection and analysis of longitudinal data is an important tool in understanding growth and development over time in a whole range of human endeavors. Ideally, researchers working in the longitudinal framework are able to collect data at more than two points in time, as this will provide them with the potential for a deeper understanding of the…
Descriptors: Comparative Analysis, Computation, Time, Change
Peer reviewed
Direct link
Saluja, Ronak; Cheng, Sierra; delos Santos, Keemo Althea; Chan, Kelvin K. W. – Research Synthesis Methods, 2019
Objective: Various statistical methods have been developed to estimate hazard ratios (HRs) from published Kaplan-Meier (KM) curves for the purpose of performing meta-analyses. The objective of this study was to determine the reliability, accuracy, and precision of four commonly used methods by Guyot, Williamson, Parmar, and Hoyle and Henley.…
Descriptors: Meta Analysis, Reliability, Accuracy, Randomized Controlled Trials
Peer reviewed
Direct link
van Kernebeek, Willem G.; de Schipper, Antoine W.; Savelsbergh, Geert J. P.; Toussaint, Huub M. – Measurement in Physical Education and Exercise Science, 2018
In The Netherlands, the 4-Skills Scan is an instrument for physical education teachers to assess gross motor skills of elementary school children. Little is known about its reliability. Therefore, in this study the test-retest and inter-rater reliability were determined. Respectively, 624 and 557 Dutch 6- to 12-year-old children were analyzed for…
Descriptors: Foreign Countries, Interrater Reliability, Pretests Posttests, Psychomotor Skills
Peer reviewed
Direct link
Lee, Woo-yeol; Cho, Sun-Joo – Journal of Educational Measurement, 2017
Cross-level invariance in a multilevel item response model can be investigated by testing whether the within-level item discriminations are equal to the between-level item discriminations. Testing the cross-level invariance assumption is important to understand constructs in multilevel data. However, in most multilevel item response model…
Descriptors: Test Items, Item Response Theory, Item Analysis, Simulation
Peer reviewed
Direct link
Zakszeski, Brittany N.; Hojnoski, Robin L.; Wood, Brenna K. – Topics in Early Childhood Special Education, 2017
Classroom engagement is important to young children's academic and social development. Accurate methods of capturing this behavior are needed to inform and evaluate intervention efforts. This study compared the accuracy of interval durations (i.e., 5 s, 10 s, 15 s, 20 s, 30 s, and 60 s) of momentary time sampling (MTS) in approximating the…
Descriptors: Intervals, Time, Sampling, Learner Engagement
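Momentary time sampling records only whether the behavior is occurring at the instant each interval ends. A toy simulation (assumed 1-second resolution and a made-up behavior stream, not the study's data) shows how MTS approximates the true engagement duration and how a very coarse interval can miss it entirely:

```python
def true_duration(stream):
    """Proportion of seconds the behavior actually occurred
    (stream is a list of 0/1 flags, one per second)."""
    return sum(stream) / len(stream)

def mts_estimate(stream, interval_s):
    """Momentary time sampling: look only at the instant closing each
    interval and report the proportion of those instants with behavior."""
    samples = [stream[t] for t in range(interval_s - 1, len(stream), interval_s)]
    return sum(samples) / len(samples)

# Hypothetical 60-second session: engaged for the first 30 s only.
stream = [1] * 30 + [0] * 30
```

With a 10-second interval the estimate happens to match the true 50% here, while a single 60-second sample lands on a disengaged instant and reports 0%, illustrating why interval duration drives accuracy.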
Peer reviewed
Direct link
Schultz, Sarah M.; Jacobs, Michelle M.; Gorgos, Kara S.; Wasylyk, Nicole T.; Hanrahan, Sean; Van Lunen, Bonnie L. – Athletic Training Education Journal, 2015
Context: Accuracy of locating various lumbopelvic landmarks for novice athletic trainers has not been examined. Objective: To examine reliability of novice athletic trainers for identification of the L4 spinous process and right and left posterior superior iliac spine (PSIS). Design: Cross-sectional reliability. Setting: Laboratory. Patients or…
Descriptors: Athletics, Allied Health Personnel, Entry Workers, Reliability
Peer reviewed
PDF on ERIC
Pfaffel, Andreas; Spiel, Christiane – Practical Assessment, Research & Evaluation, 2016
Approaches to correcting correlation coefficients for range restriction have been developed under the framework of large sample theory. The accuracy of missing data techniques for correcting correlation coefficients for range restriction has thus far only been investigated with relatively large samples. However, researchers and evaluators are…
Descriptors: Correlation, Sample Size, Error of Measurement, Accuracy
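A standard large-sample correction for direct range restriction (Thorndike's Case II; the paper's missing-data techniques may differ) scales the restricted correlation by the ratio of unrestricted to restricted standard deviations of the selection variable:

```python
import math

def correct_range_restriction(r, sd_restricted, sd_unrestricted):
    """Thorndike Case II correction for direct range restriction on x:
    r_c = r*u / sqrt(1 + r^2 * (u^2 - 1)), where u = SD_unrestricted / SD_restricted."""
    u = sd_unrestricted / sd_restricted
    return r * u / math.sqrt(1.0 + r * r * (u * u - 1.0))
```

When the observed range is narrower than the population range (u > 1), the corrected correlation exceeds the observed one; the paper's question is how well such corrections behave when the samples used to estimate these quantities are small.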
Peer reviewed
Direct link
Mortaz Hejri, Sara; Yazdani, Kamran; Labaf, Ali; Norcini, John J.; Jalili, Mohammad – Advances in Health Sciences Education, 2016
In a sequential OSCE, which has been suggested to reduce testing costs, candidates take a short screening test, and those who fail it are asked to take the full OSCE. In order to introduce an effective and accurate sequential design, we developed a model for designing and evaluating screening OSCEs. Based on two datasets from a 10-station…
Descriptors: Models, Instructional Design, Sequential Approach, Medical Students