ERIC - Search Results

Publication Date

In 2025	0
Since 2024	2
Since 2021 (last 5 years)	7
Since 2016 (last 10 years)	15
Since 2006 (last 20 years)	22

Descriptor

Error of Measurement	26
Statistical Analysis	9
Sample Size	6
Least Squares Statistics	5
Test Items	5
Test Reliability	5
Correlation	4
Foreign Countries	4
Regression (Statistics)	4
Sampling	4
Statistical Bias	4
Statistical Inference	4
Bayesian Statistics	3
Computation	3
Equated Scores	3
Factor Analysis	3
Item Analysis	3
Item Response Theory	3
Probability	3
Reliability	3
Scores	3
Simulation	3
Accuracy	2
College Entrance Examinations	2
Computer Software	2
More ▼

Source

Practical Assessment,…

Publication Type

Journal Articles	26
Reports - Research	17
Reports - Descriptive	5
Reports - Evaluative	3
Information Analyses	1

Education Level

Higher Education	5
Postsecondary Education	5
Junior High Schools	3
Middle Schools	3
Secondary Education	3
Elementary Education	2
Grade 8	2
Elementary Secondary Education	1
Grade 3	1
Grade 5	1
High Schools	1
More ▼

Audience

Location

Canada	2
Maryland	1
Norway	1
Sweden	1
Texas	1

Laws, Policies, & Programs

Assessments and Surveys

Texas Assessment of Academic…

What Works Clearinghouse Rating

Showing 1 to 15 of 26 results Save | Export

A Tutorial on Cross Wave Measurement Invariance Testing with Item Factor Analysis

Peer reviewed
PDF on ERIC

Download full text

R. Noah Padgett – Practical Assessment, Research & Evaluation, 2023

The consistency of psychometric properties across waves of data collection provides valuable evidence that scores can be interpreted consistently. Evidence supporting the consistency of psychometric properties can come from using a longitudinal extension of item factor analysis to account for the lack of independence of observation when evaluating…

Descriptors: Psychometrics, Factor Analysis, Item Analysis, Validity

How to Obtain the Most Error-Free Estimate of Reliability? Eight Sources of Deflation in the Estimates of Reliability to Avoid

Peer reviewed
PDF on ERIC

Download full text

Metsämuuronen, Jari – Practical Assessment, Research & Evaluation, 2022

The reliability of a test score is usually underestimated and the deflation may be profound, 0.40 - 0.60 units of reliability or 46 - 71%. Eight root sources of the deflation are discussed and quantified by a simulation with 1,440 real-world datasets: (1) errors in the measurement modelling, (2) inefficiency in the estimator of reliability within…

Descriptors: Test Reliability, Scores, Test Items, Correlation

Frequentist and Bayesian Factorial Invariance Using R

Peer reviewed
PDF on ERIC

Download full text

Teck Kiang Tan – Practical Assessment, Research & Evaluation, 2024

The procedures of carrying out factorial invariance to validate a construct were well developed to ensure the reliability of the construct that can be used across groups for comparison and analysis, yet mainly restricted to the frequentist approach. This motivates an update to incorporate the growing Bayesian approach for carrying out the Bayesian…

Descriptors: Bayesian Statistics, Factor Analysis, Programming Languages, Reliability

Impacts of Differences in Group Abilities and Anchor Test Features on Three Non-IRT Test Equating Methods

Peer reviewed
PDF on ERIC

Download full text

Inga Laukaityte; Marie Wiberg – Practical Assessment, Research & Evaluation, 2024

The overall aim was to examine effects of differences in group ability and features of the anchor test form on equating bias and the standard error of equating (SEE) using both real and simulated data. Chained kernel equating, Postratification kernel equating, and Circle-arc equating were studied. A college admissions test with four different…

Descriptors: Ability Grouping, Test Items, College Entrance Examinations, High Stakes Tests

On the Use of Different Linkage Plans with Different Observed-Score Equipercentile Equating Methods

Peer reviewed
PDF on ERIC

Download full text

Wiberg, Marie – Practical Assessment, Research & Evaluation, 2021

The overall aim was to examine the equated values when using different linkage plans and different observed-score equipercentile equating methods with the equivalent groups (EG) design and the nonequivalent groups with anchor test (NEAT) design. Both real data from a college admissions test and simulated data were used with frequency estimation,…

Descriptors: Equated Scores, Test Items, Methods, College Entrance Examinations

Evaluating Parent Comprehension of Measurement Error Information Presented in Score Reports

Peer reviewed
PDF on ERIC

Download full text

Kannan, Priya; Zapata-Rivera, Diego; Bryant, Andrew D. – Practical Assessment, Research & Evaluation, 2021

Individual-student score reports sometimes include information about precision of scores (i.e., measurement error). In this study, we specifically investigated if parents understand this information when presented. We conducted an online experimental study where 196 parents of middle school children, from various parts of the country, were…

Descriptors: Comprehension, Parents, Error of Measurement, Test Interpretation

Conditional Standard Error of Measurement: Classical Test Theory, Generalizability Theory and Many-Facet Rasch Measurement with Applications to Writing Assessment

Peer reviewed
PDF on ERIC

Download full text

Huebner, Alan; Skar, Gustaf B. – Practical Assessment, Research & Evaluation, 2021

Writing assessments often consist of students responding to multiple prompts, which are judged by more than one rater. To establish the reliability of these assessments, there exist different methods to disentangle variation due to prompts and raters, including classical test theory, Many Facet Rasch Measurement (MFRM), and Generalizability Theory…

Descriptors: Error of Measurement, Test Theory, Generalizability Theory, Item Response Theory

Overview and Illustration of Bayesian Confirmatory Factor Analysis with Ordinal Indicators

Peer reviewed
PDF on ERIC

Download full text

Taylor, John M. – Practical Assessment, Research & Evaluation, 2019

Although frequentist estimators can effectively fit ordinal confirmatory factor analysis (CFA) models, their assumptions are difficult to establish and estimation problems may prohibit their use at times. Consequently, researchers may want to also look to Bayesian analysis to fit their ordinal models. Bayesian methods offer researchers an…

Descriptors: Bayesian Statistics, Factor Analysis, Least Squares Statistics, Error of Measurement

Causal Inference Methods for Selection on Observed and Unobserved Factors: Propensity Score Matching, Heckit Models, and Instrumental Variable Estimation

Peer reviewed
PDF on ERIC

Download full text

Scott, Paul Wesley – Practical Assessment, Research & Evaluation, 2019

Two approaches to causal inference in the presence of non-random assignment are presented: The Propensity Score approach which pseudo-randomizes by balancing groups on observed propensity to be in treatment, and the Endogenous Treatment Effects approach which utilizes systems of equations to explicitly model selection into treatment. The three…

Descriptors: Causal Models, Statistical Inference, Probability, Scores

A Note on Using the Nonparametric Levene Test When Population Means Are Unequal

Peer reviewed
PDF on ERIC

Download full text

Shear, Benjamin R.; Nordstokke, David W.; Zumbo, Bruno D. – Practical Assessment, Research & Evaluation, 2018

This computer simulation study evaluates the robustness of the nonparametric Levene test of equal variances (Nordstokke & Zumbo, 2010) when sampling from populations with unequal (and unknown) means. Testing for population mean differences when population variances are unknown and possibly unequal is often referred to as the Behrens-Fisher…

Descriptors: Nonparametric Statistics, Computer Simulation, Monte Carlo Methods, Sampling

Heteroskedasticity in Multiple Regression Analysis: What it is, How to Detect it and How to Solve it with Applications in R and SPSS

Peer reviewed
PDF on ERIC

Download full text

Astivia, Oscar L. Olvera; Zumbo, Bruno D. – Practical Assessment, Research & Evaluation, 2019

Within psychology and the social sciences, Ordinary Least Squares (OLS) regression is one of the most popular techniques for data analysis. In order to ensure the inferences from the use of this method are appropriate, several assumptions must be satisfied, including the one of constant error variance (i.e. homoskedasticity). Most of the training…

Descriptors: Multiple Regression Analysis, Least Squares Statistics, Statistical Analysis, Error of Measurement

Data Transformations for Inference with Linear Regression: Clarifications and Recommendations

Peer reviewed
PDF on ERIC

Download full text

Pek, Jolynn; Wong, Octavia; Wong, C. M. – Practical Assessment, Research & Evaluation, 2017

Data transformations have been promoted as a popular and easy-to-implement remedy to address the assumption of normally distributed errors (in the population) in linear regression. However, the application of data transformations introduces non-ignorable complexities which should be fully appreciated before their implementation. This paper adds to…

Descriptors: Data Analysis, Regression (Statistics), Statistical Inference, Data Interpretation

The Miscalculation of Interrater Reliability: A Case Study Involving the AAC&U VALUE Rubrics

Peer reviewed
PDF on ERIC

Download full text

Szafran, Robert F. – Practical Assessment, Research & Evaluation, 2017

Institutional assessment of student learning objectives has become a fact-of-life in American higher education and the Association of American Colleges and Universities' (AAC&U) VALUE Rubrics have become a widely adopted evaluation and scoring tool for student work. As faculty from a variety of disciplines, some less familiar with the…

Descriptors: Interrater Reliability, Case Studies, Scoring Rubrics, Behavioral Objectives

Measurement Error and Equating Error in Power Analysis

Peer reviewed
PDF on ERIC

Download full text

Phillips, Gary W.; Jiang, Tao – Practical Assessment, Research & Evaluation, 2016

Power analysis is a fundamental prerequisite for conducting scientific research. Without power analysis the researcher has no way of knowing whether the sample size is large enough to detect the effect he or she is looking for. This paper demonstrates how psychometric factors such as measurement error and equating error affect the power of…

Descriptors: Error of Measurement, Statistical Analysis, Equated Scores, Sample Size

Accuracy of Range Restriction Correction with Multiple Imputation in Small and Moderate Samples: A Simulation Study

Peer reviewed
PDF on ERIC

Download full text

Pfaffel, Andreas; Spiel, Christiane – Practical Assessment, Research & Evaluation, 2016

Approaches to correcting correlation coefficients for range restriction have been developed under the framework of large sample theory. The accuracy of missing data techniques for correcting correlation coefficients for range restriction has thus far only been investigated with relatively large samples. However, researchers and evaluators are…

Descriptors: Correlation, Sample Size, Error of Measurement, Accuracy

Previous Page | Next Page »

Pages: 1 | 2

Zumbo, Bruno D.	3
Nordstokke, David W.	2
Astivia, Oscar L. Olvera	1
Bryant, Andrew D.	1
Cairns, Sharon L.	1
Cassady, Jerrell C.	1
Coverdale, Bradley J.	1
Gomez Grajales, Carlos Alberto	1
Han, Kyung T.	1
Huang, Francis L.	1
Huebner, Alan	1
Inga Laukaityte	1
Jiang, Tao	1
Jin, Ying	1
Kannan, Priya	1
Kellow, J. Thomas	1
Kurkiewicz, Dason	1
Lovato, Chris Y.	1
Luxenberg, Harlan	1
Marie Wiberg	1
Metsämuuronen, Jari	1
Osborne, Jason W.	1
Osbourne, Jason W.	1
Pek, Jolynn	1
Pfaffel, Andreas	1
More ▼