ERIC - Search Results

Showing 1 to 15 of 21 results

Estimating the Uncertainty of a Small Area Estimator Based on a Microsimulation Approach

Peer reviewed

Moretti, Angelo; Whitworth, Adam – Sociological Methods & Research, 2023

Spatial microsimulation encompasses a range of alternative methodological approaches for the small area estimation (SAE) of target population parameters from sample survey data down to target small areas in contexts where such data are desired but not otherwise available. Although widely used, an enduring limitation of spatial microsimulation SAE…

Descriptors: Simulation, Geometric Concepts, Computation, Measurement

Multi-Group Regularized Gaussian Variational Estimation: Fast Detection of DIF

Peer reviewed

Weicong Lyu; Chun Wang; Gongjun Xu – Grantee Submission, 2024

Data harmonization is an emerging approach to strategically combining data from multiple independent studies, enabling addressing new research questions that are not answerable by a single contributing study. A fundamental psychometric challenge for data harmonization is to create commensurate measures for the constructs of interest across…

Descriptors: Data Analysis, Test Items, Psychometrics, Item Response Theory

Examining Differential Rater Functioning Using a Between-Subgroup Outfit Approach

Peer reviewed

Wind, Stefanie A.; Sebok-Syer, Stefanie S. – Journal of Educational Measurement, 2019

When practitioners use modern measurement models to evaluate rating quality, they commonly examine rater fit statistics that summarize how well each rater's ratings fit the expectations of the measurement model. Essentially, this approach involves examining the unexpected ratings that each misfitting rater assigned (i.e., carrying out analyses of…

Descriptors: Measurement, Models, Evaluators, Simulation

Comparing Small-Sample Equating with Angoff Judgement for Linking Cut-Scores on Two Tests

Bramley, Tom – Research Matters, 2020

The aim of this study was to compare, by simulation, the accuracy of mapping a cut-score from one test to another by expert judgement (using the Angoff method) versus the accuracy with a small-sample equating method (chained linear equating). As expected, the standard-setting method resulted in more accurate equating when we assumed a higher level…

Descriptors: Cutting Scores, Standard Setting (Scoring), Equated Scores, Accuracy

Using Dirichlet Processes for Modeling Heterogeneous Treatment Effects across Sites

Peer reviewed

Miratrix, Luke; Feller, Avi; Pillai, Natesh; Pati, Debdeep – Society for Research on Educational Effectiveness, 2016

Modeling the distribution of site level effects is an important problem, but it is also an incredibly difficult one. Current methods rely on distributional assumptions in multilevel models for estimation. There it is hoped that the partial pooling of site level estimates with overall estimates, designed to take into account individual variation as…

Descriptors: Probability, Models, Statistical Distributions, Bayesian Statistics

The Role of Multiple-Group Measurement Invariance in Family Psychology Research

Peer reviewed

Kern, Justin L.; McBride, Brent A.; Laxman, Daniel J.; Dyer, W. Justin; Santos, Rosa M.; Jeans, Laurie M. – Grantee Submission, 2016

Measurement invariance (MI) is a property of measurement that is often implicitly assumed, but in many cases, not tested. When the assumption of MI is tested, it generally involves determining if the measurement holds longitudinally or cross-culturally. A growing literature shows that other groupings can, and should, be considered as well.…

Descriptors: Psychology, Measurement, Error of Measurement, Measurement Objectives

A Comparison of Linking Methods for Estimating National Trends in International Comparative Large-Scale Assessments in the Presence of Cross-national DIF

Peer reviewed

Sachse, Karoline A.; Roppelt, Alexander; Haag, Nicole – Journal of Educational Measurement, 2016

Trend estimation in international comparative large-scale assessments relies on measurement invariance between countries. However, cross-national differential item functioning (DIF) has been repeatedly documented. We ran a simulation study using national item parameters, which required trends to be computed separately for each country, to compare…

Descriptors: Comparative Analysis, Measurement, Test Bias, Simulation

Test Length and Decision Quality in Personnel Selection: When Is Short Too Short?

Peer reviewed

Kruyen, Peter M.; Emons, Wilco H. M.; Sijtsma, Klaas – International Journal of Testing, 2012

Personnel selection shows an enduring need for short stand-alone tests consisting of, say, 5 to 15 items. Despite their efficiency, short tests are more vulnerable to measurement error than longer test versions. Consequently, the question arises to what extent reducing test length deteriorates decision quality due to increased impact of…

Descriptors: Measurement, Personnel Selection, Decision Making, Error of Measurement

Assessing Measurement Equivalence in Ordered-Categorical Data

Peer reviewed

Elosua, Paula – Psicologica: International Journal of Methodology and Experimental Psychology, 2011

Assessing measurement equivalence in the framework of the common factor linear models (CFL) is known as factorial invariance. This methodology is used to evaluate the equivalence among the parameters of a measurement model among different groups. However, when dichotomous, Likert, or ordered responses are used, one of the assumptions of the CFL is…

Descriptors: Measurement, Models, Data, Factor Analysis

Measurement Error Adjustment Using the SIMEX Method: An Application to Student Growth Percentiles

Peer reviewed

Shang, Yi – Journal of Educational Measurement, 2012

Growth models are used extensively in the context of educational accountability to evaluate student-, class-, and school-level growth. However, when error-prone test scores are used as independent variables or right-hand-side controls, the estimation of such growth models can be substantially biased. This article introduces a…

Descriptors: Error of Measurement, Statistical Analysis, Regression (Statistics), Simulation

A Comparison of Four Approaches to Account for Method Effects in Latent State-Trait Analyses

Peer reviewed

Geiser, Christian; Lockhart, Ginger – Psychological Methods, 2012

Latent state-trait (LST) analysis is frequently applied in psychological research to determine the degree to which observed scores reflect stable person-specific effects, effects of situations and/or person-situation interactions, and random measurement error. Most LST applications use multiple repeatedly measured observed variables as indicators…

Descriptors: Psychological Studies, Simulation, Measurement, Error of Measurement

Observed-Score Equating with a Heterogeneous Target Population

Peer reviewed

Duong, Minh Q.; von Davier, Alina A. – International Journal of Testing, 2012

Test equating is a statistical procedure for adjusting for test form differences in difficulty in a standardized assessment. Equating results are supposed to hold for a specified target population (Kolen & Brennan, 2004; von Davier, Holland, & Thayer, 2004) and to be (relatively) independent of the subpopulations from the target population (see…

Descriptors: Ability Grouping, Difficulty Level, Psychometrics, Statistical Analysis

A 2 x 2 Taxonomy of Multilevel Latent Contextual Models: Accuracy-Bias Trade-Offs in Full and Partial Error Correction Models

Peer reviewed

Ludtke, Oliver; Marsh, Herbert W.; Robitzsch, Alexander; Trautwein, Ulrich – Psychological Methods, 2011

In multilevel modeling, group-level variables (L2) for assessing contextual effects are frequently generated by aggregating variables from a lower level (L1). A major problem of contextual analyses in the social sciences is that there is no error-free measurement of constructs. In the present article, 2 types of error occurring in multilevel data…

Descriptors: Simulation, Educational Psychology, Social Sciences, Measurement

A Bayesian Approach to Ranking and Rater Evaluation: An Application to Grant Reviews

Peer reviewed

Cao, Jing; Stokes, S. Lynne; Zhang, Song – Journal of Educational and Behavioral Statistics, 2010

We develop a Bayesian hierarchical model for the analysis of ordinal data from multirater ranking studies. The model for a rater's score includes four latent factors: one is a latent item trait determining the true order of items and the other three are the rater's performance characteristics, including bias, discrimination, and measurement error…

Descriptors: Bayesian Statistics, Data Analysis, Bias, Measurement

Improving Explanatory Inferences from Assessments

Diakow, Ronli Phyllis – ProQuest LLC, 2013

This dissertation comprises three papers that propose, discuss, and illustrate models to make improved inferences about research questions regarding student achievement in education. Addressing the types of questions common in educational research today requires three different "extensions" to traditional educational assessment: (1)…

Descriptors: Inferences, Educational Assessment, Academic Achievement, Educational Research
