Corinne Huggins-Manley; Anthony W. Raborn; Peggy K. Jones; Ted Myers – Journal of Educational Measurement, 2024
The purpose of this study is to develop a nonparametric DIF method that (a) compares focal groups directly to the composite group that will be used to develop the reported test score scale, and (b) allows practitioners to explore for DIF related to focal groups stemming from multicategorical variables that constitute a small proportion of the…
Descriptors: Nonparametric Statistics, Test Bias, Scores, Statistical Significance
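The conditional comparison the abstract describes can be illustrated with a toy version: match examinees on rest score and contrast the focal group's item mean with the composite group's within each stratum. A minimal Python sketch with invented data follows; it is a generic matched comparison, not the authors' actual method.

```python
import numpy as np

rng = np.random.default_rng(42)

# Hypothetical data: 1,000 examinees, 20 binary items, a small focal group.
n, k = 1000, 20
abilities = rng.normal(size=n)
focal = rng.random(n) < 0.10
difficulty = rng.normal(size=k)
p = 1 / (1 + np.exp(-(abilities[:, None] - difficulty[None, :])))
X = (rng.random((n, k)) < p).astype(int)

item = 0
rest = X.sum(axis=1) - X[:, item]      # matching variable: rest score

# Weighted mean difference between focal examinees and the composite
# group (everyone), conditional on rest-score strata.
diff, weight = 0.0, 0
for s in np.unique(rest):
    in_s = rest == s
    foc = in_s & focal
    if foc.sum() == 0:
        continue
    diff += foc.sum() * (X[foc, item].mean() - X[in_s, item].mean())
    weight += foc.sum()
print("conditional focal-vs-composite difference:", diff / weight)
```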
Dimitrov, Dimiter M. – Measurement and Evaluation in Counseling and Development, 2017
This article offers an approach to examining differential item functioning (DIF) under its item response theory (IRT) treatment in the framework of confirmatory factor analysis (CFA). The approach is based on integrating IRT- and CFA-based testing of DIF and using bias-corrected bootstrap confidence intervals with syntax code in Mplus.
Descriptors: Test Bias, Item Response Theory, Factor Analysis, Evaluation Methods
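The bias-corrected bootstrap step is standard enough to sketch outside Mplus. A minimal Python illustration with invented item data; the effect, group sizes, and proportions below are all hypothetical.

```python
import numpy as np
from scipy.stats import norm

rng = np.random.default_rng(1)

# Hypothetical item scores for a reference and a focal group.
ref = rng.binomial(1, 0.70, size=300)
foc = rng.binomial(1, 0.62, size=200)
obs = ref.mean() - foc.mean()          # the DIF-style effect of interest

B = 2000
boot = np.empty(B)
for b in range(B):
    boot[b] = (rng.choice(ref, ref.size).mean()
               - rng.choice(foc, foc.size).mean())

# Bias-corrected (BC) percentile interval: shift the percentile points
# by z0, the normal quantile of the fraction of bootstrap replicates
# that fall below the observed estimate.
z0 = norm.ppf((boot < obs).mean())
lo, hi = norm.cdf(2 * z0 + norm.ppf([0.025, 0.975]))
print("BC 95% CI:", np.quantile(boot, [lo, hi]))
```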
Lin, Chih-Kai – Language Testing, 2017
Sparse-rated data are common in operational performance-based language tests, as an inevitable result of assigning examinee responses to a fraction of available raters. The current study investigates the precision of two generalizability-theory methods (i.e., the rating method and the subdividing method) specifically designed to accommodate the…
Descriptors: Data Analysis, Language Tests, Generalizability Theory, Accuracy
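The sparse-data methods in the study build on an ordinary persons-by-raters G-study. As a baseline, here is a minimal Python sketch of variance-component estimation for a fully crossed p × r design with simulated data; the rating and subdividing methods themselves are not reproduced here.

```python
import numpy as np

rng = np.random.default_rng(2)

# Hypothetical fully crossed p x r design: 100 persons, 4 raters.
n_p, n_r = 100, 4
person = rng.normal(0, 1.0, n_p)[:, None]     # true person effects
rater = rng.normal(0, 0.3, n_r)[None, :]      # rater severity effects
Y = person + rater + rng.normal(0, 0.5, (n_p, n_r))

grand = Y.mean()
ms_p = n_r * ((Y.mean(axis=1) - grand) ** 2).sum() / (n_p - 1)
ms_r = n_p * ((Y.mean(axis=0) - grand) ** 2).sum() / (n_r - 1)
resid = Y - Y.mean(axis=1, keepdims=True) - Y.mean(axis=0, keepdims=True) + grand
ms_pr = (resid ** 2).sum() / ((n_p - 1) * (n_r - 1))

# Expected-mean-square solutions for the variance components.
var_pr = ms_pr
var_p = (ms_p - ms_pr) / n_r
var_r = (ms_r - ms_pr) / n_p
# Generalizability coefficient for the mean over n_r raters.
print("G coefficient:", var_p / (var_p + var_pr / n_r))
```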
Landmesser, John Andrew – ProQuest LLC, 2014
Information technology (IT) investment decision makers are required to process large volumes of complex data. An existing body of knowledge relevant to IT portfolio management (PfM), decision analysis, visual comprehension of large volumes of information, and IT investment decision making suggests Multi-Criteria Decision Making (MCDM) and…
Descriptors: Information Technology, Portfolio Assessment, Decision Making, Cognitive Processes
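As a rough illustration of the MCDM family the dissertation draws on, here is a minimal simple-additive-weighting sketch in Python. The projects, criteria, and weights are invented, and real PfM applications use far richer models.

```python
import numpy as np

# Hypothetical IT portfolio alternatives scored on three criteria
# (already normalized so higher is better); weights sum to 1.
scores = np.array([[0.8, 0.4, 0.7],   # project A
                   [0.6, 0.9, 0.5],   # project B
                   [0.9, 0.5, 0.3]])  # project C
weights = np.array([0.5, 0.3, 0.2])

# Simple additive weighting, one of the basic MCDM aggregation rules.
overall = scores @ weights
for i in np.argsort(overall)[::-1]:
    print(f"project {'ABC'[i]}: {overall[i]:.2f}")
```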
Itang'ata, Mukaria J. J. – ProQuest LLC, 2013
Often researchers face situations where comparative studies between two or more programs are necessary to make causal inferences for informed policy decision-making. Experimental designs employing randomization provide the strongest evidence for causal inferences. However, many pragmatic and ethical challenges may preclude the use of randomized…
Descriptors: Comparative Analysis, Probability, Statistical Bias, Monte Carlo Methods
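The kind of Monte Carlo comparison the dissertation describes can be sketched in a few lines: simulate a confounded treatment assignment and compare the bias of a naive mean difference against a covariate-adjusted estimate. All data-generating values below are invented.

```python
import numpy as np

rng = np.random.default_rng(3)
true_effect, reps = 0.5, 2000
naive, adjusted = [], []

for _ in range(reps):
    n = 500
    x = rng.normal(size=n)                    # confounder
    t = (rng.random(n) < 1 / (1 + np.exp(-x))).astype(float)
    y = true_effect * t + x + rng.normal(size=n)
    naive.append(y[t == 1].mean() - y[t == 0].mean())
    # Regression adjustment: coefficient on t from OLS of y on [1, t, x].
    X = np.column_stack([np.ones(n), t, x])
    beta = np.linalg.lstsq(X, y, rcond=None)[0]
    adjusted.append(beta[1])

print("naive bias:   ", np.mean(naive) - true_effect)
print("adjusted bias:", np.mean(adjusted) - true_effect)
```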
Xu, Ting; Stone, Clement A. – Educational and Psychological Measurement, 2012
It has been argued that item response theory trait estimates should be used in analyses rather than number right (NR) or summated scale (SS) scores. Thissen and Orlando postulated that IRT scaling tends to produce trait estimates that are linearly related to the underlying trait being measured. Therefore, IRT trait estimates can be more useful…
Descriptors: Educational Research, Monte Carlo Methods, Measures (Individuals), Item Response Theory
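A small simulation makes the contrast between number-right scores and IRT trait estimates concrete. The sketch below generates 2PL data and compares how each score relates to the true trait; item parameters and sample sizes are arbitrary choices, not those of the study.

```python
import numpy as np

rng = np.random.default_rng(4)

# Hypothetical 2PL test: 30 items, 2,000 examinees.
n, k = 2000, 30
a = rng.uniform(0.8, 2.0, k)
b = rng.normal(0, 1, k)
theta = rng.normal(size=n)
p = 1 / (1 + np.exp(-a * (theta[:, None] - b)))
X = (rng.random((n, k)) < p).astype(int)

nr = X.sum(axis=1)                         # number-right score

# EAP trait estimate via quadrature over a standard-normal prior.
q = np.linspace(-4, 4, 81)
pq = 1 / (1 + np.exp(-a[None, :] * (q[:, None] - b[None, :])))
loglik = X @ np.log(pq).T + (1 - X) @ np.log(1 - pq).T
w = np.exp(loglik) * np.exp(-q**2 / 2)
eap = (w * q).sum(axis=1) / w.sum(axis=1)

print("corr(theta, NR): ", np.corrcoef(theta, nr)[0, 1])
print("corr(theta, EAP):", np.corrcoef(theta, eap)[0, 1])
```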
Overall, John E.; Tonidandel, Scott – Multivariate Behavioral Research, 2010
A previous Monte Carlo study examined the relative powers of several simple and more complex procedures for testing the significance of difference in mean rates of change in a controlled, longitudinal, treatment evaluation study. Results revealed that the relative powers depended on the correlation structure of the simulated repeated measurements.…
Descriptors: Monte Carlo Methods, Statistical Significance, Correlation, Depression (Psychology)
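A stripped-down version of such a power study: simulate correlated repeated measures under a compound-symmetric structure and test group differences in per-subject OLS slopes. This is one simple procedure of the kind compared, with invented effect sizes and correlation.

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(5)

def simulate_power(slope_diff, rho=0.5, n_per_arm=30, n_times=5, reps=1000):
    """Power of a t-test on per-subject OLS slopes when repeated
    measures share a compound-symmetric correlation structure."""
    t = np.arange(n_times, dtype=float)
    cov = np.full((n_times, n_times), rho) + (1 - rho) * np.eye(n_times)

    def subject_slopes(Y):
        return np.polyfit(t, Y.T, 1)[0]   # one OLS slope per subject

    hits = 0
    for _ in range(reps):
        y1 = rng.multivariate_normal(np.zeros(n_times), cov, n_per_arm)
        y2 = slope_diff * t + rng.multivariate_normal(
            np.zeros(n_times), cov, n_per_arm)
        _, p = stats.ttest_ind(subject_slopes(y1), subject_slopes(y2))
        hits += p < 0.05
    return hits / reps

print("power:", simulate_power(slope_diff=0.15))
```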
Zhang, Bo; Ohland, Matthew W. – Applied Measurement in Education, 2009
One major challenge in using group projects to assess student learning is accounting for the differences of contribution among group members so that the mark assigned to each individual actually reflects their performance. This research addresses the validity of grading group projects by evaluating different methods that derive individualized…
Descriptors: Monte Carlo Methods, Validity, Student Evaluation, Evaluation Methods
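One common weighting scheme for deriving individualized marks can be sketched directly; the ratings and group mark below are invented, and the paper evaluates several such methods rather than endorsing this one.

```python
import numpy as np

# Hypothetical peer ratings (rows: raters, cols: ratees) for one group,
# on a 1-5 contribution scale, plus the group's project mark.
ratings = np.array([[5, 3, 4],
                    [5, 2, 4],
                    [4, 3, 5]], dtype=float)
group_mark = 80.0

# Each member's weight is their mean received rating divided by the
# overall mean; the individual mark is the group mark scaled by that
# weight, capped at 100.
received = ratings.mean(axis=0)
weights = received / received.mean()
individual = np.minimum(group_mark * weights, 100.0)
print(dict(zip("ABC", individual.round(1))))
```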
de la Torre, Jimmy – Applied Psychological Measurement, 2008
Recent work has shown that multidimensionally scoring responses from different tests can provide better ability estimates. For educational assessment data, applications of this approach have been limited to binary scores. Of the different variants, the de la Torre and Patz model is considered more general because implementing the scoring procedure…
Descriptors: Markov Processes, Scoring, Data Analysis, Item Response Theory
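Although the de la Torre and Patz model is multidimensional, the MCMC machinery behind such scoring can be shown in its simplest form: a random-walk Metropolis sampler for a single ability under a 2PL likelihood. All parameters and responses below are invented.

```python
import numpy as np

rng = np.random.default_rng(6)

# Hypothetical 2PL item parameters and one examinee's binary responses.
a = np.array([1.2, 0.9, 1.5, 1.1, 0.8])
b = np.array([-0.5, 0.0, 0.3, 0.8, -1.0])
x = np.array([1, 1, 0, 1, 1])

def log_post(theta):
    """Log posterior: 2PL likelihood plus a standard-normal prior."""
    p = 1 / (1 + np.exp(-a * (theta - b)))
    return np.sum(x * np.log(p) + (1 - x) * np.log(1 - p)) - theta**2 / 2

# Random-walk Metropolis: accept a proposed ability with probability
# proportional to the posterior ratio.
theta, draws = 0.0, []
for _ in range(5000):
    prop = theta + rng.normal(scale=0.5)
    if np.log(rng.random()) < log_post(prop) - log_post(theta):
        theta = prop
    draws.append(theta)

post = np.array(draws[1000:])            # drop burn-in
print("posterior mean ability:", post.mean().round(2))
```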
Oshima, T. C.; Raju, Nambury S.; Nanda, Alice O. – Journal of Educational Measurement, 2006
A new item parameter replication method is proposed for assessing the statistical significance of the noncompensatory differential item functioning (NCDIF) index associated with the differential functioning of items and tests framework. In this new method, a cutoff score for each item is determined by obtaining a (1 − alpha) percentile rank score…
Descriptors: Evaluation Methods, Statistical Distributions, Statistical Significance, Test Bias
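The cutoff logic is easy to sketch: replicate no-DIF item parameter estimates from an assumed estimation-error distribution, compute NCDIF for each replication, and take the (1 − alpha) percentile. Everything below (parameter values, standard errors, the 2PL form) is hypothetical.

```python
import numpy as np

rng = np.random.default_rng(7)

thetas = rng.normal(size=2000)           # focal-group ability draws

def icc(theta, a, b):
    return 1 / (1 + np.exp(-a * (theta - b)))

def ncdif(aF, bF, aR, bR):
    """Mean squared difference in expected item scores over the focal
    ability distribution."""
    d = icc(thetas, aF, bF) - icc(thetas, aR, bR)
    return np.mean(d ** 2)

# Item parameter replication: simulate pairs of estimates for the same
# item (no DIF) from its estimation-error distribution, then use the
# (1 - alpha) percentile of the NCDIF values as the cutoff.
a_hat, b_hat, se_a, se_b = 1.2, 0.3, 0.08, 0.10
reps = np.array([
    ncdif(rng.normal(a_hat, se_a), rng.normal(b_hat, se_b),
          rng.normal(a_hat, se_a), rng.normal(b_hat, se_b))
    for _ in range(1000)
])
print("NCDIF cutoff at alpha = .05:", np.percentile(reps, 95).round(4))
```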
Kang, Sun-Mee; Waller, Niels G. – Applied Psychological Measurement, 2005
Two Monte Carlo studies were conducted to explore the Type I error rates in moderated multiple regression (MMR) of observed scores and estimated latent trait scores from a two-parameter logistic item response theory (IRT) model. The results of both studies showed that MMR Type I error rates were substantially higher than the nominal alpha levels…
Descriptors: Multiple Regression Analysis, Interaction, Monte Carlo Methods, Item Response Theory
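An empirical Type I error check for the MMR interaction term is straightforward to sketch with observed scores; the simulation below uses normal predictors and a zero interaction weight, a schematic version of the studies' designs rather than their actual conditions.

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(8)

def type1_rate(reps=2000, n=200, alpha=0.05):
    """Rejection rate for the XZ interaction when its true weight is zero."""
    hits = 0
    for _ in range(reps):
        x = rng.normal(size=n)
        z = rng.normal(size=n)
        y = 0.4 * x + 0.4 * z + rng.normal(size=n)   # no interaction
        X = np.column_stack([np.ones(n), x, z, x * z])
        beta, res, *_ = np.linalg.lstsq(X, y, rcond=None)
        df = n - X.shape[1]
        se = np.sqrt(res[0] / df * np.linalg.inv(X.T @ X)[3, 3])
        tval = beta[3] / se
        hits += 2 * stats.t.sf(abs(tval), df) < alpha
    return hits / reps

print("empirical Type I error:", type1_rate())
```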
Levy, Roy; Mislevy, Robert J. – US Department of Education, 2004
The challenges of modeling students' performance in simulation-based assessments include accounting for multiple aspects of knowledge and skill that arise in different situations and the conditional dependencies among multiple aspects of performance in a complex assessment. This paper describes a Bayesian approach to modeling and estimating…
Descriptors: Probability, Markov Processes, Monte Carlo Methods, Bayesian Statistics
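At its smallest, the Bayesian approach reduces to updating a latent proficiency from conditionally independent task outcomes. The sketch below is a two-state toy, nothing like the full networks the paper estimates; all probabilities are invented.

```python
import numpy as np

# A minimal Bayes-net-style update: one latent mastery state with a
# prior, and three conditionally independent task outcomes whose
# success probabilities depend on mastery.
prior = 0.5                                   # P(mastery)
p_success = {True: [0.9, 0.8, 0.85],          # P(task_i correct | master)
             False: [0.4, 0.3, 0.35]}         # P(task_i correct | non-master)
outcomes = [1, 0, 1]                          # observed task results

def likelihood(mastery):
    ps = p_success[mastery]
    return np.prod([p if o else 1 - p for p, o in zip(ps, outcomes)])

num = prior * likelihood(True)
den = num + (1 - prior) * likelihood(False)
print("P(mastery | outcomes):", round(num / den, 3))
```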
Ziomek, Robert L.; Szymczuk, Mike – 1983
To evaluate standard-setting procedures beyond the more commonly applied approach of simply comparing derived standards or failure rates across techniques, this study investigated the classification errors associated with the contrasting groups procedure. Monte Carlo simulations were employed to produce…
Descriptors: Classification, Computer Simulation, Error of Measurement, Evaluation Methods
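A toy version of the simulation: draw scores for judged masters and non-masters, set the cut where the (equal-variance normal) densities cross, and tally misclassifications over replications. All distributional choices below are invented.

```python
import numpy as np

rng = np.random.default_rng(9)

# Contrasting-groups setup: score distributions for judged masters and
# non-masters; the cut score falls where the two densities cross, and
# the Monte Carlo replications track the resulting classification errors.
reps, cuts, fpos, fneg = 500, [], [], []
for _ in range(reps):
    nonmasters = rng.normal(60, 8, 200)
    masters = rng.normal(75, 8, 200)
    # With equal normal spreads, the densities cross at the midpoint.
    cut = (nonmasters.mean() + masters.mean()) / 2
    cuts.append(cut)
    fpos.append((nonmasters >= cut).mean())   # non-masters who pass
    fneg.append((masters < cut).mean())       # masters who fail

print("mean cut score:", np.mean(cuts).round(1))
print("false positive rate:", np.mean(fpos).round(3))
print("false negative rate:", np.mean(fneg).round(3))
```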