ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	2
Since 2006 (last 20 years)	12

Descriptor

Educational Assessment	15
Statistical Analysis	15
Simulation	14
Comparative Analysis	6
Item Response Theory	5
Measurement	5
Psychometrics	5
Models	4
Test Items	4
Computer Software	3
Evaluation Methods	3
Foreign Countries	3
Mathematics Tests	3
Scores	3
Standardized Tests	3
Accuracy	2
Achievement Tests	2
Bayesian Statistics	2
Classification	2
Computer Assisted Testing	2
Data Analysis	2
Educational Diagnosis	2
Elementary Secondary Education	2
Equated Scores	2
Error of Measurement	2
More ▼

Source

ProQuest LLC	4
International Journal of…	2
Journal of Educational…	2
American Institutes for…	1
Applied Measurement in…	1
International Working Group…	1
Large-scale Assessments in…	1
Measurement:…	1

Publication Type

Journal Articles	7
Reports - Research	6
Dissertations/Theses -…	4
Reports - Evaluative	3
Collected Works - Proceedings	1
Guides - General	1
Numerical/Quantitative Data	1
Speeches/Meeting Papers	1

Education Level

Elementary Secondary Education	5
Grade 7	2
Grade 8	2
Adult Education	1
Elementary Education	1
Grade 10	1
Grade 12	1
Grade 4	1
Grade 9	1
High Schools	1
Higher Education	1
Junior High Schools	1
Middle Schools	1
Postsecondary Education	1
Secondary Education	1
More ▼

Audience

Location

Pennsylvania	2
Australia	1
Canada	1
Czech Republic	1
Israel	1
Massachusetts	1
Netherlands	1
North Carolina	1
Slovakia	1
Spain	1
Utah	1
Washington	1
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001

Assessments and Surveys

Massachusetts Comprehensive…	1
Trends in International…	1

What Works Clearinghouse Rating

Showing all 15 results Save | Export

GDINA and CDM Packages in R

Peer reviewed

Direct link

Rupp, André A.; van Rijn, Peter W. – Measurement: Interdisciplinary Research and Perspectives, 2018

We review the GIDNA and CDM packages in R for fitting cognitive diagnosis/diagnostic classification models. We first provide a summary of their core capabilities and then use both simulated and real data to compare their functionalities in practice. We found that the most relevant routines in the two packages appear to be more similar than…

Descriptors: Educational Assessment, Cognitive Measurement, Measurement, Computer Software

IRT Item Parameter Scaling for Developing New Item Pools

Peer reviewed

Direct link

Kang, Hyeon-Ah; Lu, Ying; Chang, Hua-Hua – Applied Measurement in Education, 2017

Increasing use of item pools in large-scale educational assessments calls for an appropriate scaling procedure to achieve a common metric among field-tested items. The present study examines scaling procedures for developing a new item pool under a spiraled block linking design. The three scaling procedures are considered: (a) concurrent…

Descriptors: Item Response Theory, Accuracy, Educational Assessment, Test Items

Detection of Invalid Test Scores: The Usefulness of Simple Nonparametric Statistics

Peer reviewed

Direct link

Tendeiro, Jorge N.; Meijer, Rob R. – Journal of Educational Measurement, 2014

In recent guidelines for fair educational testing it is advised to check the validity of individual test scores through the use of person-fit statistics. For practitioners it is unclear on the basis of the existing literature which statistic to use. An overview of relatively simple existing nonparametric approaches to identify atypical response…

Descriptors: Educational Assessment, Test Validity, Scores, Statistical Analysis

Posterior Predictive Model Checking in Bayesian Networks

Direct link

Crawford, Aaron – ProQuest LLC, 2014

This simulation study compared the utility of various discrepancy measures within a posterior predictive model checking (PPMC) framework for detecting different types of data-model misfit in multidimensional Bayesian network (BN) models. The investigated conditions were motivated by an applied research program utilizing an operational complex…

Descriptors: Bayesian Statistics, Networks, Models, Goodness of Fit

Detecting Differential Item Functioning Using Generalized Logistic Regression in the Context of Large-Scale Assessments

Peer reviewed

Direct link

Svetina, Dubravka; Rutkowski, Leslie – Large-scale Assessments in Education, 2014

Background: When studying student performance across different countries or cultures, an important aspect for comparisons is that of score comparability. In other words, it is imperative that the latent variable (i.e., construct of interest) is understood and measured equivalently across all participating groups or countries, if our inferences…

Descriptors: Test Items, Item Response Theory, Item Analysis, Regression (Statistics)

A Study of Assessments Designed for Student Success

Direct link

Delepine, Sidney G., III – ProQuest LLC, 2012

The purpose of this quantitative study is to compare a new assessment tool, the SkillsUSA Connect Assessment with the NOCTI assessment to determine which test results in more students achieving success. A quantitative study, designed to compare test scores of students taking the NOCTI assessment and new assessments from SkillsUSA, called the…

Descriptors: Educational Assessment, Academic Achievement, Scores, Comparative Analysis

Observed-Score Equating with a Heterogeneous Target Population

Peer reviewed

Direct link

Duong, Minh Q.; von Davier, Alina A. – International Journal of Testing, 2012

Test equating is a statistical procedure for adjusting for test form differences in difficulty in a standardized assessment. Equating results are supposed to hold for a specified target population (Kolen & Brennan, 2004; von Davier, Holland, & Thayer, 2004) and to be (relatively) independent of the subpopulations from the target population (see…

Descriptors: Ability Grouping, Difficulty Level, Psychometrics, Statistical Analysis

A Multidimensional Scaling Approach to Dimensionality Assessment for Measurement Instruments Modeled by Multidimensional Item Response Theory

Direct link

Toro, Maritsa – ProQuest LLC, 2011

The statistical assessment of dimensionality provides evidence of the underlying constructs measured by a survey or test instrument. This study focuses on educational measurement, specifically tests comprised of items described as multidimensional. That is, items that require examinee proficiency in multiple content areas and/or multiple cognitive…

Descriptors: Multidimensional Scaling, Measurement, Statistical Analysis, Item Response Theory

Modeling Item-Level and Step-Level Invariance Effects in Polytomous Items Using the Partial Credit Model

Peer reviewed

Direct link

Gattamorta, Karina A.; Penfield, Randall D.; Myers, Nicholas D. – International Journal of Testing, 2012

Measurement invariance is a common consideration in the evaluation of the validity and fairness of test scores when the tested population contains distinct groups of examinees, such as examinees receiving different forms of a translated test. Measurement invariance in polytomous items has traditionally been evaluated at the item-level,…

Descriptors: Foreign Countries, Psychometrics, Test Bias, Test Items

Matching for Bias Reduction in Treatment Effect Estimation of Hierarchically Structured Synthetic Cohort Design Data

Direct link

Wang, Qiu – ProQuest LLC, 2010

This study uses a multi-level multivariate propensity score matching approach to examine the synthetic cohort design (SCD) in estimating the schooling effect on mathematics proficiency of the focal cohort 2 (8th graders). Collecting 7th and 8th graders at the same time point, the SCD is sufficient in estimating the schooling effect under the…

Descriptors: Program Evaluation, Structural Equation Models, Grade 8, Cohort Analysis

Model-Free CUSUM Methods for Person Fit

Peer reviewed

Direct link

Armstrong, Ronald D.; Shi, Min – Journal of Educational Measurement, 2009

This article demonstrates the use of a new class of model-free cumulative sum (CUSUM) statistics to detect person fit given the responses to a linear test. The fundamental statistic being accumulated is the likelihood ratio of two probabilities. The detection performance of this CUSUM scheme is compared to other model-free person-fit statistics…

Descriptors: Probability, Simulation, Models, Psychometrics

Traditional Dimensionality vs. Essential Dimensionality.

Download full text

Nandakumar, Ratna – 1989

The theoretical differences between the traditional definition of dimensionality and the more recently defined notion of essential dimensionality are presented. Monte Carlo simulations are used to demonstrate the utility of W. F. Stout's procedure to assess the essential unidimensionality of the latent space underlying a set of terms. The…

Descriptors: Definitions, Educational Assessment, Latent Trait Theory, Mathematical Models

Evaluation of Bias Correction Methods for "Worst-case" Selective Non-participation in NAEP

Download full text

McLaughlin, Don; Gallagher, Larry; Stancavage, Fran – American Institutes for Research, 2004

With the advent of No Child Left Behind (NCLB), the context for NAEP participation is changing. Whereas in the past participation in NAEP has always been voluntary, participation is now mandatory for some grade and subjects among schools receiving Title I funds. While this will certainly raise school-level participation rates in the mandated…

Descriptors: Federal Legislation, School Districts, Participation, Educational Assessment

Interest in School and Learning. A Guide to the Analysis and Interpretation of EQA Scores and Related Intervention Techniques. Guide to Strategies for Improvement, Goal 4. First Edition.

Download full text

Guerriero, Carl A. – 1974

This document is designed to assist school district personnel in the identification of intervention strategies that have a good probability of increasing the district's mean score on the Goal IV Educational Quality Assessment (EQA) instrument. Appropriate educational research has been reviewed and distilled into seven propositions that are…

Descriptors: Curriculum Development, Educational Assessment, Educational Diagnosis, Elementary Secondary Education

Proceedings of the International Conference on Educational Data Mining (EDM) (2nd, Cordoba, Spain, July 1-3, 2009)

Download full text

Barnes, Tiffany, Ed.; Desmarais, Michel, Ed.; Romero, Cristobal, Ed.; Ventura, Sebastian, Ed. – International Working Group on Educational Data Mining, 2009

The Second International Conference on Educational Data Mining (EDM2009) was held at the University of Cordoba, Spain, on July 1-3, 2009. EDM brings together researchers from computer science, education, psychology, psychometrics, and statistics to analyze large data sets to answer educational research questions. The increase in instrumented…

Descriptors: Data Analysis, Educational Research, Conferences (Gatherings), Foreign Countries

Armstrong, Ronald D.	1
Barnes, Tiffany, Ed.	1
Chang, Hua-Hua	1
Crawford, Aaron	1
Delepine, Sidney G., III	1
Desmarais, Michel, Ed.	1
Duong, Minh Q.	1
Gallagher, Larry	1
Gattamorta, Karina A.	1
Guerriero, Carl A.	1
Kang, Hyeon-Ah	1
Lu, Ying	1
McLaughlin, Don	1
Meijer, Rob R.	1
Myers, Nicholas D.	1
Nandakumar, Ratna	1
Penfield, Randall D.	1
Romero, Cristobal, Ed.	1
Rupp, André A.	1
Rutkowski, Leslie	1
Shi, Min	1
Stancavage, Fran	1
Svetina, Dubravka	1
Tendeiro, Jorge N.	1
Toro, Maritsa	1
More ▼