ERIC - Search Results

Showing all 15 results

Using Item Scores and Distractors to Detect Item Compromise and Preknowledge

Peer reviewed

Gorney, Kylie; Wollack, James A.; Sinharay, Sandip; Eckerly, Carol – Journal of Educational and Behavioral Statistics, 2023

Any time examinees have had access to items and/or answers prior to taking a test, the fairness of the test and validity of test score interpretations are threatened. Therefore, there is a high demand for procedures to detect both compromised items (CI) and examinees with preknowledge (EWP). In this article, we develop a procedure that uses item…

Descriptors: Scores, Test Validity, Test Items, Prior Learning

Ordinal Approaches to Decomposing Between-Group Test Score Disparities

Peer reviewed

Quinn, David M.; Ho, Andrew D. – Journal of Educational and Behavioral Statistics, 2021

The estimation of test score "gaps" and gap trends plays an important role in monitoring educational inequality. Researchers decompose gaps and gap changes into within- and between-school portions to generate evidence on the role schools play in shaping these inequalities. However, existing decomposition methods assume an equal-interval…

Descriptors: Scores, Tests, Achievement Gap, Equal Education

Jenss-Bayley Latent Change Score Model with Individual Ratio of the Growth Acceleration in the Framework of Individual Measurement Occasions

Peer reviewed

Liu, Jin – Journal of Educational and Behavioral Statistics, 2022

Longitudinal data analysis has been widely employed to examine between-individual differences in within-individual changes. One challenge of such analyses is that the rate-of-change is only available indirectly when change patterns are nonlinear with respect to time. Latent change score models (LCSMs), which can be employed to investigate the…

Descriptors: Longitudinal Studies, Individual Differences, Scores, Models

Estimating Difference-Score Reliability in Pretest-Posttest Settings

Peer reviewed

Gu, Zhengguo; Emons, Wilco H. M.; Sijtsma, Klaas – Journal of Educational and Behavioral Statistics, 2021

Clinical, medical, and health psychologists use difference scores obtained from pretest-posttest designs employing the same test to assess intraindividual change possibly caused by an intervention addressing, for example, anxiety, depression, eating disorder, or addiction. Reliability of difference scores is important for interpreting observed…

Descriptors: Test Reliability, Scores, Pretests Posttests, Computation

Testing Latent Variable Distribution Fit in IRT Using Posterior Residuals

Peer reviewed

Monroe, Scott – Journal of Educational and Behavioral Statistics, 2021

This research proposes a new statistic for testing latent variable distribution fit for unidimensional item response theory (IRT) models. If the typical assumption of normality is violated, then item parameter estimates will be biased, and dependent quantities such as IRT score estimates will be adversely affected. The proposed statistic compares…

Descriptors: Item Response Theory, Simulation, Scores, Comparative Analysis

Applications of Small Area Estimation to Generalization with Subclassification by Propensity Scores

Peer reviewed

Chan, Wendy – Journal of Educational and Behavioral Statistics, 2018

Policymakers have grown increasingly interested in how experimental results may generalize to a larger population. However, recently developed propensity score-based methods are limited by small sample sizes, where the experimental study is generalized to a population that is at least 20 times larger. This is particularly problematic for methods…

Descriptors: Computation, Generalization, Probability, Sample Size

An Aggregate IRT Procedure for Exploratory Factor Analysis

Peer reviewed

Camilli, Gregory; Fox, Jean-Paul – Journal of Educational and Behavioral Statistics, 2015

An aggregation strategy is proposed to potentially address a practical limitation related to computing resources for two-level multidimensional item response theory (MIRT) models with large data sets. The aggregate model is derived by integration of the normal ogive model, and an adaptation of the stochastic approximation expectation maximization…

Descriptors: Factor Analysis, Item Response Theory, Grade 4, Simulation

Confidence Intervals for Assessing Heterogeneity in Generalized Linear Mixed Models

Peer reviewed

Wagler, Amy E. – Journal of Educational and Behavioral Statistics, 2014

Generalized linear mixed models are frequently applied to data with clustered categorical outcomes. The effect of clustering on the response is often difficult to practically assess partly because it is reported on a scale on which comparisons with regression parameters are difficult to make. This article proposes confidence intervals for…

Descriptors: Hierarchical Linear Modeling, Cluster Grouping, Heterogeneous Grouping, Monte Carlo Methods

Multimodal Likelihoods in Educational Assessment: Will the Real Maximum Likelihood Score Please Stand up?

Peer reviewed

Wothke, Werner; Burket, George; Chen, Li-Sue; Gao, Furong; Shu, Lianghua; Chia, Mike – Journal of Educational and Behavioral Statistics, 2011

It has been known for some time that item response theory (IRT) models may exhibit a likelihood function of a respondent's ability which may have multiple modes, flat modes, or both. These conditions, often associated with guessing of multiple-choice (MC) questions, can introduce uncertainty and bias to ability estimation by maximum likelihood…

Descriptors: Educational Assessment, Item Response Theory, Computation, Maximum Likelihood Statistics

Do Typical RCTs of Education Interventions Have Sufficient Statistical Power for Linking Impacts on Teacher Practice and Student Achievement Outcomes?

Peer reviewed

Schochet, Peter Z. – Journal of Educational and Behavioral Statistics, 2011

For RCTs of education interventions, it is often of interest to estimate associations between student and mediating teacher practice outcomes, to examine the extent to which the study's conceptual model is supported by the data, and to identify specific mediators that are most associated with student learning. This article develops statistical…

Descriptors: Least Squares Statistics, Intervention, Academic Achievement, Correlation

Modeling Heterogeneity in Relationships between Initial Status and Rates of Change: Treating Latent Variable Regression Coefficients as Random Coefficients in a Three-Level Hierarchical Model

Peer reviewed

Choi, Kilchan; Seltzer, Michael – Journal of Educational and Behavioral Statistics, 2010

In studies of change in education and numerous other fields, interest often centers on how differences in the status of individuals at the start of a period of substantive interest relate to differences in subsequent change. In this article, the authors present a fully Bayesian approach to estimating three-level Hierarchical Models in which latent…

Descriptors: Simulation, Computation, Models, Bayesian Statistics

Nonparametric Item Response Function Estimates with the EM Algorithm

Peer reviewed

Rossi, Natasha; Wang, Xiaohui; Ramsay, James O. – Journal of Educational and Behavioral Statistics, 2002

Combined several developments in statistics and item response theory to develop a procedure for analysis of dichotomously scored test data. This version of nonparametric item response analysis, as illustrated through simulation and with data from other studies, marginalizes the role of the ability parameter theta. (SLD)

Descriptors: Ability, Item Response Theory, Nonparametric Statistics, Scores

Standard Error of Linear Equating for the Counterbalanced Design

Peer reviewed

Zeng, Lingjia; Cope, Ronald T. – Journal of Educational and Behavioral Statistics, 1995

Large-sample standard errors of linear equating for the counterbalanced design are derived using the general delta method. Computer simulations found that standard errors derived without the normality assumption were more accurate than those derived with the normality assumption in a large sample with moderately skewed score distributions. (SLD)

Descriptors: Computer Simulation, Error of Measurement, Research Design, Sample Size

Interval Estimation for True Raw and Scale Scores under the Binomial Error Model

Peer reviewed

Lee, Won-Chan; Brennan, Robert L.; Kolen, Michael J. – Journal of Educational and Behavioral Statistics, 2006

Assuming errors of measurement are distributed binomially, this article reviews various procedures for constructing an interval for an individual's true number-correct score; presents two general interval estimation procedures for an individual's true scale score (i.e., normal approximation and endpoints conversion methods); compares various…

Descriptors: Probability, Intervals, Guidelines, Computer Simulation

An Item Response Model for Characterizing Test Compromise

Peer reviewed

Segall, Daniel O. – Journal of Educational and Behavioral Statistics, 2002

Developed an item response model for characterizing test-compromise that enables the estimation of item preview and score-gain distributions. In the approach, model parameters and posterior distributions are estimated by Markov Chain Monte Carlo procedures. Simulation study results suggest that when at least some test items are known to be…

Descriptors: Estimation (Mathematics), Item Response Theory, Markov Processes, Models
