ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	3
Since 2006 (last 20 years)	6

Descriptor

Robustness (Statistics)	6
Item Response Theory	4
Bayesian Statistics	3
Computation	3
Simulation	3
Error of Measurement	2
Markov Processes	2
Models	2
Monte Carlo Methods	2
Response Style (Tests)	2
Ability	1
Accuracy	1
Adaptive Testing	1
College Entrance Examinations	1
Comparative Analysis	1
Construct Validity	1
Design	1
Difficulty Level	1
Educational Diagnosis	1
Elementary School Students	1
Essay Tests	1
Evaluation Methods	1
Formative Evaluation	1
Goodness of Fit	1
Graduate Study	1
More ▼

Source

ETS Research Report Series

Author

Almond, Russell G.	1
Braun, Henry	1
Dorans, Neil J.	1
Guo, Hongwen	1
Hartz, Sarah	1
Hemat, Lisa A.	1
Kim, Sooyeon	1
Lu, Ru	1
Moses, Tim	1
Mulder, Joris	1
Qu, Yanxuan	1
Roussos, Louis	1
Sandip Sinharay	1
Yan, Duanli	1
Yanxuan Qu	1
More ▼

Publication Type

Journal Articles	6
Reports - Research	6
Numerical/Quantitative Data	1

Education Level

Early Childhood Education	1
Elementary Education	1
Higher Education	1
Postsecondary Education	1
Primary Education	1

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

Early Childhood Longitudinal…	1
Graduate Record Examinations	1

What Works Clearinghouse Rating

Showing all 6 results Save | Export

Estimating Reliability for Tests with One Constructed-Response Item in a Section. Research Report. ETS RR-24-07

Peer reviewed
PDF on ERIC

Download full text

Yanxuan Qu; Sandip Sinharay – ETS Research Report Series, 2024

The goal of this paper is to find better ways to estimate the internal consistency reliability of scores on tests with a specific type of design that are often encountered in practice: tests with constructed-response items clustered into sections that are not parallel or tau-equivalent, and one of the sections has only one item. To estimate the…

Descriptors: Test Reliability, Essay Tests, Construct Validity, Error of Measurement

Robustness of Weighted Differential Item Functioning (DIF) Analysis: The Case of Mantel-Haenszel DIF Statistics. Research Report. ETS RR-21-12

Peer reviewed
PDF on ERIC

Download full text

Lu, Ru; Guo, Hongwen; Dorans, Neil J. – ETS Research Report Series, 2021

Two families of analysis methods can be used for differential item functioning (DIF) analysis. One family is DIF analysis based on observed scores, such as the Mantel-Haenszel (MH) and the standardized proportion-correct metric for DIF procedures; the other is analysis based on latent ability, in which the statistic is a measure of departure from…

Descriptors: Robustness (Statistics), Weighted Scores, Test Items, Item Analysis

Investigating Robustness of Item Response Theory Proficiency Estimators to Atypical Response Behaviors under Two-Stage Multistage Testing. ETS GRE® Board Research Report. ETS GRE®-16-03. ETS Research Report No. RR-16-22

Peer reviewed
PDF on ERIC

Download full text

Kim, Sooyeon; Moses, Tim – ETS Research Report Series, 2016

The purpose of this study is to evaluate the extent to which item response theory (IRT) proficiency estimation methods are robust to the presence of aberrant responses under the "GRE"® General Test multistage adaptive testing (MST) design. To that end, a wide range of atypical response behaviors affecting as much as 10% of the test items…

Descriptors: Item Response Theory, Computation, Robustness (Statistics), Response Style (Tests)

Robustness of Value-Added Analysis of School Effectiveness. Research Report. ETS RR-08-22

Peer reviewed
PDF on ERIC

Download full text

Braun, Henry; Qu, Yanxuan – ETS Research Report Series, 2008

This paper reports on a study conducted to investigate the consistency of the results between 2 approaches to estimating school effectiveness through value-added modeling. Estimates of school effects from the layered model employing item response theory (IRT) scaled data are compared to estimates derived from a discrete growth model based on the…

Descriptors: Value Added Models, School Effectiveness, Robustness (Statistics), Computation

Bayesian Network Models for Local Dependence among Observable Outcome Variables. Research Report. ETS RR-06-36

Peer reviewed
PDF on ERIC

Download full text

Almond, Russell G.; Mulder, Joris; Hemat, Lisa A.; Yan, Duanli – ETS Research Report Series, 2006

Bayesian network models offer a large degree of flexibility for modeling dependence among observables (item outcome variables) from the same task that may be dependent. This paper explores four design patterns for modeling locally dependent observations from the same task: (1) No context--Ignore dependence among observables; (2) Compensatory…

Descriptors: Bayesian Statistics, Networks, Models, Design

The Fusion Model for Skills Diagnosis: Blending Theory with Practicality. Research Report. ETS RR-08-71

Peer reviewed
PDF on ERIC

Download full text

Hartz, Sarah; Roussos, Louis – ETS Research Report Series, 2008

This paper presents the development of the fusion model skills diagnosis system (fusion model system), which can help integrate standardized testing into the learning process with both skills-level examinee parameters for modeling examinee skill mastery and skills-level item parameters, giving information about the diagnostic power of the test.…

Descriptors: Skill Development, Educational Diagnosis, Theory Practice Relationship, Standardized Tests