ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	1
Since 2006 (last 20 years)	9

Source

ETS Research Report Series

Publication Type

Journal Articles	13
Reports - Research	13

Education Level

Elementary Education	1
Grade 8	1
Higher Education	1
Junior High Schools	1
Middle Schools	1
Postsecondary Education	1
Secondary Education	1

Assessments and Surveys

National Assessment of…	2
Test of English as a Foreign…	1

Showing all 13 results

The Impact of Aberrant Responses and Detection in Forced-Choice Noncognitive Assessment. Research Report. ETS RR-18-32

Peer reviewed
PDF on ERIC

Kim, Sooyeon; Moses, Tim – ETS Research Report Series, 2018

The purpose of this study is to assess the impact of aberrant responses on the estimation accuracy in forced-choice format assessments. To that end, a wide range of aberrant response behaviors (e.g., fake, random, or mechanical responses) affecting upward of 20% to 30% of the responses was manipulated under the multi-unidimensional pairwise…

Descriptors: Measurement Techniques, Response Style (Tests), Accuracy, Computation

Statistical Methods for Assessments in Simulations and Serious Games. Research Report. ETS RR-14-12

Peer reviewed
PDF on ERIC

Fu, Jianbin; Zapata, Diego; Mavronikolas, Elia – ETS Research Report Series, 2014

Simulation or game-based assessments produce outcome data and process data. In this article, some statistical models that can potentially be used to analyze data from simulation or game-based assessments are introduced. Specifically, cognitive diagnostic models that can be used to estimate latent skills from outcome data so as to scale these…

Descriptors: Simulation, Evaluation Methods, Games, Data Collection

The Effects of Rater Severity and Rater Distribution on Examinees' Ability Estimation for Constructed-Response Items. Research Report. ETS RR-13-23

Peer reviewed
PDF on ERIC

Wang, Zhen; Yao, Lihua – ETS Research Report Series, 2013

The current study used simulated data to investigate the properties of a newly proposed method (Yao's rater model) for modeling rater severity and its distribution under different conditions. Our study examined the effects of rater severity, distributions of rater severity, the difference between item response theory (IRT) models with rater effect…

Descriptors: Test Format, Test Items, Responses, Computation

The Use of Quality Control and Data Mining Techniques for Monitoring Scaled Scores: An Overview. Research Report. ETS RR-12-20

Peer reviewed
PDF on ERIC

von Davier, Alina A. – ETS Research Report Series, 2012

Maintaining comparability of test scores is a major challenge faced by testing programs that have almost continuous administrations. Among the potential problems are scale drift and rapid accumulation of errors. Many standard quality control techniques for testing programs, which can effectively detect and address scale drift for small numbers of…

Descriptors: Quality Control, Data Analysis, Trend Analysis, Scaling

An Illustration of the Use of Markov Decision Processes to Represent Student Growth (Learning). Research Report. ETS RR-07-40

Peer reviewed
PDF on ERIC

Almond, Russell G. – ETS Research Report Series, 2007

Over the course of instruction, instructors generally collect a great deal of information about each student. Integrating that information intelligently requires models for how a student's proficiency changes over time. Armed with such models, instructors can "filter" the data--more accurately estimate the student's current proficiency…

Descriptors: Markov Processes, Decision Making, Student Evaluation, Learning Processes

Bayesian Network Models for Local Dependence among Observable Outcome Variables. Research Report. ETS RR-06-36

Peer reviewed
PDF on ERIC

Almond, Russell G.; Mulder, Joris; Hemat, Lisa A.; Yan, Duanli – ETS Research Report Series, 2006

Bayesian network models offer a large degree of flexibility for modeling dependence among observables (item outcome variables) from the same task that may be dependent. This paper explores four design patterns for modeling locally dependent observations from the same task: (1) No context--Ignore dependence among observables; (2) Compensatory…

Descriptors: Bayesian Statistics, Networks, Models, Design

The Fusion Model for Skills Diagnosis: Blending Theory with Practicality. Research Report. ETS RR-08-71

Peer reviewed
PDF on ERIC

Hartz, Sarah; Roussos, Louis – ETS Research Report Series, 2008

This paper presents the development of the fusion model skills diagnosis system (fusion model system), which can help integrate standardized testing into the learning process with both skills-level examinee parameters for modeling examinee skill mastery and skills-level item parameters, giving information about the diagnostic power of the test.…

Descriptors: Skill Development, Educational Diagnosis, Theory Practice Relationship, Standardized Tests

On the Estimation of Hierarchical Latent Linear Models for Large Scale Assessments. Research Report. ETS RR-06-37

Peer reviewed
PDF on ERIC

Deping, Li; Oranje, Andreas – ETS Research Report Series, 2006

A hierarchical latent regression model is suggested to estimate nested and nonnested relationships in complex samples such as found in the National Assessment of Educational Progress (NAEP). The proposed model aims at improving both parameters and variance estimates via a two-level hierarchical linear model. This model falls naturally within the…

Descriptors: Hierarchical Linear Modeling, Computation, Measurement, Regression (Statistics)

User's Guide for SCORIGHT (Version 3.0): A Computer Program for Scoring Tests Built of Testlets Including a Module for Covariate Analysis. Research Report. ETS RR-04-49

Peer reviewed
PDF on ERIC

Wang, Xiaohui; Bradlow, Eric T.; Wainer, Howard – ETS Research Report Series, 2005

SCORIGHT is a very general computer program for scoring tests. It models tests that are made up of dichotomously or polytomously rated items or any kind of combination of the two through the use of a generalized item response theory (IRT) formulation. The items can be presented independently or grouped into clumps of allied items (testlets) or in…

Descriptors: Computer Assisted Testing, Statistical Analysis, Test Items, Bayesian Statistics

A Bayesian Hierarchical Model for Large-Scale Educational Surveys: An Application to the National Assessment of Educational Progress. Research Report. ETS RR-04-38

Peer reviewed
PDF on ERIC

Johnson, Matthew S.; Jenkins, Frank – ETS Research Report Series, 2005

Large-scale educational assessments such as the National Assessment of Educational Progress (NAEP) sample examinees to whom an exam will be administered. In most situations the sampling design is not a simple random sample and must be accounted for in the estimating model. After reviewing the current operational estimation procedure for NAEP, this…

Descriptors: Bayesian Statistics, Hierarchical Linear Modeling, National Competency Tests, Sampling

Rasch Rating Scale Modeling of Data from the Standardized Letter of Recommendation. Research Report. ETS RR-06-33

Peer reviewed
PDF on ERIC

Kim, Sooyeon; Kyllonen, Patrick C. – ETS Research Report Series, 2006

The Standardized Letter of Recommendation (SLR), a 28-item form, was created by ETS to supplement the qualitative rating of graduate school applicants' nonacademic qualities with a quantitative approach. The purpose of this study was to evaluate the following psychometric properties of the SLR using the Rasch rating-scale model: dimensionality,…

Descriptors: Item Response Theory, Rating Scales, Data Analysis, Models

A General Diagnostic Model Applied to Language Testing Data. Research Report. ETS RR-05-16

Peer reviewed
PDF on ERIC

von Davier, Matthias – ETS Research Report Series, 2005

Probabilistic models with more than one latent variable are designed to report profiles of skills or cognitive attributes. Testing programs want to offer additional information beyond what a single test score can provide using these skill profiles. Many recent approaches to skill profile models are limited to dichotomous data and have made use of…

Descriptors: Models, Diagnostic Tests, Language Tests, Language Proficiency

Factor Structure of the LanguEdge™ Test across Language Groups. TOEFL® Monograph Series. MS-32. ETS RR-05-12

Peer reviewed
PDF on ERIC

Stricker, Lawrence J.; Rock, Donald A.; Lee, Yong-Won – ETS Research Report Series, 2005

This study assessed the factor structure of the LanguEdge™ test and the invariance of its factors across language groups. Confirmatory factor analyses of individual tasks and subsets of items in the four sections of the test, Listening, Reading, Speaking, and Writing, were carried out for Arabic-, Chinese-, and Spanish-speaking test takers. Two…

Descriptors: Factor Structure, Language Tests, Factor Analysis, Semitic Languages

Descriptor

Markov Processes	13
Monte Carlo Methods	9
Computation	7
Item Response Theory	7
Bayesian Statistics	6
Models	6
Simulation	5
Comparative Analysis	4
Computer Assisted Testing	4
Statistical Analysis	4
Test Items	4
Accuracy	3
Data Analysis	3
Goodness of Fit	3
Maximum Likelihood Statistics	3
Scores	3
English (Second Language)	2
Error of Measurement	2
Evaluation Methods	2
Factor Analysis	2
Factor Structure	2
Hierarchical Linear Modeling	2
Language Tests	2
Learning Processes	2
Multivariate Analysis	2

Author

Almond, Russell G.	2
Kim, Sooyeon	2
Bradlow, Eric T.	1
Deping, Li	1
Fu, Jianbin	1
Hartz, Sarah	1
Hemat, Lisa A.	1
Jenkins, Frank	1
Johnson, Matthew S.	1
Kyllonen, Patrick C.	1
Lee, Yong-Won	1
Mavronikolas, Elia	1
Moses, Tim	1
Mulder, Joris	1
Oranje, Andreas	1
Rock, Donald A.	1
Roussos, Louis	1
Stricker, Lawrence J.	1
Wainer, Howard	1
Wang, Xiaohui	1
Wang, Zhen	1
Yan, Duanli	1
Yao, Lihua	1
Zapata, Diego	1
von Davier, Alina A.	1