ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	3
Since 2006 (last 20 years)	15

Source

ETS Research Report Series

Publication Type

Journal Articles	16
Reports - Research	16
Numerical/Quantitative Data	1
Tests/Questionnaires	1

Education Level

Secondary Education	2
Grade 12	1
High Schools	1
Junior High Schools	1
Middle Schools	1

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

National Assessment of…	1
Test of English as a Foreign…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 16 results Save | Export

A Comparison of Score Aggregation Methods for Unidimensional Tests on Different Dimensions. Research Report. ETS RR-18-01

Peer reviewed
PDF on ERIC

Download full text

Fu, Jianbin; Feng, Yuling – ETS Research Report Series, 2018

In this study, we propose aggregating test scores with unidimensional within-test structure and multidimensional across-test structure based on a 2-level, 1-factor model. In particular, we compare 6 score aggregation methods: average of standardized test raw scores (M1), regression factor score estimate of the 1-factor model based on the…

Descriptors: Comparative Analysis, Scores, Correlation, Standardized Tests

Taming Log Files from Game/Simulation-Based Assessments: Data Models and Data Analysis Tools. Research Report. ETS RR-16-10

Peer reviewed
PDF on ERIC

Download full text

Hao, Jiangang; Smith, Lawrence; Mislevy, Robert; von Davier, Alina; Bauer, Malcolm – ETS Research Report Series, 2016

Extracting information efficiently from game/simulation-based assessment (G/SBA) logs requires two things: a well-structured log file and a set of analysis methods. In this report, we propose a generic data model specified as an extensible markup language (XML) schema for the log files of G/SBAs. We also propose a set of analysis methods for…

Descriptors: Evaluation Methods, Games, Computer Assisted Testing, Data Collection

Agent-Based Modeling of Collaborative Problem Solving. Research Report. ETS RR-16-27

Peer reviewed
PDF on ERIC

Download full text

Bergner, Yoav; Andrews, Jessica J.; Zhu, Mengxiao; Gonzales, Joseph E. – ETS Research Report Series, 2016

Collaborative problem solving (CPS) is a critical competency in a variety of contexts, including the workplace, school, and home. However, only recently have assessment and curriculum reformers begun to focus to a greater extent on the acquisition and development of CPS skill. One of the major challenges in psychometric modeling of CPS is…

Descriptors: Problem Solving, Cooperative Learning, Evaluation Methods, Models

Effect of Item Response Theory (IRT) Model Selection on Testlet-Based Test Equating. Research Report. ETS RR-14-19

Peer reviewed
PDF on ERIC

Download full text

Cao, Yi; Lu, Ru; Tao, Wei – ETS Research Report Series, 2014

The local item independence assumption underlying traditional item response theory (IRT) models is often not met for tests composed of testlets. There are 3 major approaches to addressing this issue: (a) ignore the violation and use a dichotomous IRT model (e.g., the 2-parameter logistic [2PL] model), (b) combine the interdependent items to form a…

Descriptors: Item Response Theory, Equated Scores, Test Items, Simulation

Simulate to Understand Models, Not Nature. Research Report. ETS RR-14-16

Peer reviewed
PDF on ERIC

Download full text

Dorans, Neil J. – ETS Research Report Series, 2014

Simulations are widely used. Simulations produce numbers that are deductive demonstrations of what a model says will happen.They produce numerical results that are consistent with the premises of the model used to generate the numbers. These simulated numerical results are not empirical data that address aspects of the world that lies outside the…

Descriptors: Simulation, Equated Scores, Scores, Scientific Methodology

A "CBAL"™ Science Model of Cognition: Developing a Competency Model and Learning Progressions to Support Assessment Development. Research Report. ETS RR-13-29

Peer reviewed
PDF on ERIC

Download full text

Liu, Lei; Rogat, Aaron; Bertling, Maria – ETS Research Report Series, 2013

The purpose of this report is to describe a science competency model and 3 related learning progressions, which were developed by applying the "CBAL"™ approach (Bennett & Gitomer, 2009) to the domain of middle school science. The Cognitively Based Assessment "of", "for", and "as" Learning (CBAL) science…

Descriptors: Science Education, Competence, Models, Learning Processes

Investigating the Suitability of Implementing the "e-rater"® Scoring Engine in a Large-Scale English Language Testing Program. Research Report. ETS RR-13-36

Peer reviewed
PDF on ERIC

Download full text

Zhang, Mo; Breyer, F. Jay; Lorenz, Florian – ETS Research Report Series, 2013

In this research, we investigated the suitability of implementing "e-rater"® automated essay scoring in a high-stakes large-scale English language testing program. We examined the effectiveness of generic scoring and 2 variants of prompt-based scoring approaches. Effectiveness was evaluated on a number of dimensions, including agreement…

Descriptors: Computer Assisted Testing, Computer Software, Scoring, Language Tests

The Influence of Strategies for Selecting Loglinear Smoothing Models on Equating Functions. Research Report. ETS RR-08-25

Peer reviewed
PDF on ERIC

Download full text

Moses, Tim; Holland, Paul – ETS Research Report Series, 2008

This study addressed 2 issues of using loglinear models for smoothing univariate test score distributions and for enhancing the stability of equipercentile equating functions. One issue was a comparative assessment of several statistical strategies that have been proposed for selecting 1 from several competing model parameterizations. Another…

Descriptors: Equated Scores, Selection, Models, Statistical Analysis

Studies of a Latent-Class Signal-Detection Model for Constructed-Response Scoring. Research Report. ETS RR-08-63

Peer reviewed
PDF on ERIC

Download full text

DeCarlo, Lawrence T. – ETS Research Report Series, 2008

Rater behavior in essay grading can be viewed as a signal-detection task, in that raters attempt to discriminate between latent classes of essays, with the latent classes being defined by a scoring rubric. The present report examines basic aspects of an approach to constructed-response (CR) scoring via a latent-class signal-detection model. The…

Descriptors: Scoring, Responses, Test Format, Bias

An Illustration of the Use of Markov Decision Processes to Represent Student Growth (Learning). Research Report. ETS RR-07-40

Peer reviewed
PDF on ERIC

Download full text

Almond, Russell G. – ETS Research Report Series, 2007

Over the course of instruction, instructors generally collect a great deal of information about each student. Integrating that information intelligently requires models for how a student's proficiency changes over time. Armed with such models, instructors can "filter" the data--more accurately estimate the student's current proficiency…

Descriptors: Markov Processes, Decision Making, Student Evaluation, Learning Processes

Comparison of Multistage Tests with Computerized Adaptive and Paper-and-Pencil Tests. Research Report. ETS RR-07-04

Peer reviewed
PDF on ERIC

Download full text

Rotou, Ourania; Patsula, Liane; Steffen, Manfred; Rizavi, Saba – ETS Research Report Series, 2007

Traditionally, the fixed-length linear paper-and-pencil (P&P) mode of administration has been the standard method of test delivery. With the advancement of technology, however, the popularity of administering tests using adaptive methods like computerized adaptive testing (CAT) and multistage testing (MST) has grown in the field of measurement…

Descriptors: Comparative Analysis, Test Format, Computer Assisted Testing, Models

Bayesian Network Models for Local Dependence among Observable Outcome Variables. Research Report. ETS RR-06-36

Peer reviewed
PDF on ERIC

Download full text

Almond, Russell G.; Mulder, Joris; Hemat, Lisa A.; Yan, Duanli – ETS Research Report Series, 2006

Bayesian network models offer a large degree of flexibility for modeling dependence among observables (item outcome variables) from the same task that may be dependent. This paper explores four design patterns for modeling locally dependent observations from the same task: (1) No context--Ignore dependence among observables; (2) Compensatory…

Descriptors: Bayesian Statistics, Networks, Models, Design

Monitoring and Fostering Learning through Games and Embedded Assessments. Research Report. ETS RR-08-69

Peer reviewed
PDF on ERIC

Download full text

Shute, Valerie J.; Ventura, Matthew; Bauer, Malcolm; Zapata-Rivera, Diego – ETS Research Report Series, 2008

To reveal what is being learned during the gaming experience, this report proposes an approach for embedding assessments in immersive games, drawing on recent advances in assessment design. Key to this approach are formative assessment to guide instructional experiences and evidence-centered design to systematically analyze the assessment argument…

Descriptors: Educational Games, Formative Evaluation, Instructional Design, Evidence Based Practice

The Fusion Model for Skills Diagnosis: Blending Theory with Practicality. Research Report. ETS RR-08-71

Peer reviewed
PDF on ERIC

Download full text

Hartz, Sarah; Roussos, Louis – ETS Research Report Series, 2008

This paper presents the development of the fusion model skills diagnosis system (fusion model system), which can help integrate standardized testing into the learning process with both skills-level examinee parameters for modeling examinee skill mastery and skills-level item parameters, giving information about the diagnostic power of the test.…

Descriptors: Skill Development, Educational Diagnosis, Theory Practice Relationship, Standardized Tests

Cognitive Diagnosis for NAEP Proficiency Data. Research Report. ETS RR-06-08

Peer reviewed
PDF on ERIC

Download full text

Xu, Xueli; von Davier, Matthias – ETS Research Report Series, 2006

More than a dozen statistical models have been developed for the purpose of cognitive diagnosis. These models are supposed to extract a much finer level of information from item responses than traditional unidimensional item response models. In this paper, a general diagnostic model (GDM) was used to analyze a set of simulated sparse data and real…

Descriptors: Statistical Analysis, National Competency Tests, Diagnostic Tests, Item Response Theory

Previous Page | Next Page »

Pages: 1 | 2

Models	16
Simulation	15
Comparative Analysis	6
Item Response Theory	6
Scores	5
Test Items	5
Bayesian Statistics	4
Computer Assisted Testing	4
Evaluation Methods	4
Markov Processes	4
Statistical Analysis	4
Accuracy	3
Equated Scores	3
Formative Evaluation	3
Learning Processes	3
Monte Carlo Methods	3
Student Evaluation	3
Competence	2
Computation	2
Correlation	2
Diagnostic Tests	2
English (Second Language)	2
Evidence Based Practice	2
Generalization	2
Item Analysis	2
More ▼

Almond, Russell G.	2
Bauer, Malcolm	2
von Davier, Matthias	2
Andrews, Jessica J.	1
Bergner, Yoav	1
Bertling, Maria	1
Breyer, F. Jay	1
Cao, Yi	1
DeCarlo, Lawrence T.	1
Dorans, Neil J.	1
Feng, Yuling	1
Fu, Jianbin	1
Gonzales, Joseph E.	1
Hao, Jiangang	1
Hartz, Sarah	1
Hemat, Lisa A.	1
Holland, Paul	1
Liu, Lei	1
Lorenz, Florian	1
Lu, Ru	1
Mislevy, Robert	1
Moses, Tim	1
Mulder, Joris	1
Patsula, Liane	1
Rizavi, Saba	1
More ▼