Fu, Jianbin; Feng, Yuling – ETS Research Report Series, 2018
In this study, we propose aggregating test scores with unidimensional within-test structure and multidimensional across-test structure based on a 2-level, 1-factor model. In particular, we compare 6 score aggregation methods: average of standardized test raw scores (M1), regression factor score estimate of the 1-factor model based on the…
Descriptors: Comparative Analysis, Scores, Correlation, Standardized Tests
Hao, Jiangang; Smith, Lawrence; Mislevy, Robert; von Davier, Alina; Bauer, Malcolm – ETS Research Report Series, 2016
Extracting information efficiently from game/simulation-based assessment (G/SBA) logs requires two things: a well-structured log file and a set of analysis methods. In this report, we propose a generic data model specified as an extensible markup language (XML) schema for the log files of G/SBAs. We also propose a set of analysis methods for…
Descriptors: Evaluation Methods, Games, Computer Assisted Testing, Data Collection
Bergner, Yoav; Andrews, Jessica J.; Zhu, Mengxiao; Gonzales, Joseph E. – ETS Research Report Series, 2016
Collaborative problem solving (CPS) is a critical competency in a variety of contexts, including the workplace, school, and home. However, only recently have assessment and curriculum reformers begun to focus to a greater extent on the acquisition and development of CPS skill. One of the major challenges in psychometric modeling of CPS is…
Descriptors: Problem Solving, Cooperative Learning, Evaluation Methods, Models
Cao, Yi; Lu, Ru; Tao, Wei – ETS Research Report Series, 2014
The local item independence assumption underlying traditional item response theory (IRT) models is often not met for tests composed of testlets. There are 3 major approaches to addressing this issue: (a) ignore the violation and use a dichotomous IRT model (e.g., the 2-parameter logistic [2PL] model), (b) combine the interdependent items to form a…
Descriptors: Item Response Theory, Equated Scores, Test Items, Simulation
Dorans, Neil J. – ETS Research Report Series, 2014
Simulations are widely used. Simulations produce numbers that are deductive demonstrations of what a model says will happen. They produce numerical results that are consistent with the premises of the model used to generate the numbers. These simulated numerical results are not empirical data that address aspects of the world that lies outside the…
Descriptors: Simulation, Equated Scores, Scores, Scientific Methodology
Liu, Lei; Rogat, Aaron; Bertling, Maria – ETS Research Report Series, 2013
The purpose of this report is to describe a science competency model and 3 related learning progressions, which were developed by applying the "CBAL"™ approach (Bennett & Gitomer, 2009) to the domain of middle school science. The Cognitively Based Assessment "of", "for", and "as" Learning (CBAL) science…
Descriptors: Science Education, Competence, Models, Learning Processes
Zhang, Mo; Breyer, F. Jay; Lorenz, Florian – ETS Research Report Series, 2013
In this research, we investigated the suitability of implementing "e-rater"® automated essay scoring in a high-stakes large-scale English language testing program. We examined the effectiveness of generic scoring and 2 variants of prompt-based scoring approaches. Effectiveness was evaluated on a number of dimensions, including agreement…
Descriptors: Computer Assisted Testing, Computer Software, Scoring, Language Tests
Moses, Tim; Holland, Paul – ETS Research Report Series, 2008
This study addressed 2 issues of using loglinear models for smoothing univariate test score distributions and for enhancing the stability of equipercentile equating functions. One issue was a comparative assessment of several statistical strategies that have been proposed for selecting 1 from several competing model parameterizations. Another…
Descriptors: Equated Scores, Selection, Models, Statistical Analysis
DeCarlo, Lawrence T. – ETS Research Report Series, 2008
Rater behavior in essay grading can be viewed as a signal-detection task, in that raters attempt to discriminate between latent classes of essays, with the latent classes being defined by a scoring rubric. The present report examines basic aspects of an approach to constructed-response (CR) scoring via a latent-class signal-detection model. The…
Descriptors: Scoring, Responses, Test Format, Bias
Almond, Russell G. – ETS Research Report Series, 2007
Over the course of instruction, instructors generally collect a great deal of information about each student. Integrating that information intelligently requires models for how a student's proficiency changes over time. Armed with such models, instructors can "filter" the data--more accurately estimate the student's current proficiency…
Descriptors: Markov Processes, Decision Making, Student Evaluation, Learning Processes
Rotou, Ourania; Patsula, Liane; Steffen, Manfred; Rizavi, Saba – ETS Research Report Series, 2007
Traditionally, the fixed-length linear paper-and-pencil (P&P) mode of administration has been the standard method of test delivery. With the advancement of technology, however, the popularity of administering tests using adaptive methods like computerized adaptive testing (CAT) and multistage testing (MST) has grown in the field of measurement…
Descriptors: Comparative Analysis, Test Format, Computer Assisted Testing, Models
Almond, Russell G.; Mulder, Joris; Hemat, Lisa A.; Yan, Duanli – ETS Research Report Series, 2006
Bayesian network models offer a large degree of flexibility for modeling dependence among observables (item outcome variables) from the same task that may be dependent. This paper explores four design patterns for modeling locally dependent observations from the same task: (1) No context--Ignore dependence among observables; (2) Compensatory…
Descriptors: Bayesian Statistics, Networks, Models, Design
Shute, Valerie J.; Ventura, Matthew; Bauer, Malcolm; Zapata-Rivera, Diego – ETS Research Report Series, 2008
To reveal what is being learned during the gaming experience, this report proposes an approach for embedding assessments in immersive games, drawing on recent advances in assessment design. Key to this approach are formative assessment to guide instructional experiences and evidence-centered design to systematically analyze the assessment argument…
Descriptors: Educational Games, Formative Evaluation, Instructional Design, Evidence Based Practice
Hartz, Sarah; Roussos, Louis – ETS Research Report Series, 2008
This paper presents the development of the fusion model skills diagnosis system (fusion model system), which can help integrate standardized testing into the learning process with both skills-level examinee parameters for modeling examinee skill mastery and skills-level item parameters, giving information about the diagnostic power of the test.…
Descriptors: Skill Development, Educational Diagnosis, Theory Practice Relationship, Standardized Tests
Xu, Xueli; von Davier, Matthias – ETS Research Report Series, 2006
More than a dozen statistical models have been developed for the purpose of cognitive diagnosis. These models are supposed to extract a much finer level of information from item responses than traditional unidimensional item response models. In this paper, a general diagnostic model (GDM) was used to analyze a set of simulated sparse data and real…
Descriptors: Statistical Analysis, National Competency Tests, Diagnostic Tests, Item Response Theory