ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	7
Since 2006 (last 20 years)	20

Descriptor

Comparative Analysis	21
Computer Assisted Testing	21
Measurement	21
Test Items	9
Adaptive Testing	7
Foreign Countries	6
Item Response Theory	6
Correlation	5
Simulation	5
Accuracy	4
Item Analysis	4
Item Banks	4
Models	4
Scoring	4
Achievement Tests	3
Educational Technology	3
Educational Testing	3
Evaluation Methods	3
Mathematics Tests	3
Program Effectiveness	3
Psychometrics	3
Reaction Time	3
Scores	3
Secondary School Students	3
Test Construction	3
More ▼

Source

Educational and Psychological…	5
Applied Psychological…	2
Assessing Writing	2
Journal of Educational…	2
Applied Measurement in…	1
Assessment & Evaluation in…	1
Australian Journal of…	1
Educational Sciences: Theory…	1
Journal of Applied Testing…	1
Journal of Intelligence	1
Journal of Technology,…	1
Large-scale Assessments in…	1
Pearson	1
Routledge, Taylor & Francis…	1
More ▼

Publication Type

Journal Articles	19
Reports - Research	14
Reports - Evaluative	6
Books	1
Information Analyses	1
Speeches/Meeting Papers	1
Tests/Questionnaires	1

Education Level

Secondary Education	6
Elementary Secondary Education	4
Higher Education	4
Postsecondary Education	3
Elementary Education	2
High Schools	2
Grade 7	1
Junior High Schools	1
Middle Schools	1

Audience

Practitioners	1
Researchers	1
Students	1

Location

Germany	2
Kansas	1
Maryland	1
Portugal	1
Taiwan	1

Laws, Policies, & Programs

Assessments and Surveys

Program for International…

What Works Clearinghouse Rating

Showing 1 to 15 of 21 results Save | Export

Comparing Different Response Time Threshold Setting Methods to Detect Low Effort on a Large-Scale Assessment

Peer reviewed

Direct link

Soland, James; Kuhfeld, Megan; Rios, Joseph – Large-scale Assessments in Education, 2021

Low examinee effort is a major threat to valid uses of many test scores. Fortunately, several methods have been developed to detect noneffortful item responses, most of which use response times. To accurately identify noneffortful responses, one must set response time thresholds separating those responses from effortful ones. While other studies…

Descriptors: Reaction Time, Measurement, Response Style (Tests), Reading Tests

A Top-Down Approach to Designing the Computerized Adaptive Multistage Test

Peer reviewed

Direct link

Luo, Xiao; Kim, Doyoung – Journal of Educational Measurement, 2018

The top-down approach to designing a multistage test is relatively understudied in the literature and underused in research and practice. This study introduced a route-based top-down design approach that directly sets design parameters at the test level and utilizes the advanced automated test assembly algorithm seeking global optimality. The…

Descriptors: Computer Assisted Testing, Test Construction, Decision Making, Simulation

Binding Costs in Processing Efficiency as Determinants of Cognitive Ability

Peer reviewed
PDF on ERIC

Download full text

Goecke, Benjamin; Schmitz, Florian; Wilhelm, Oliver – Journal of Intelligence, 2021

Performance in elementary cognitive tasks is moderately correlated with fluid intelligence and working memory capacity. These correlations are higher for more complex tasks, presumably due to increased demands on working memory capacity. In accordance with the binding hypothesis, which states that working memory capacity reflects the limit of a…

Descriptors: Intelligence, Cognitive Processes, Short Term Memory, Reaction Time

Comparative Analysis of Student Performance in Collaborative Problem Solving: What Does It Tell Us?

Peer reviewed

Direct link

Scoular, Claire; Eleftheriadou, Sofia; Ramalingam, Dara; Cloney, Dan – Australian Journal of Education, 2020

Collaboration is a complex skill, comprised of multiple subskills, that is of growing interest to policy makers, educators and researchers. Several definitions and frameworks have been described in the literature to support assessment of collaboration; however, the inherent structure of the construct still needs better definition. In 2015, the…

Descriptors: Cooperative Learning, Problem Solving, Computer Assisted Testing, Comparative Analysis

The Development of MST Test Information for the Prediction of Test Performances

Peer reviewed

Direct link

Park, Ryoungsun; Kim, Jiseon; Chung, Hyewon; Dodd, Barbara G. – Educational and Psychological Measurement, 2017

The current study proposes novel methods to predict multistage testing (MST) performance without conducting simulations. This method, called MST test information, is based on analytic derivation of standard errors of ability estimates across theta levels. We compared standard errors derived analytically to the simulation results to demonstrate the…

Descriptors: Testing, Performance, Prediction, Error of Measurement

Investigating Item Exposure Control Methods in Computerized Adaptive Testing

Peer reviewed
PDF on ERIC

Download full text

Ozturk, Nagihan Boztunc; Dogan, Nuri – Educational Sciences: Theory and Practice, 2015

This study aims to investigate the effects of item exposure control methods on measurement precision and on test security under various item selection methods and item pool characteristics. In this study, the Randomesque (with item group sizes of 5 and 10), Sympson-Hetter, and Fade-Away methods were used as item exposure control methods. Moreover,…

Descriptors: Computer Assisted Testing, Item Analysis, Statistical Analysis, Comparative Analysis

On the Issue of Item Selection in Computerized Adaptive Testing with Response Times

Peer reviewed

Direct link

Veldkamp, Bernard P. – Journal of Educational Measurement, 2016

Many standardized tests are now administered via computer rather than paper-and-pencil format. The computer-based delivery mode brings with it certain advantages. One advantage is the ability to adapt the difficulty level of the test to the ability level of the test taker in what has been termed computerized adaptive testing (CAT). A second…

Descriptors: Computer Assisted Testing, Reaction Time, Standardized Tests, Difficulty Level

Assessment of Complex Problem Solving: What We Know and What We Don't Know

Peer reviewed

Direct link

Herde, Christoph Nils; Wüstenberg, Sascha; Greiff, Samuel – Applied Measurement in Education, 2016

Complex Problem Solving (CPS) is seen as a cross-curricular 21st century skill that has attracted interest in large-scale-assessments. In the Programme for International Student Assessment (PISA) 2012, CPS was assessed all over the world to gain information on students' skills to acquire and apply knowledge while dealing with nontransparent…

Descriptors: Problem Solving, Achievement Tests, Foreign Countries, International Assessment

Comparison of Exposure Controls, Item Pool Characteristics, and Population Distributions for CAT Using the Partial Credit Model

Peer reviewed

Direct link

Lee, HwaYoung; Dodd, Barbara G. – Educational and Psychological Measurement, 2012

This study investigated item exposure control procedures under various combinations of item pool characteristics and ability distributions in computerized adaptive testing based on the partial credit model. Three variables were manipulated: item pool characteristics (120 items for each of easy, medium, and hard item pools), two ability…

Descriptors: Adaptive Testing, Computer Assisted Testing, Item Banks, Ability

Balancing Flexible Constraints and Measurement Precision in Computerized Adaptive Testing

Peer reviewed

Direct link

Moyer, Eric L.; Galindo, Jennifer L.; Dodd, Barbara G. – Educational and Psychological Measurement, 2012

Managing test specifications--both multiple nonstatistical constraints and flexibly defined constraints--has become an important part of designing item selection procedures for computerized adaptive tests (CATs) in achievement testing. This study compared the effectiveness of three procedures: constrained CAT, flexible modified constrained CAT,…

Descriptors: Adaptive Testing, Computer Assisted Testing, Test Items, Item Analysis

Validating Automated Essay Scoring for Online Writing Placement

Peer reviewed

Direct link

Ramineni, Chaitanya – Assessing Writing, 2013

In this paper, I describe the design and evaluation of automated essay scoring (AES) models for an institution's writing placement program. Information was gathered on admitted student writing performance at a science and technology research university in the northeastern United States. Under timed conditions, first-year students (N = 879) were…

Descriptors: Validity, Comparative Analysis, Internet, Student Placement

Large-Scale Assessment, Locally-Developed Measures, and Automated Scoring of Essays: Fishing for Red Herrings?

Peer reviewed

Direct link

Condon, William – Assessing Writing, 2013

Automated Essay Scoring (AES) has garnered a great deal of attention from the rhetoric and composition/writing studies community since the Educational Testing Service began using e-rater[R] and the "Criterion"[R] Online Writing Evaluation Service as products in scoring writing tests, and most of the responses have been negative. While the…

Descriptors: Measurement, Psychometrics, Evaluation Methods, Educational Testing

Hypothetical Use of Multidimensional Adaptive Testing for the Assessment of Student Achievement in the Programme for International Student Assessment

Peer reviewed

Direct link

Frey, Andreas; Seitz, Nicki-Nils – Educational and Psychological Measurement, 2011

The usefulness of multidimensional adaptive testing (MAT) for the assessment of student literacy in the Programme for International Student Assessment (PISA) was examined within a real data simulation study. The responses of N = 14,624 students who participated in the PISA assessments of the years 2000, 2003, and 2006 in Germany were used to…

Descriptors: Adaptive Testing, Literacy, Academic Achievement, Achievement Tests

A Comparison of Three Content Balancing Methods for Fixed and Variable Length Computerized Adaptive Tests

Direct link

Shin, Chingwei David; Chien, Yuehmei; Way, Walter Denny – Pearson, 2012

Content balancing is one of the most important components in the computerized adaptive testing (CAT) especially in the K to 12 large scale tests that complex constraint structure is required to cover a broad spectrum of content. The purpose of this study is to compare the weighted penalty model (WPM) and the weighted deviation method (WDM) under…

Descriptors: Computer Assisted Testing, Elementary Secondary Education, Test Content, Models

An Empirical Evaluation of the Slip Correction in the Four Parameter Logistic Models with Computerized Adaptive Testing

Peer reviewed

Direct link

Yen, Yung-Chin; Ho, Rong-Guey; Laio, Wen-Wei; Chen, Li-Ju; Kuo, Ching-Chin – Applied Psychological Measurement, 2012

In a selected response test, aberrant responses such as careless errors and lucky guesses might cause error in ability estimation because these responses do not actually reflect the knowledge that examinees possess. In a computerized adaptive test (CAT), these aberrant responses could further cause serious estimation error due to dynamic item…

Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Response Style (Tests)

Previous Page | Next Page »

Pages: 1 | 2

Dodd, Barbara G.	3
Chang, Hua-Hua	1
Chen, Li-Ju	1
Cheng, Ying	1
Chien, Yuehmei	1
Chung, Hyewon	1
Cloney, Dan	1
Condon, William	1
Dogan, Nuri	1
Douglas, Jeffrey	1
Eleftheriadou, Sofia	1
Ferrao, Maria	1
Finkelman, Matthew D.	1
Frey, Andreas	1
Galindo, Jennifer L.	1
Glasnapp, Douglas R.	1
Goecke, Benjamin	1
Greiff, Samuel	1
Guo, Fanmin	1
Herde, Christoph Nils	1
Ho, Rong-Guey	1
Hou, Xiaodong	1
Kim, Doyoung	1
Kim, Jiseon	1
Kim-Kang, Gyenam	1
More ▼