Dadey, Nathan; Lyons, Susan; DePascale, Charles – Applied Measurement in Education, 2018
Evidence of comparability is generally needed whenever there are variations in the conditions of an assessment administration, including variations introduced by the administration of an assessment on multiple digital devices (e.g., tablet, laptop, desktop). This article is meant to provide a comprehensive examination of issues relevant to the…
Descriptors: Evaluation Methods, Computer Assisted Testing, Educational Technology, Technology Uses in Education

Davis, Laurie Laughlin; Kong, Xiaojing; McBride, Yuanyuan; Morrison, Kristin M. – Applied Measurement in Education, 2017
The definition of what it means to take a test online continues to evolve with the inclusion of a broader range of item types and a wide array of devices used by students to access test content. To assure the validity and reliability of test scores for all students, device comparability research should be conducted to evaluate the impact of…
Descriptors: Educational Technology, Technology Uses in Education, High School Students, Tests

Buzick, Heather; Oliveri, Maria Elena; Attali, Yigal; Flor, Michael – Applied Measurement in Education, 2016
Automated essay scoring is a developing technology that can provide efficient scoring of large numbers of written responses. Its use in higher education admissions testing provides an opportunity to collect validity and fairness evidence to support current uses and inform its emergence in other areas such as K-12 large-scale assessment. In this…
Descriptors: Essays, Learning Disabilities, Attention Deficit Hyperactivity Disorder, Scoring

Herde, Christoph Nils; Wüstenberg, Sascha; Greiff, Samuel – Applied Measurement in Education, 2016
Complex Problem Solving (CPS) is seen as a cross-curricular 21st century skill that has attracted interest in large-scale assessments. In the Programme for International Student Assessment (PISA) 2012, CPS was assessed all over the world to gain information on students' skills to acquire and apply knowledge while dealing with nontransparent…
Descriptors: Problem Solving, Achievement Tests, Foreign Countries, International Assessment

Ho, Tsung-Han; Dodd, Barbara G. – Applied Measurement in Education, 2012
In this study we compared five item selection procedures using three ability estimation methods in the context of a mixed-format adaptive test based on the generalized partial credit model. The item selection procedures used were maximum posterior weighted information, maximum expected information, maximum posterior weighted Kullback-Leibler…
Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Selection

Hayes, Heather; Embretson, Susan E. – Applied Measurement in Education, 2013
Online and on-demand tests are increasingly used in assessment. Although the main focus has been cheating and test security (e.g., Selwyn, 2008) the cross-setting equivalence of scores as a function of contrasting test conditions is also an issue that warrants attention. In this study, the impact of environmental and cognitive distractions, as…
Descriptors: College Students, Computer Assisted Testing, Problem Solving, Physical Environment

Kingston, Neal M. – Applied Measurement in Education, 2009
There have been many studies of the comparability of computer-administered and paper-administered tests. Not surprisingly (given the variety of measurement and statistical sampling issues that can affect any one study) the results of such studies have not always been consistent. Moreover, the quality of computer-based test administration systems…
Descriptors: Multiple Choice Tests, Computer Assisted Testing, Printed Materials, Effect Size

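Mode-comparability results in syntheses like Kingston's are typically expressed as a standardized mean difference (an effect size) between paper and computer administrations. A minimal sketch of that computation, using a pooled standard deviation and entirely hypothetical scores (not data from the study):

```python
from statistics import mean, stdev

def cohens_d(group_a, group_b):
    """Standardized mean difference with a pooled standard deviation."""
    na, nb = len(group_a), len(group_b)
    # Pool each group's variance, weighted by its degrees of freedom.
    pooled_var = ((na - 1) * stdev(group_a) ** 2 +
                  (nb - 1) * stdev(group_b) ** 2) / (na + nb - 2)
    return (mean(group_a) - mean(group_b)) / pooled_var ** 0.5

# Hypothetical computer-mode vs. paper-mode score samples.
computer = [54, 58, 47, 62, 59, 53]
paper = [52, 55, 48, 60, 57, 50]
d = cohens_d(computer, paper)
```

A near-zero d across many such studies is what supports a claim of mode comparability; meta-analyses then pool these per-study values.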
Keng, Leslie; McClarty, Katie Larsen; Davis, Laurie Laughlin – Applied Measurement in Education, 2008
This article describes a comparative study conducted at the item level for paper and online administrations of a statewide high stakes assessment. The goal was to identify characteristics of items that may have contributed to mode effects. Item-level analyses compared two modes of the Texas Assessment of Knowledge and Skills (TAKS) for up to four…
Descriptors: Computer Assisted Testing, Geometric Concepts, Grade 8, Comparative Analysis

Penfield, Randall D. – Applied Measurement in Education, 2006
This study applied the maximum expected information (MEI) and the maximum posterior-weighted information (MPI) approaches of computer adaptive testing item selection to the case of a test using polytomous items following the partial credit model. The MEI and MPI approaches are described. A simulation study compared the efficiency of ability…
Descriptors: Bayesian Statistics, Adaptive Testing, Computer Assisted Testing, Test Items
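Selection approaches like MEI and MPI weight an item's information around the current ability estimate. As a simpler baseline, maximum Fisher information selection under the partial credit model can be sketched as follows (illustrative item thresholds; this is plain maximum-information selection, not the MEI or MPI criterion itself):

```python
import math

def pcm_probs(theta, thresholds):
    """Category response probabilities under the partial credit model."""
    # Cumulative sums of (theta - b_k); category 0 has an empty sum.
    cums = [0.0]
    for b in thresholds:
        cums.append(cums[-1] + (theta - b))
    expcums = [math.exp(c) for c in cums]
    total = sum(expcums)
    return [e / total for e in expcums]

def pcm_information(theta, thresholds):
    """Fisher information: the variance of the category score at theta."""
    probs = pcm_probs(theta, thresholds)
    ex = sum(x * p for x, p in enumerate(probs))
    ex2 = sum(x * x * p for x, p in enumerate(probs))
    return ex2 - ex ** 2

def select_item(theta, item_bank):
    """Pick the item whose information is largest at the current theta."""
    return max(item_bank, key=lambda th: pcm_information(theta, th))

# Hypothetical bank: each item is a list of step thresholds.
bank = [[-1.0, 0.5], [0.0, 1.0], [-0.5, 0.0, 0.5]]
chosen = select_item(0.0, bank)
```

MEI and MPI replace the point evaluation at theta with an expectation over the posterior of theta, which is what the studies above compare.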

Kingsbury, G. Gage; Zara, Anthony R. – Applied Measurement in Education, 1991
This simulation investigated two procedures that reduce differences between paper-and-pencil testing and computerized adaptive testing (CAT) by making CAT content sensitive. Results indicate that the price in terms of additional test items of using constrained CAT for content balancing is much smaller than that of using testlets. (SLD)
Descriptors: Adaptive Testing, Comparative Analysis, Computer Assisted Testing, Computer Simulation

Du, Yi; And Others – Applied Measurement in Education, 1993
A new computerized mastery test is described that builds on the Lewis and Sheehan procedure (sequential testlets) (1990), but uses fuzzy set decision theory to determine stopping rules and the Rasch model to calibrate items and estimate abilities. Differences between fuzzy set and Bayesian methods are illustrated through an example. (SLD)
Descriptors: Bayesian Statistics, Comparative Analysis, Computer Assisted Testing, Estimation (Mathematics)

Clauser, Brian E.; Swanson, David B.; Clyman, Stephen G. – Applied Measurement in Education, 1999
Performed generalizability analyses of expert ratings and computer-produced scores for a computer-delivered performance assessment of physicians' patient management skills. The two automated scoring systems produced scores for the 200 medical students that were approximately as generalizable as those produced by the four expert raters. (SLD)
Descriptors: Comparative Analysis, Computer Assisted Testing, Generalizability Theory, Higher Education
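A generalizability analysis of this kind asks how much score variance is attributable to examinees rather than to raters or noise. A minimal one-facet sketch for a fully crossed persons-by-raters design with hypothetical ratings (the study's actual design was more complex than this):

```python
def g_coefficient(scores):
    """Generalizability (G) coefficient for a fully crossed
    persons x raters design with one observation per cell."""
    n_p, n_r = len(scores), len(scores[0])
    grand = sum(map(sum, scores)) / (n_p * n_r)
    person_means = [sum(row) / n_r for row in scores]
    rater_means = [sum(row[r] for row in scores) / n_p for r in range(n_r)]
    # Sums of squares from the two-way ANOVA decomposition.
    ss_p = n_r * sum((m - grand) ** 2 for m in person_means)
    ss_r = n_p * sum((m - grand) ** 2 for m in rater_means)
    ss_tot = sum((x - grand) ** 2 for row in scores for x in row)
    ss_res = ss_tot - ss_p - ss_r
    ms_p = ss_p / (n_p - 1)
    ms_res = ss_res / ((n_p - 1) * (n_r - 1))
    # Estimated person variance component (truncated at zero).
    var_p = max((ms_p - ms_res) / n_r, 0.0)
    error = ms_res / n_r  # relative error variance for n_r raters
    return var_p / (var_p + error) if var_p + error > 0 else 0.0

# Hypothetical ratings: 3 examinees each scored by the same 2 raters.
rho = g_coefficient([[3, 4], [4, 3], [5, 5]])
```

Comparing such coefficients for automated scores versus expert ratings is the kind of evidence the abstract describes.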

De Ayala, R. J.; And Others – Applied Measurement in Education, 1992
A study involving 1,000 simulated examinees compared the partial credit and graded response models in computerized adaptive testing (CAT). The graded response model fit the data well and provided slightly more accurate ability estimates than those of the partial credit model. Benefits of polytomous model-based CATs are discussed. (SLD)
Descriptors: Adaptive Testing, Comparative Analysis, Computer Assisted Testing, Computer Simulation
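The graded response model compared above builds category probabilities from differences of cumulative boundary curves (the partial credit model instead works from adjacent-category odds). A minimal sketch of the graded response side, with illustrative discrimination and boundary parameters:

```python
import math

def logistic(z):
    return 1.0 / (1.0 + math.exp(-z))

def grm_probs(theta, a, boundaries):
    """Category probabilities under Samejima's graded response model.

    boundaries: ordered difficulties b_1 <= ... <= b_m.
    Returns probabilities for categories 0..m.
    """
    # Cumulative probability of responding in category k or higher;
    # category 0 is certain (1.0), beyond the top category impossible (0.0).
    stars = [1.0] + [logistic(a * (theta - b)) for b in boundaries] + [0.0]
    # Each category probability is a difference of adjacent cumulatives.
    return [stars[k] - stars[k + 1] for k in range(len(boundaries) + 1)]

# Hypothetical 4-category item.
probs = grm_probs(0.0, a=1.2, boundaries=[-1.0, 0.0, 1.0])
```

Because the boundaries are ordered, the cumulative curves never cross, so every category probability stays nonnegative — a property the partial credit model achieves differently, via its step parameters.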

Ponsoda, Vicente; Olea, Julio; Rodriguez, Maria Soledad; Revuelta, Javier – Applied Measurement in Education, 1999
Compared easy and difficult versions of self-adapted tests (SAT) and computerized adapted tests. No significant differences were found among the tests for estimated ability or posttest state anxiety in studies with 187 Spanish high school students, although other significant differences were found. Discusses implications for interpreting test…
Descriptors: Ability, Adaptive Testing, Comparative Analysis, Computer Assisted Testing

Vispoel, Walter P.; And Others – Applied Measurement in Education, 1994
Vocabulary fixed-item (FIT), computerized-adaptive (CAT), and self-adapted (SAT) tests were compared with 121 college students. CAT was more precise and efficient than SAT, which was more precise and efficient than FIT. SAT also yielded higher ability estimates for individuals with lower verbal self-concepts. (SLD)
Descriptors: Ability, Adaptive Testing, College Students, Comparative Analysis