ERIC - Search Results

Showing 1 to 15 of 31 results

Device Comparability of Tablets and Computers for Assessment Purposes

Peer reviewed

Davis, Laurie Laughlin; Kong, Xiaojing; McBride, Yuanyuan; Morrison, Kristin M. – Applied Measurement in Education, 2017

The definition of what it means to take a test online continues to evolve with the inclusion of a broader range of item types and a wide array of devices used by students to access test content. To assure the validity and reliability of test scores for all students, device comparability research should be conducted to evaluate the impact of…

Descriptors: Educational Technology, Technology Uses in Education, High School Students, Tests

Comparing Human and Automated Essay Scoring for Prospective Graduate Students with Learning Disabilities and/or ADHD

Peer reviewed

Buzick, Heather; Oliveri, Maria Elena; Attali, Yigal; Flor, Michael – Applied Measurement in Education, 2016

Automated essay scoring is a developing technology that can provide efficient scoring of large numbers of written responses. Its use in higher education admissions testing provides an opportunity to collect validity and fairness evidence to support current uses and inform its emergence in other areas such as K-12 large-scale assessment. In this…

Descriptors: Essays, Learning Disabilities, Attention Deficit Hyperactivity Disorder, Scoring

Framing Appropriate Accommodations in Terms of Individual Need: Examining the Fit of Four Approaches to Selecting Test Accommodations of English Language Learners

Peer reviewed

Koran, Jennifer; Kopriva, Rebecca J. – Applied Measurement in Education, 2017

Providing appropriate test accommodations to most English language learners (ELLs) is important to facilitate meaningful inferences about learning. This study compared teacher large-scale test accommodation recommendations to those from a literature- and practitioner-grounded accommodation selection taxonomy. The taxonomy links student-specific…

Descriptors: English Language Learners, Testing Accommodations, Comparative Analysis, Taxonomy

Are the Nonparametric Person-Fit Statistics More Powerful than Their Parametric Counterparts? Revisiting the Simulations in Karabatsos (2003)

Peer reviewed

Sinharay, Sandip – Applied Measurement in Education, 2017

Karabatsos compared the power of 36 person-fit statistics using receiver operating characteristics curves and found the "H^T" statistic to be the most powerful in identifying aberrant examinees. He found three statistics, "C", "MCI", and "U3", to be the next most powerful. These four statistics,…

Descriptors: Nonparametric Statistics, Goodness of Fit, Simulation, Comparative Analysis

The Impact of Personality and Test Conditions on Mathematical Test Performance

Peer reviewed

Hayes, Heather; Embretson, Susan E. – Applied Measurement in Education, 2013

Online and on-demand tests are increasingly used in assessment. Although the main focus has been cheating and test security (e.g., Selwyn, 2008), the cross-setting equivalence of scores as a function of contrasting test conditions is also an issue that warrants attention. In this study, the impact of environmental and cognitive distractions, as…

Descriptors: College Students, Computer Assisted Testing, Problem Solving, Physical Environment

The Comparability of Scores from Different Digital Devices: A Literature Review and Synthesis with Recommendations for Practice

Peer reviewed

Dadey, Nathan; Lyons, Susan; DePascale, Charles – Applied Measurement in Education, 2018

Evidence of comparability is generally needed whenever there are variations in the conditions of an assessment administration, including variations introduced by the administration of an assessment on multiple digital devices (e.g., tablet, laptop, desktop). This article is meant to provide a comprehensive examination of issues relevant to the…

Descriptors: Evaluation Methods, Computer Assisted Testing, Educational Technology, Technology Uses in Education

Item Selection and Ability Estimation Procedures for a Mixed-Format Adaptive Test

Peer reviewed

Ho, Tsung-Han; Dodd, Barbara G. – Applied Measurement in Education, 2012

In this study we compared five item selection procedures using three ability estimation methods in the context of a mixed-format adaptive test based on the generalized partial credit model. The item selection procedures used were maximum posterior weighted information, maximum expected information, maximum posterior weighted Kullback-Leibler…

Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Selection

Response Time Differences between Computers and Tablets

Peer reviewed

Kong, Xiaojing; Davis, Laurie Laughlin; McBride, Yuanyuan; Morrison, Kristin – Applied Measurement in Education, 2018

Item response time data were used in investigating the differences in student test-taking behavior between two device conditions: computer and tablet. Analyses were conducted to address the questions of whether or not the device condition had a differential impact on rapid guessing and solution behaviors (with response time effort used as an…

Descriptors: Educational Technology, Technology Uses in Education, Computers, Handheld Devices

Assessment of Complex Problem Solving: What We Know and What We Don't Know

Peer reviewed

Herde, Christoph Nils; Wüstenberg, Sascha; Greiff, Samuel – Applied Measurement in Education, 2016

Complex Problem Solving (CPS) is seen as a cross-curricular 21st century skill that has attracted interest in large-scale-assessments. In the Programme for International Student Assessment (PISA) 2012, CPS was assessed all over the world to gain information on students' skills to acquire and apply knowledge while dealing with nontransparent…

Descriptors: Problem Solving, Achievement Tests, Foreign Countries, International Assessment

The Utility of Augmented Subscores in a Licensure Exam: An Evaluation of Methods Using Empirical Data

Peer reviewed

Puhan, Gautam; Sinharay, Sandip; Haberman, Shelby; Larkin, Kevin – Applied Measurement in Education, 2010

Will subscores provide additional information than what is provided by the total score? Is there a method that can estimate more trustworthy subscores than observed subscores? To answer the first question, this study evaluated whether the true subscore was more accurately predicted by the observed subscore or total score. To answer the second…

Descriptors: Licensing Examinations (Professions), Scores, Computation, Methods

Mathematics Performance of Students with and without Disabilities under Accommodated Conditions Using Resource Guides and Calculators on High Stakes Tests

Peer reviewed

Engelhard, George, Jr.; Fincher, Melissa; Domaleski, Christopher S. – Applied Measurement in Education, 2011

This study examines the effects of two test administration accommodations on the mathematics performance of students within the context of a large-scale statewide assessment. The two test administration accommodations were resource guides and calculators. A stratified random sample of schools was selected to represent the demographic…

Descriptors: Testing Accommodations, Disabilities, High Stakes Tests, Program Effectiveness

Comparability of Computer- and Paper-Administered Multiple-Choice Tests for K-12 Populations: A Synthesis

Peer reviewed

Kingston, Neal M. – Applied Measurement in Education, 2009

There have been many studies of the comparability of computer-administered and paper-administered tests. Not surprisingly (given the variety of measurement and statistical sampling issues that can affect any one study) the results of such studies have not always been consistent. Moreover, the quality of computer-based test administration systems…

Descriptors: Multiple Choice Tests, Computer Assisted Testing, Printed Materials, Effect Size

A Comparison of IRT Linking Procedures

Peer reviewed

Lee, Won-Chan; Ban, Jae-Chun – Applied Measurement in Education, 2010

Various applications of item response theory often require linking to achieve a common scale for item parameter estimates obtained from different groups. This article used a simulation to examine the relative performance of four different item response theory (IRT) linking procedures in a random groups equating design: concurrent calibration with…

Descriptors: Item Response Theory, Simulation, Comparative Analysis, Measurement Techniques

Examining Equivalence of Accommodations on a Statewide Elementary-Level Science Test

Peer reviewed

Kim, Do-Hong; Schneider, Christina; Siskind, Theresa – Applied Measurement in Education, 2009

This study examined the extent to which the underlying factor structure of the 2005 South Carolina Palmetto Achievement Challenge Tests (PACT) in science for grades 3, 4, and 5 was equivalent for students who were administered the test in a regular (standard) or accommodated form. Three accommodation groups were of interest: students who received…

Descriptors: Testing Accommodations, Science Tests, Elementary School Science, Measurement

Item-Level Comparative Analysis of Online and Paper Administrations of the Texas Assessment of Knowledge and Skills

Peer reviewed

Keng, Leslie; McClarty, Katie Larsen; Davis, Laurie Laughlin – Applied Measurement in Education, 2008

This article describes a comparative study conducted at the item level for paper and online administrations of a statewide high stakes assessment. The goal was to identify characteristics of items that may have contributed to mode effects. Item-level analyses compared two modes of the Texas Assessment of Knowledge and Skills (TAKS) for up to four…

Descriptors: Computer Assisted Testing, Geometric Concepts, Grade 8, Comparative Analysis
