Okan Bulut; Guher Gorgun; Hacer Karamese – Journal of Educational Measurement, 2025
The use of multistage adaptive testing (MST) has gradually increased in large-scale testing programs as MST achieves a balanced compromise between linear test design and item-level adaptive testing. MST works on the premise that each examinee gives their best effort when attempting the items, and their responses truly reflect what they know or can…
Descriptors: Response Style (Tests), Testing Problems, Testing Accommodations, Measurement
Jones, Paul; Tong, Ye; Liu, Jinghua; Borglum, Joshua; Primoli, Vince – Journal of Educational Measurement, 2022
This article studied two methods to detect mode effects in two credentialing exams. In Study 1, we used a "modal scale comparison approach," where the same pool of items was calibrated separately, without transformation, within two TC cohorts (TC1 and TC2) and one OP cohort (OP1) matched on their pool-based scale score distributions. The…
Descriptors: Scores, Credentials, Licensing Examinations (Professions), Computer Assisted Testing
Baldwin, Peter; Clauser, Brian E. – Journal of Educational Measurement, 2022
While score comparability across test forms typically relies on common (or randomly equivalent) examinees or items, innovations in item formats, test delivery, and efforts to extend the range of score interpretation may require a special data collection before examinees or items can be used in this way--or may be incompatible with common examinee…
Descriptors: Scoring, Testing, Test Items, Test Format
Liu, Shuchang; Cai, Yan; Tu, Dongbo – Journal of Educational Measurement, 2018
This study applied on-the-fly assembled multistage adaptive testing to cognitive diagnosis (CD-OMST). Several module assembly methods for CD-OMST were proposed and compared in terms of measurement precision, test security, and constraint management. The module assembly methods in the study included the maximum priority index…
Descriptors: Adaptive Testing, Monte Carlo Methods, Computer Security, Clinical Diagnosis
Wang, Shiyu; Lin, Haiyan; Chang, Hua-Hua; Douglas, Jeff – Journal of Educational Measurement, 2016
Computerized adaptive testing (CAT) and multistage testing (MST) have become two of the most popular modes in large-scale computer-based sequential testing. Though most designs of CAT and MST have shown both strengths and weaknesses in recent large-scale implementations, there is no simple answer to the question of which design is better because different…
Descriptors: Computer Assisted Testing, Adaptive Testing, Test Format, Sequential Approach
Li, Jie; van der Linden, Wim J. – Journal of Educational Measurement, 2018
The final step of the typical process of developing educational and psychological tests is to place the selected test items into a formatted test form. This step involves the grouping and ordering of the items to meet a variety of formatting constraints. As this activity tends to be time-intensive, the use of mixed-integer programming (MIP) has been…
Descriptors: Programming, Automation, Test Items, Test Format
Kim, Sooyeon; Walker, Michael E.; McHale, Frederick – Journal of Educational Measurement, 2010
In this study we examined variations of the nonequivalent groups equating design for tests containing both multiple-choice (MC) and constructed-response (CR) items to determine which design was most effective in producing equivalent scores across the two tests to be equated. Using data from a large-scale exam, this study investigated the use of…
Descriptors: Measures (Individuals), Scoring, Equated Scores, Test Bias
Albano, Anthony D. – Journal of Educational Measurement, 2013
In many testing programs it is assumed that the context or position in which an item is administered does not have a differential effect on examinee responses to the item. Violations of this assumption may bias item response theory estimates of item and person parameters. This study examines the potentially biasing effects of item position. A…
Descriptors: Test Items, Item Response Theory, Test Format, Questioning Techniques
van der Linden, Wim J.; Diao, Qi – Journal of Educational Measurement, 2011
In automated test assembly (ATA), the methodology of mixed-integer programming is used to select test items from an item bank to meet the specifications for a desired test form and optimize its measurement accuracy. The same methodology can be used to automate the formatting of the set of selected items into the actual test form. Three different…
Descriptors: Test Items, Test Format, Test Construction, Item Banks
van der Ark, L. Andries; Emons, Wilco H. M.; Sijtsma, Klaas – Journal of Educational Measurement, 2008
Two types of answer-copying statistics for detecting copiers in small-scale examinations are proposed. One statistic identifies the "copier-source" pair, and the other in addition suggests who is copier and who is source. Both types of statistics can be used when the examination has alternate test forms. A simulation study shows that the…
Descriptors: Cheating, Statistics, Test Format, Measures (Individuals)

Lunz, Mary E.; Bergstrom, Betty A. – Journal of Educational Measurement, 1994
The impact of computerized adaptive test (CAT) administration formats on student performance was studied with 645 medical technology students who also took a paper-and-pencil test. Analysis of covariance indicates no significant interactions among test administration formats and provides evidence for adjusting CAT tests to more familiar modalities.…
Descriptors: Academic Achievement, Adaptive Testing, Analysis of Covariance, Computer Assisted Testing

Wise, Steven L.; And Others – Journal of Educational Measurement, 1992
Performance of 156 undergraduate and 48 graduate students on a self-adapted test (SFAT)--students choose the difficulty level of their test items--was compared with performance on a computer-adapted test (CAT). Those taking the SFAT obtained higher ability scores and reported lower posttest state anxiety than did CAT takers. (SLD)
Descriptors: Adaptive Testing, Comparative Testing, Computer Assisted Testing, Difficulty Level

Spray, Judith A.; And Others – Journal of Educational Measurement, 1989
Findings of studies examining the effects of presentation media on item characteristics are reviewed. The effect of medium of presentation independent of adaptive methodology in computer-assisted testing was studied through tests of 763 Marine trainees. Conditions necessary for score equivalence between item presentation media are described. (SLD)
Descriptors: Adaptive Testing, Comparative Analysis, Computer Assisted Testing, Military Personnel

Stricker, Lawrence J. – Journal of Educational Measurement, 1991
To study whether different forms of the Scholastic Aptitude Test (SAT) used since the mid-1970s varied in their correlations with academic performance criteria, 1975 and 1985 forms were administered to 1,554 and 1,753 high school juniors, respectively. The 1975 form did not have greater validity than the 1985 form. (SLD)
Descriptors: Class Rank, College Entrance Examinations, Comparative Testing, Correlation

Bridgeman, Brent – Journal of Educational Measurement, 1992
Examinees in a regular administration of the quantitative portion of the Graduate Record Examination responded to particular items in a machine-scannable multiple-choice format. Volunteers (n=364) used a computer to answer open-ended counterparts of these items. Scores for both formats demonstrated similar correlational patterns. (SLD)
Descriptors: Answer Sheets, College Entrance Examinations, College Students, Comparative Testing