Laprise, Shari L. – College Teaching, 2012
Successful exam composition can be a difficult task. Exams should not only assess student comprehension but also be learning tools in and of themselves. In a biotechnology course delivered to nonmajors at a business college, objective multiple-choice test questions often require students to choose the exception or "not true" choice. Anecdotal student…
Descriptors: Feedback (Response), Test Items, Multiple Choice Tests, Biotechnology
Kim, Sooyeon; Walker, Michael E.; McHale, Frederick – Journal of Educational Measurement, 2010
In this study we examined variations of the nonequivalent groups equating design for tests containing both multiple-choice (MC) and constructed-response (CR) items to determine which design was most effective in producing equivalent scores across the two tests to be equated. Using data from a large-scale exam, this study investigated the use of…
Descriptors: Measures (Individuals), Scoring, Equated Scores, Test Bias
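The nonequivalent groups design referenced above links two test forms through a common anchor. As a rough illustration only (not the specific designs compared in the article), the sketch below shows chained linear equating through an anchor; all scores and group sizes are invented for the example.

```python
import numpy as np

def linear_link(scores_from, scores_to):
    """Map the 'from' scale onto the 'to' scale by matching means and
    standard deviations (linear equating within one group)."""
    mu_f, sd_f = np.mean(scores_from), np.std(scores_from, ddof=1)
    mu_t, sd_t = np.mean(scores_to), np.std(scores_to, ddof=1)
    return lambda x: mu_t + (sd_t / sd_f) * (x - mu_f)

# Hypothetical data: group 1 takes form X plus anchor V, group 2 takes form Y plus anchor V.
rng = np.random.default_rng(0)
x1, v1 = rng.normal(30, 6, 2000), rng.normal(15, 3, 2000)   # group 1
y2, v2 = rng.normal(28, 5, 2000), rng.normal(14, 3, 2000)   # group 2

# Chained linear equating: X -> V (group 1), then V -> Y (group 2).
x_to_v = linear_link(x1, v1)
v_to_y = linear_link(v2, y2)
equate_x_to_y = lambda x: v_to_y(x_to_v(x))

print(equate_x_to_y(35))  # form-Y equivalent of a form-X score of 35
```

Each link is estimated within a single group; composing the two links carries a form-X score onto the form-Y scale.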
Kato, Kentaro; Moen, Ross E.; Thurlow, Martha L. – Educational Measurement: Issues and Practice, 2009
Large data sets from a state reading assessment for third and fifth graders were analyzed to examine differential item functioning (DIF), differential distractor functioning (DDF), and differential omission frequency (DOF) between students with particular categories of disabilities (speech/language impairments, learning disabilities, and emotional…
Descriptors: Learning Disabilities, Language Impairments, Behavior Disorders, Affective Behavior
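Differential omission frequency (DOF) asks whether one group skips an item more often than another. The sketch below is a simplified screen based only on per-group omission counts; it is not the analysis reported in the article, and the counts are hypothetical.

```python
from math import sqrt, erf

def normal_cdf(z):
    """Standard normal CDF via the error function."""
    return 0.5 * (1.0 + erf(z / sqrt(2.0)))

def omission_rate_test(omit_focal, n_focal, omit_ref, n_ref):
    """Two-proportion z-test comparing item omission rates between a
    focal group and a reference group (a crude DOF screen)."""
    p_f, p_r = omit_focal / n_focal, omit_ref / n_ref
    p_pool = (omit_focal + omit_ref) / (n_focal + n_ref)
    se = sqrt(p_pool * (1 - p_pool) * (1 / n_focal + 1 / n_ref))
    z = (p_f - p_r) / se
    return p_f - p_r, 2.0 * (1.0 - normal_cdf(abs(z)))

# Hypothetical counts: 120 of 1,500 focal-group students omitted the item,
# versus 400 of 20,000 reference-group students.
diff, p = omission_rate_test(120, 1500, 400, 20000)
print(f"omission-rate difference = {diff:.3f}, two-sided p = {p:.4f}")
```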
Coe, Robert – Oxford Review of Education, 2008
The comparability of examinations in different subjects has been a controversial topic for many years and a number of criticisms have been made of statistical approaches to estimating the "difficulties" of achieving particular grades in different subjects. This paper argues that if comparability is understood in terms of a linking…
Descriptors: Test Items, Grades (Scholastic), Foreign Countries, Test Bias
Ferdous, Abdullah A.; Plake, Barbara S. – Educational and Psychological Measurement, 2007
In an Angoff standard setting procedure, judges estimate the probability that a hypothetical randomly selected minimally competent candidate will answer correctly each item in the test. In many cases, these item performance estimates are made twice, with information shared with the panelists between estimates. Especially for long tests, this…
Descriptors: Test Items, Probability, Item Analysis, Standard Setting (Scoring)
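In the Angoff procedure described above, each judge estimates the probability that a minimally competent candidate answers each item correctly; averaging across judges and summing over items yields a recommended cut score. A minimal sketch with hypothetical round-one ratings:

```python
import numpy as np

# Hypothetical round-1 Angoff ratings: rows = judges, columns = items.
# Each entry is the judged probability that a minimally competent
# candidate answers the item correctly.
round1 = np.array([
    [0.60, 0.75, 0.40, 0.90, 0.55],
    [0.65, 0.70, 0.35, 0.85, 0.60],
    [0.55, 0.80, 0.45, 0.95, 0.50],
])

def angoff_cut_score(ratings):
    """Cut score = sum over items of the mean judged probability."""
    return ratings.mean(axis=0).sum()

print(f"Recommended raw cut score: {angoff_cut_score(round1):.1f} of {round1.shape[1]} items")
```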
Wallach, P. M.; Crespo, L. M.; Holtzman, K. Z.; Galbraith, R. M.; Swanson, D. B. – Advances in Health Sciences Education, 2006
Purpose: In conjunction with curricular changes, a process to develop integrated examinations was implemented. Pre-established guidelines were provided favoring vignettes, clinically relevant material, and application of knowledge rather than simple recall. Questions were read aloud in a committee including all course directors, and a reviewer…
Descriptors: Test Items, Rating Scales, Examiners, Guidelines
Clauser, Brian E.; And Others – 1991
Item bias has been a major concern for test developers during recent years. The Mantel-Haenszel statistic has been among the preferred methods for identifying biased items. The statistic's performance in identifying uniform bias in simulated data modeled by producing various levels of difference in the (item difficulty) b-parameter for reference…
Descriptors: Comparative Testing, Difficulty Level, Item Bias, Item Response Theory
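The Mantel-Haenszel statistic mentioned above compares reference- and focal-group odds of answering an item correctly within matched score strata. A minimal sketch of the common odds ratio and its ETS delta transformation (MH D-DIF), using invented counts:

```python
from math import log

def mantel_haenszel_dif(ref_correct, ref_total, foc_correct, foc_total):
    """Mantel-Haenszel common odds ratio and MH D-DIF for one item.
    Inputs are per-stratum counts; strata are matched total-score levels."""
    num = den = 0.0
    for a, nr, c, nf in zip(ref_correct, ref_total, foc_correct, foc_total):
        b = nr - a          # reference incorrect
        d = nf - c          # focal incorrect
        n = nr + nf         # stratum size
        num += a * d / n
        den += b * c / n
    alpha = num / den                     # common odds ratio
    return alpha, -2.35 * log(alpha)      # MH D-DIF on the ETS delta scale

# Hypothetical counts at three matched score levels (low, middle, high):
alpha, d_dif = mantel_haenszel_dif(
    ref_correct=[40, 120, 180], ref_total=[100, 200, 220],
    foc_correct=[30, 100, 170], foc_total=[100, 200, 220],
)
print(f"alpha_MH = {alpha:.2f}, MH D-DIF = {d_dif:.2f}")
```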
Kong, Xiaojing J.; Wise, Steven L.; Bhola, Dennison S. – Educational and Psychological Measurement, 2007
This study compared four methods for setting item response time thresholds to differentiate rapid-guessing behavior from solution behavior. Thresholds were either (a) common for all test items, (b) based on item surface features such as the amount of reading required, (c) based on visually inspecting response time frequency distributions, or (d)…
Descriptors: Test Items, Reaction Time, Timed Tests, Item Response Theory
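Of the threshold-setting approaches compared above, the simplest is a single common threshold applied to every item. The sketch below flags sub-threshold responses as rapid guesses and summarizes one examinee's response-time effort; the three-second cutoff and the response times are assumptions for illustration, not values from the study.

```python
import numpy as np

def flag_rapid_guesses(response_times, threshold_seconds=3.0):
    """Flag responses faster than a common time threshold as rapid guesses.
    (One simple thresholding scheme; per-item thresholds are an alternative.)"""
    rt = np.asarray(response_times, dtype=float)
    return rt < threshold_seconds

# Hypothetical response times (seconds) for one examinee across ten items.
times = [12.4, 2.1, 18.0, 1.5, 25.3, 9.8, 2.7, 30.1, 14.6, 3.4]
rapid = flag_rapid_guesses(times)
effort = 1.0 - rapid.mean()   # proportion of items answered with solution behavior
print(f"rapid guesses: {rapid.sum()} of {len(times)}, response-time effort = {effort:.2f}")
```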
Mazor, Kathleen M.; And Others – 1993
The Mantel-Haenszel (MH) procedure has become one of the most popular procedures for detecting differential item functioning (DIF). One of the most troublesome criticisms of this procedure is that while detection rates for uniform DIF are very good, the procedure is not sensitive to non-uniform DIF. In this study, examinee responses were generated…
Descriptors: Comparative Testing, Computer Simulation, Item Bias, Item Response Theory
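Non-uniform DIF of the kind this study targets can be simulated by letting an item's discrimination differ between groups, so that the group difference in the probability of success changes across the ability range. A toy example under a two-parameter logistic model (the parameter values are invented, not those used in the study):

```python
import numpy as np

rng = np.random.default_rng(42)

def simulate_2pl(theta, a, b):
    """Simulate dichotomous responses under a two-parameter logistic model."""
    p = 1.0 / (1.0 + np.exp(-a * (theta - b)))
    return (rng.random(theta.shape) < p).astype(int)

theta_ref = rng.normal(0.0, 1.0, 1000)
theta_foc = rng.normal(0.0, 1.0, 1000)

# Non-uniform DIF: the item discriminates differently in the two groups
# (group-specific a-parameters), so the item response curves cross.
resp_ref = simulate_2pl(theta_ref, a=1.2, b=0.0)
resp_foc = simulate_2pl(theta_foc, a=0.6, b=0.0)
print(resp_ref.mean(), resp_foc.mean())
```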

Stocking, Martha L.; And Others – Applied Psychological Measurement, 1993
A method of automatically selecting items for inclusion in a test with constraints on item content and statistical properties was applied to real data. Tests constructed manually from the same data and constraints were compared to tests constructed automatically. Results show areas in which automated assembly can improve test construction. (SLD)
Descriptors: Algorithms, Automation, Comparative Testing, Computer Assisted Testing
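Automated test assembly selects items to satisfy content constraints while optimizing a statistical criterion. The greedy heuristic below is only meant to illustrate the idea of constrained selection; it is not the algorithm evaluated in the article, and the item pool is invented.

```python
# Greedy illustration of automated test assembly: pick items one at a
# time to maximize a statistical criterion while honoring content quotas.

items = [
    # (item id, content area, discrimination-like statistic)
    (1, "algebra", 0.62), (2, "algebra", 0.55), (3, "algebra", 0.40),
    (4, "geometry", 0.70), (5, "geometry", 0.48), (6, "geometry", 0.33),
    (7, "data", 0.58), (8, "data", 0.51), (9, "data", 0.44),
]
quota = {"algebra": 2, "geometry": 2, "data": 1}   # content constraints

def assemble(items, quota):
    chosen, counts = [], {area: 0 for area in quota}
    # Consider the statistically best items first.
    for item_id, area, stat in sorted(items, key=lambda t: -t[2]):
        if counts[area] < quota[area]:
            chosen.append(item_id)
            counts[area] += 1
    return chosen

print(assemble(items, quota))   # -> [4, 1, 7, 2, 5]
```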

Beaton, Albert E.; Allen, Nancy L. – Journal of Educational Statistics, 1992
The National Assessment of Educational Progress (NAEP) makes possible comparison of groups of students and provides information about what these groups know and can do. The scale anchoring techniques described in this chapter address the latter purpose. The direct method and the smoothing method of scale anchoring are discussed. (SLD)
Descriptors: Comparative Testing, Educational Assessment, Elementary Secondary Education, Knowledge Level
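Under the direct method of scale anchoring, an item is associated with a scale point when examinees at that point answer it correctly with high probability while examinees at the next lower point do not. The cutoffs and proportions in the sketch below are illustrative assumptions, not NAEP's operational criteria.

```python
# Illustrative "direct method" screen: an item anchors at a scale point if
# examinees there are very likely to answer it correctly while examinees
# at the next lower point are not. The 0.65 / 0.50 cutoffs are assumptions.

def anchors_at(p_at_level, p_at_lower, high=0.65, low=0.50):
    return p_at_level >= high and p_at_lower <= low

# Hypothetical proportions correct near two anchor points (300 and 250).
items = {"item_17": (0.72, 0.41), "item_23": (0.66, 0.58), "item_31": (0.80, 0.35)}
for name, (p300, p250) in items.items():
    print(name, "anchors at 300" if anchors_at(p300, p250) else "does not anchor")
```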
Brandon, E. P. – 1992
In his pioneer investigations of deductive logical reasoning competence, R. H. Ennis (R. H. Ennis and D. H. Paulus, 1965) used a multiple-choice format in which the premises are given, and it is asked whether the conclusion would then be true. In the adaptation of his work for use in Jamaica, the three possible answers were stated as…
Descriptors: Adults, Cognitive Tests, Comparative Testing, Competence

Bennett, Randy Elliot; And Others – Applied Psychological Measurement, 1990
The relationship of an expert-system-scored constrained free-response item type to multiple-choice and free-response items was studied using data for 614 students on the College Board's Advanced Placement Computer Science (APCS) Examination. Implications for testing and the APCS test are discussed. (SLD)
Descriptors: College Students, Comparative Testing, Computer Assisted Testing, Computer Science

Wainer, Howard; And Others – Journal of Educational Measurement, 1992
Computer simulations were run to measure the relationship between testlet validity and factors of item pool size and testlet length for both adaptive and linearly constructed testlets. Making a testlet adaptive yields only modest increases in aggregate validity because of the peakedness of the typical proficiency distribution. (Author/SLD)
Descriptors: Adaptive Testing, Comparative Testing, Computer Assisted Testing, Computer Simulation
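The finding above, that adaptivity buys little when the proficiency distribution is peaked, can be illustrated with a toy simulation: draw proficiencies from a standard normal, administer a fixed testlet and a two-stage adaptive testlet under a Rasch model, and compare how strongly each number-correct score correlates with true proficiency. The item difficulties and testlet structure here are invented, not those from the article.

```python
import numpy as np

rng = np.random.default_rng(1)
theta = rng.normal(0.0, 1.0, 5000)     # proficiency, peaked around 0

def answer(theta, b):
    """Rasch-model response to an item of difficulty b."""
    p = 1.0 / (1.0 + np.exp(-(theta - b)))
    return (rng.random(theta.shape) < p).astype(int)

# Linear testlet: three fixed items of medium difficulty.
linear_score = sum(answer(theta, b) for b in (-0.5, 0.0, 0.5))

# Two-stage adaptive testlet: a routing item, then an easier or harder pair.
route = answer(theta, 0.0)
easy = sum(answer(theta, b) for b in (-1.5, -1.0))
hard = sum(answer(theta, b) for b in (1.0, 1.5))
adaptive_score = route + np.where(route == 1, hard, easy)

print("linear r =", np.corrcoef(linear_score, theta)[0, 1])
print("adaptive r =", np.corrcoef(adaptive_score, theta)[0, 1])
```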

Crehan, Kevin D.; And Others – Educational and Psychological Measurement, 1993
Studies with 220 college students found that multiple-choice test items with three options are more difficult than those with four options, and that items with the none-of-these option are more difficult than those without it. Neither format manipulation affected item discrimination. Implications for test construction are discussed. (SLD)
Descriptors: College Students, Comparative Testing, Difficulty Level, Distractors (Tests)
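Difficulty and discrimination in studies like this one are usually classical item statistics: the proportion of examinees answering an item correctly, and the correlation between the item and the rest of the test. A minimal sketch with a made-up response matrix:

```python
import numpy as np

def item_analysis(responses):
    """Classical item statistics: difficulty (proportion correct) and
    corrected point-biserial discrimination for each item."""
    X = np.asarray(responses, dtype=float)          # examinees x items, scored 0/1
    difficulty = X.mean(axis=0)
    total = X.sum(axis=1)
    disc = []
    for j in range(X.shape[1]):
        rest = total - X[:, j]                      # total score excluding item j
        disc.append(np.corrcoef(X[:, j], rest)[0, 1])
    return difficulty, np.array(disc)

# Hypothetical 0/1 response matrix: 6 examinees x 4 items.
resp = [[1, 1, 0, 1],
        [1, 0, 0, 1],
        [1, 1, 1, 1],
        [0, 0, 0, 1],
        [1, 1, 0, 0],
        [0, 0, 0, 0]]
p, r = item_analysis(resp)
print("difficulty:", p, "discrimination:", r)
```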