ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	1
Since 2017 (last 10 years)	1
Since 2007 (last 20 years)	3

Descriptor

Comparative Testing	31
Difficulty Level	31
Test Items	31
Higher Education	16
Test Construction	12
Multiple Choice Tests	11
Test Format	10
Computer Assisted Testing	7
College Students	6
Item Response Theory	6
Test Reliability	6
Adaptive Testing	5
Item Analysis	5
Mathematics Tests	5
College Entrance Examinations	4
Elementary School Students	4
Scores	4
Test Anxiety	4
Test Validity	4
Black Students	3
Comparative Analysis	3
Elementary Education	3
Foreign Countries	3
Graduate Students	3
High Schools	3
More ▼

Source

Journal of Educational…	4
Applied Measurement in…	1
College Teaching	1
Contemporary Educational…	1
Educational and Psychological…	1
Evaluation and the Health…	1
Focus	1
Journal of Educational…	1
Language Learning	1
Practical Assessment,…	1
School Science and Mathematics	1
Studies in Educational…	1
More ▼

Publication Type

Reports - Research	26
Journal Articles	14
Speeches/Meeting Papers	12
Reports - Evaluative	5
Collected Works - Serials	1
Numerical/Quantitative Data	1
Opinion Papers	1
Reports - Descriptive	1
Tests/Questionnaires	1

Education Level

Higher Education	2
Grade 3	1
Grade 4	1
Grade 7	1
Grade 8	1
Postsecondary Education	1

Audience

Researchers

Location

United States	2
Alabama	1
Indonesia	1
Israel	1
Thailand	1

Laws, Policies, & Programs

Assessments and Surveys

SAT (College Admission Test)	3
Graduate Record Examinations	2
Alabama High School…	1
Raven Progressive Matrices	1
Test Anxiety Inventory	1
Trends in International…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 31 results Save | Export

From Investigating the Alignment of a Priori Item Characteristics Based on the CTT and Four-Parameter Logistic (4-PL) IRT Models to Further Exploring the Comparability of the Two Models

Peer reviewed
PDF on ERIC

Download full text

Agus Santoso; Heri Retnawati; Timbul Pardede; Ibnu Rafi; Munaya Nikma Rosyada; Gulzhaina K. Kassymova; Xu Wenxin – Practical Assessment, Research & Evaluation, 2024

The test blueprint is important in test development, where it guides the test item writer in creating test items according to the desired objectives and specifications or characteristics (so-called a priori item characteristics), such as the level of item difficulty in the category and the distribution of items based on their difficulty level.…

Descriptors: Foreign Countries, Undergraduate Students, Business English, Test Construction

Afraid Not: Student Performance versus Perception Based on Exam Question Format

Peer reviewed

Direct link

Laprise, Shari L. – College Teaching, 2012

Successful exam composition can be a difficult task. Exams should not only assess student comprehension, but be learning tools in and of themselves. In a biotechnology course delivered to nonmajors at a business college, objective multiple-choice test questions often require students to choose the exception or "not true" choice. Anecdotal student…

Descriptors: Feedback (Response), Test Items, Multiple Choice Tests, Biotechnology

Re-Examining Test Item Issues in the TIMSS Mathematics and Science Assessments

Peer reviewed

Direct link

Wang, Jianjun – School Science and Mathematics, 2011

As the largest international study ever taken in history, the Trend in Mathematics and Science Study (TIMSS) has been held as a benchmark to measure U.S. student performance in the global context. In-depth analyses of the TIMSS project are conducted in this study to examine key issues of the comparative investigation: (1) item flaws in mathematics…

Descriptors: Test Items, Figurative Language, Item Response Theory, Benchmarking

A Method for Comparing Test Difficulties.

Download full text

Frisbie, David A. – 1981

The relative difficulty ratio (RDR) is used as a method of representing test difficulty. The RDR is the ratio of a test mean to the ideal mean, the point midway between the perfect score and the mean chance score for the test. The RDR tranformation is a linear scale conversion method but not a linear equating method in the classical sense. The…

Descriptors: Comparative Testing, Difficulty Level, Evaluation Methods, Raw Scores

Examination of Various Influences on the Mantel-Haenszel Statistic.

Clauser, Brian E.; And Others – 1991

Item bias has been a major concern for test developers during recent years. The Mantel-Haenszel statistic has been among the preferred methods for identifying biased items. The statistic's performance in identifying uniform bias in simulated data modeled by producing various levels of difference in the (item difficulty) b-parameter for reference…

Descriptors: Comparative Testing, Difficulty Level, Item Bias, Item Response Theory

Some Advantages of Alternate-Choice Test Items.

Ebel, Robert L. – 1981

An alternate-choice test item is a simple declarative sentence, one portion of which is given with two different wordings. For example, "Foundations like Ford and Carnegie tend to be (1) eager (2) hesitant to support innovative solutions to educational problems." The examinee's task is to choose the alternative that makes the sentence…

Descriptors: Comparative Testing, Difficulty Level, Guessing (Tests), Multiple Choice Tests

Self-Adapted Testing: A Performance-Improving Variant of Computerized Adaptive Testing.

Peer reviewed

Rocklin, Thomas; O'Donnell, Angela M. – Journal of Educational Psychology, 1987

An experiment was conducted that contrasted a variant of computerized adaptive testing, self-adapted testing, with two traditional tests. Participants completed a self-report of text anxiety and were randomly assigned to take one of the three tests of verbal ability. Subjects generally chose more difficult items as the test progressed. (Author/LMO)

Descriptors: Adaptive Testing, Comparative Testing, Computer Assisted Testing, Difficulty Level

The None-of-the-Above Option: An Empirical Study.

Peer reviewed

Frary, Robert B. – Applied Measurement in Education, 1991

The use of the "none-of-the-above" option (NOTA) in 20 college-level multiple-choice tests was evaluated for classes with 100 or more students. Eight academic disciplines were represented, and 295 NOTA and 724 regular test items were used. It appears that the NOTA can be compatible with good classroom measurement. (TJH)

Descriptors: College Students, Comparative Testing, Difficulty Level, Discriminant Analysis

Use of an Inclusive Option and the Optimal Number of Options for Multiple-Choice Items.

Peer reviewed

Crehan, Kevin D.; And Others – Educational and Psychological Measurement, 1993

Studies with 220 college students found that multiple-choice test items with 3 items are more difficult than those with 4 items, and items with the none-of-these option are more difficult than those without this option. Neither format manipulation affected item discrimination. Implications for test construction are discussed. (SLD)

Descriptors: College Students, Comparative Testing, Difficulty Level, Distractors (Tests)

The Effect of Altering the Position of Options in a Multiple-Choice Examination.

Download full text

Cizek, Gregory J. – 1991

A commonly accepted rule for developing equated examinations using the common-items non-equivalent groups (CINEG) design is that items common to the two examinations being equated should be identical. The CINEG design calls for two groups of examinees to respond to a set of common items that is included in two examinations. In practice, this rule…

Descriptors: Certification, Comparative Testing, Difficulty Level, Higher Education

The Effects of Feedback in Computerized Adaptive and Self-Adapted Tests.

Download full text

Roos, Linda L.; And Others – 1992

Computerized adaptive (CA) testing uses an algorithm to match examinee ability to item difficulty, while self-adapted (SA) testing allows the examinee to choose the difficulty of his or her items. Research comparing SA and CA testing has shown that examinees experience lower anxiety and improved performance with SA testing. All previous research…

Descriptors: Ability Identification, Adaptive Testing, Algebra, Algorithms

Applications of the Analytically Derived Asymptotic Standard Errors of Item Response Theory Item Parameter Estimates

Peer reviewed

Direct link

Li, Yuan H.; Lissitz, Robert W. – Journal of Educational Measurement, 2004

The analytically derived asymptotic standard errors (SEs) of maximum likelihood (ML) item estimates can be approximated by a mathematical function without examinees' responses to test items, and the empirically determined SEs of marginal maximum likelihood estimation (MMLE)/Bayesian item estimates can be obtained when the same set of items is…

Descriptors: Test Items, Computation, Item Response Theory, Error of Measurement

Internal-Structure Analysis of Analytical Reasoning Worksamples 244 D and E and Development of Form H. Technical Report 1992-1.

Download full text

Bethscheider, Janine K. – 1992

Standard and experimental forms of the Johnson O'Connor Research Foundations Analytical Reasoning test were administered to 1,496 clients of the Foundation (persons seeking information about aptitude for educational and career decisions). The objectives were to develop a new form of the test and to better understand what makes some items more…

Descriptors: Adults, Aptitude Tests, Career Choice, Comparative Testing

Multiple-Choice and Alternate-Choice Questions: Description and Analysis.

Download full text

Dowd, Steven B. – 1992

An alternative to multiple-choice (MC) testing is suggested as it pertains to the field of radiologic technology education. General principles for writing MC questions are given and contrasted with a new type of MC question, the alternate-choice (AC) question, in which the answer choices are embedded in the question in a short form that resembles…

Descriptors: Comparative Testing, Difficulty Level, Evaluation Methods, Higher Education

An Investigation of the Relationship between Item Arrangement and Test Performance.

Chissom, Brad; Chukabarah, Prince C. O. – 1985

The comparative effects of various sequences of test items were examined for over 900 graduate students enrolled in an educational research course at The University of Alabama, Tuscaloosa. experiment, which was conducted a total of four times using four separate tests, presented three different arrangements of 50 multiple-choice items: (1)…

Descriptors: Analysis of Variance, Comparative Testing, Difficulty Level, Graduate Students

Previous Page | Next Page »

Pages: 1 | 2 | 3

Wise, Steven L.	2
Agus Santoso	1
Albanese, Mark A.	1
Anderson, Paul S.	1
Benderson, Albert, Ed.	1
Bethscheider, Janine K.	1
Chissom, Brad	1
Chukabarah, Prince C. O.	1
Cizek, Gregory J.	1
Clauser, Brian E.	1
Crehan, Kevin D.	1
Dowd, Steven B.	1
Ebel, Robert L.	1
Engelhard, George, Jr.	1
Frary, Robert B.	1
Freedle, Roy	1
Frisbie, David A.	1
Gerritz, Kalle	1
Green, Kathy E.	1
Gulzhaina K. Kassymova	1
Heri Retnawati	1
Hu, P. Gillian	1
Hyers, Albert D.	1
Ibnu Rafi	1
More ▼