Mazor, Kathleen M.; And Others – 1993
The Mantel-Haenszel (MH) procedure has become one of the most popular procedures for detecting differential item functioning (DIF). One of the most troublesome criticisms of this procedure is that while detection rates for uniform DIF are very good, the procedure is not sensitive to non-uniform DIF. In this study, examinee responses were generated…
Descriptors: Comparative Testing, Computer Simulation, Item Bias, Item Response Theory
Peer reviewed
Stocking, Martha L.; And Others – Applied Psychological Measurement, 1993
A method of automatically selecting items for inclusion in a test with constraints on item content and statistical properties was applied to real data. Tests constructed manually from the same data and constraints were compared to tests constructed automatically. Results show areas in which automated assembly can improve test construction. (SLD)
Descriptors: Algorithms, Automation, Comparative Testing, Computer Assisted Testing
Threlfall, John; Pool, Peter; Homer, Matthew; Swinnerton, Bronwen – Educational Studies in Mathematics, 2007
This article explores the effect on assessment of "translating" paper and pencil test items into their computer equivalents. Computer versions of a set of mathematics questions derived from the paper-based end of key stage 2 and 3 assessments in England were administered to age appropriate pupil samples, and the outcomes compared.…
Descriptors: Test Items, Student Evaluation, Foreign Countries, Test Validity
Peer reviewed
Gressard, Risa P.; Loyd, Brenda H. – Journal of Educational Measurement, 1991
A Monte Carlo study, which simulated 10,000 examinees' responses to four tests, investigated the effect of item stratification on parameter estimation in multiple matrix sampling of achievement data. Practical multiple matrix sampling is based on item stratification by item discrimination and a sampling plan with a moderate number of subtests. (SLD)
Descriptors: Achievement Tests, Comparative Testing, Computer Simulation, Estimation (Mathematics)
Ebel, Robert L. – 1981
An alternate-choice test item is a simple declarative sentence, one portion of which is given with two different wordings. For example, "Foundations like Ford and Carnegie tend to be (1) eager (2) hesitant to support innovative solutions to educational problems." The examinee's task is to choose the alternative that makes the sentence…
Descriptors: Comparative Testing, Difficulty Level, Guessing (Tests), Multiple Choice Tests
Brigham, Donald; Sullivan, Edward A. – 1980
The goals of the visual arts program of the Attleboro (MA) public schools, its relationship with the rest of the curriculum, and a study of the effectiveness of the program in seventh grade are described. It is suggested that the visual conceptual skills that are developed through the visual arts program are essential to cognitive processes and…
Descriptors: Cognitive Development, Comparative Testing, Concept Formation, Elementary Secondary Education
Peer reviewed
Rocklin, Thomas; O'Donnell, Angela M. – Journal of Educational Psychology, 1987
An experiment was conducted that contrasted a variant of computerized adaptive testing, self-adapted testing, with two traditional tests. Participants completed a self-report of test anxiety and were randomly assigned to take one of the three tests of verbal ability. Subjects generally chose more difficult items as the test progressed. (Author/LMO)
Descriptors: Adaptive Testing, Comparative Testing, Computer Assisted Testing, Difficulty Level
Peer reviewed
Dash, Udaya; Maguire, Thomas – Alberta Journal of Educational Research, 1984
Compares scores of 3,443 third graders in 1956 and 4,378 third graders in 1977 on the California Short Form Test of Mental Maturity. Examines differences in factorial structure and differences in ability level between groups for factors (64 items related to 7 components) apparently measuring consistent abilities. (SB)
Descriptors: Academic Ability, Comparative Analysis, Comparative Testing, Elementary Education
Peer reviewed
Ilai, Doron; Willerman, Lee – Intelligence, 1989
Items showing sex differences on the revised Wechsler Adult Intelligence Scale (WAIS-R) were studied. In a sample of 206 young adults (110 males and 96 females), 15 items demonstrated significant sex differences, but there was no relationship of item-specific gender content to sex differences in item performance. (SLD)
Descriptors: Comparative Testing, Females, Intelligence Tests, Item Analysis
Wallach, P. M.; Crespo, L. M.; Holtzman, K. Z.; Galbraith, R. M.; Swanson, D. B. – Advances in Health Sciences Education, 2006
Purpose: In conjunction with curricular changes, a process to develop integrated examinations was implemented. Pre-established guidelines were provided favoring vignettes, clinically relevant material, and application of knowledge rather than simple recall. Questions were read aloud in a committee including all course directors, and a reviewer…
Descriptors: Test Items, Rating Scales, Examiners, Guidelines
Kong, Xiaojing J.; Wise, Steven L.; Bhola, Dennison S. – Educational and Psychological Measurement, 2007
This study compared four methods for setting item response time thresholds to differentiate rapid-guessing behavior from solution behavior. Thresholds were either (a) common for all test items, (b) based on item surface features such as the amount of reading required, (c) based on visually inspecting response time frequency distributions, or (d)…
Descriptors: Test Items, Reaction Time, Timed Tests, Item Response Theory
Sykes, Robert C. – 1989
An analysis-of-covariance methodology was used to investigate whether there were population differences between tryout and operational Rasch item b-values relative to differences between pairs of item response theory (IRT) b-values from consecutive operational item administrations. This methodology allowed the evaluation of whether any such…
Descriptors: Analysis of Covariance, Certification, Comparative Testing, Item Response Theory
Peer reviewed
Prasse, David P.; Bracken, Bruce A. – Psychology in the Schools, 1981
Significant differences were found between the Peabody Picture Vocabulary Test-Revised mean standard scores and Verbal, Performance, and Full Scale IQs. The PPVT-R did not correlate significantly with the WISC-R scales or subtests, suggesting the tests are measuring different abilities. (Author)
Descriptors: Ability Identification, Children, Comparative Testing, Intelligence Tests
Peer reviewed
Green, Kathy – Journal of Experimental Education, 1979
Reliabilities and concurrent validities of teacher-made multiple-choice and true-false tests were compared. No significant differences were found even when multiple-choice reliability was adjusted to equate testing time. (Author/MH)
Descriptors: Comparative Testing, Higher Education, Multiple Choice Tests, Test Format
Borich, Gary D.; Paver, Sydney W. – 1974
Eighty undergraduates were administered four self-report locus of control inventories, in order to evaluate the convergent and discriminant validity of four categories common to these inventories: chance, fate, personal control, and powerful others. The four inventories were: (1) Internal, Powerful Others and Chance scales; (2) James Internal…
Descriptors: Comparative Testing, Higher Education, Individual Differences, Locus of Control