ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	2
Since 2017 (last 10 years)	3
Since 2007 (last 20 years)	29

Descriptor

Comparative Testing	146
Test Items	146
Test Format	44
Higher Education	42
Test Construction	38
Multiple Choice Tests	33
Difficulty Level	31
Computer Assisted Testing	29
Item Analysis	28
Foreign Countries	27
Item Response Theory	26
Test Validity	26
Mathematics Tests	22
Test Reliability	22
Scores	19
College Students	18
Item Bias	18
Adaptive Testing	17
Mathematical Models	17
College Entrance Examinations	16
Comparative Analysis	16
Test Bias	16
Achievement Tests	15
High Schools	15
Elementary School Students	14
More ▼

Publication Type

Reports - Research	102
Journal Articles	81
Speeches/Meeting Papers	39
Reports - Evaluative	35
Tests/Questionnaires	6
Reports - Descriptive	5
Numerical/Quantitative Data	3
Collected Works - Serials	2
Collected Works - General	1
Dissertations/Theses -…	1
Opinion Papers	1
More ▼

Education Level

Higher Education	11
Elementary Secondary Education	10
Postsecondary Education	7
Elementary Education	6
Grade 8	5
Grade 4	4
Secondary Education	4
Grade 3	3
Early Childhood Education	2
Grade 5	1
Grade 7	1
High Schools	1
Intermediate Grades	1
Junior High Schools	1
Middle Schools	1
Primary Education	1
More ▼

Audience

Researchers	6
Practitioners	1
Teachers	1

Location

United States	7
Canada	6
Germany	3
Israel	3
Australia	2
China	2
South Africa	2
United Kingdom (England)	2
Alabama	1
Canada (Edmonton)	1
France	1
Hong Kong	1
Indonesia	1
Jamaica	1
Maryland	1
Netherlands	1
New York	1
Portugal	1
Taiwan (Taipei)	1
Thailand	1
United Kingdom	1
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001

What Works Clearinghouse Rating

Test Items X

Showing 46 to 60 of 146 results Save | Export

An Examination of the Feasibility of Using Criterion-Referenced Measurement in Large-Scale, Survey Testing Situations.

Download full text

Graham, Darol L. – 1974

The adequacy of a test developed for statewide assessment of basic mathematics skills was investigated. The test, comprised of multiple-choice items reflecting a series of behavioral objectives, was compared with a more extensive criterion measure generated from the same objectives by the application of a strict item sampling model. In many…

Descriptors: Comparative Testing, Criterion Referenced Tests, Educational Assessment, Item Sampling

Interpreting Scales through Scale Anchoring.

Peer reviewed

Beaton, Albert E.; Allen, Nancy L. – Journal of Educational Statistics, 1992

The National Assessment of Educational Progress (NAEP) makes possible comparison of groups of students and provides information about what these groups know and can do. The scale anchoring techniques described in this chapter address the latter purpose. The direct method and the smoothing method of scale anchoring are discussed. (SLD)

Descriptors: Comparative Testing, Educational Assessment, Elementary Secondary Education, Knowledge Level

The None-of-the-Above Option: An Empirical Study.

Peer reviewed

Frary, Robert B. – Applied Measurement in Education, 1991

The use of the "none-of-the-above" option (NOTA) in 20 college-level multiple-choice tests was evaluated for classes with 100 or more students. Eight academic disciplines were represented, and 295 NOTA and 724 regular test items were used. It appears that the NOTA can be compatible with good classroom measurement. (TJH)

Descriptors: College Students, Comparative Testing, Difficulty Level, Discriminant Analysis

Conceptual versus Monolingual Scoring: When Does It Make a Difference?

Peer reviewed

Direct link

Bedore, Lisa M.; Pena, Elizabeth D.; Garcia, Melissa; Cortez, Celina – Language, Speech, and Hearing Services in Schools, 2005

Purpose: This study evaluates the extent to which bilingual children produce the same or overlapping responses on tasks assessing semantic skills in each of their languages and whether classification analysis based on monolingual or conceptual scoring can accurately classify the semantic development of typically developing (TD) bilingual children.…

Descriptors: Monolingualism, Semantics, Skill Development, Young Children

A Note on the Format of Ennis' Multiple-Choice Tests of Deductive Reasoning Competence.

Download full text

Brandon, E. P. – 1992

In his pioneer investigations of deductive logical reasoning competence, R. H. Ennis (R. H. Ennis and D. H. Paulus, 1965) used a multiple-choice format in which the premises are given, and it is asked whether the conclusion would then be true. In the adaptation of his work for use in Jamaica, the three possible answers were stated as…

Descriptors: Adults, Cognitive Tests, Comparative Testing, Competence

The Relationship of Expert-System Scored Constrained Free-Response Items to Multiple-Choice and Open-Ended Items.

Peer reviewed

Bennett, Randy Elliot; And Others – Applied Psychological Measurement, 1990

The relationship of an expert-system-scored constrained free-response item type to multiple-choice and free-response items was studied using data for 614 students on the College Board's Advanced Placement Computer Science (APCS) Examination. Implications for testing and the APCS test are discussed. (SLD)

Descriptors: College Students, Comparative Testing, Computer Assisted Testing, Computer Science

A Comparison of the Performance of Simulated Hierarchical and Linear Testlets.

Peer reviewed

Wainer, Howard; And Others – Journal of Educational Measurement, 1992

Computer simulations were run to measure the relationship between testlet validity and factors of item pool size and testlet length for both adaptive and linearly constructed testlets. Making a testlet adaptive yields only modest increases in aggregate validity because of the peakedness of the typical proficiency distribution. (Author/SLD)

Descriptors: Adaptive Testing, Comparative Testing, Computer Assisted Testing, Computer Simulation

Use of an Inclusive Option and the Optimal Number of Options for Multiple-Choice Items.

Peer reviewed

Crehan, Kevin D.; And Others – Educational and Psychological Measurement, 1993

Studies with 220 college students found that multiple-choice test items with 3 items are more difficult than those with 4 items, and items with the none-of-these option are more difficult than those without this option. Neither format manipulation affected item discrimination. Implications for test construction are discussed. (SLD)

Descriptors: College Students, Comparative Testing, Difficulty Level, Distractors (Tests)

Relationships among Multiple-Choice and Open-Ended Analytical Questions.

Peer reviewed

Bridgeman, Brent; Rock, Donald A. – Journal of Educational Measurement, 1993

Exploratory and confirmatory factor analyses were used to explore relationships among existing item types and three new computer-administered item types for the analytical scale of the Graduate Record Examination General Test. Results with 349 students indicate constructs the item types are measuring. (SLD)

Descriptors: College Entrance Examinations, College Students, Comparative Testing, Computer Assisted Testing

Effects of Practical Constraints on Item Selection Rules at the Early Stages of Computerized Adaptive Testing

Peer reviewed

Direct link

Chen, Shu-Ying; Ankenman, Robert D. – Journal of Educational Measurement, 2004

The purpose of this study was to compare the effects of four item selection rules--(1) Fisher information (F), (2) Fisher information with a posterior distribution (FP), (3) Kullback-Leibler information with a posterior distribution (KP), and (4) completely randomized item selection (RN)--with respect to the precision of trait estimation and the…

Descriptors: Test Length, Adaptive Testing, Computer Assisted Testing, Test Selection

Assessing Dimensionality of a Set of Items--Comparison of Different Approaches.

Download full text

Nandakumar, Ratna – 1992

The performance of the following four methodologies for assessing unidimensionality was examined: (1) DIMTEST; (2) the approach of P. W. Holland and P. R. Rosenbaum; (3) linear factor analysis; and (4) non-linear factor analysis. Each method is examined and compared with other methods using simulated data sets and real data sets. Seven data sets,…

Descriptors: Ability, Comparative Testing, Correlation, Equations (Mathematics)

Influence of the Criterion Variable on the Identification of Differentially Functioning Test Items Using the Mantel-Haenszel Statistic. Lab Report 198.

Clauser, Brian E.; And Others – 1991

This paper explores the effectiveness of the Mantel-Haenszel (MH) statistic in detecting differentially functioning test items when the internal criterion is varied. Using a data set from the 1982 statewide administration of a 150-item life skills examination (the New Mexico High School Proficiency Examination), a randomly selected sample of 1,000…

Descriptors: American Indians, Anglo Americans, Comparative Testing, High School Students

The Effect of Altering the Position of Options in a Multiple-Choice Examination.

Download full text

Cizek, Gregory J. – 1991

A commonly accepted rule for developing equated examinations using the common-items non-equivalent groups (CINEG) design is that items common to the two examinations being equated should be identical. The CINEG design calls for two groups of examinees to respond to a set of common items that is included in two examinations. In practice, this rule…

Descriptors: Certification, Comparative Testing, Difficulty Level, Higher Education

Computerized Adaptive Testing: A Comparison of the Nominal Response Model and the Three Parameter Logistic Model.

Download full text

DeAyala, R. J.; Koch, William R. – 1987

A nominal response model-based computerized adaptive testing procedure (nominal CAT) was implemented using simulated data. Ability estimates from the nominal CAT were compared to those from a CAT based upon the three-parameter logistic model (3PL CAT). Furthermore, estimates from both CAT procedures were compared with the known true abilities used…

Descriptors: Adaptive Testing, Comparative Testing, Computer Assisted Testing, Computer Simulation

The Effect of Negation and Polar Opposite Item Reversals on Questionnaire Reliability and Validity: An Experimental Investigation.

Peer reviewed

Schriesheim, Chester A.; And Others – Educational and Psychological Measurement, 1991

Effects of item wording on questionnaire reliability and validity were studied, using 280 undergraduate business students who completed a questionnaire comprising 4 item types: (1) regular; (2) polar opposite; (3) negated polar opposite; and (4) negated regular. Implications of results favoring regular and negated regular items are discussed. (SLD)

Descriptors: Business Education, Comparative Testing, Higher Education, Negative Forms (Language)

« Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10

Journal of Educational…	16
Applied Psychological…	5
Educational and Psychological…	5
Applied Measurement in…	3
Educational Measurement:…	3
Journal of Technology,…	3
American Educational Research…	2
Contemporary Educational…	2
Evaluation and the Health…	2
Intelligence	2
Journal of Cross-Cultural…	2
Journal of Educational…	2
Journal of Educational…	2
Journal of Experimental…	2
Studies in Educational…	2
Advances in Health Sciences…	1
Alberta Journal of…	1
British Educational Research…	1
Career Development and…	1
College Teaching	1
Curriculum Journal	1
ETS Research Report Series	1
Education and Information…	1
Educational Assessment	1
Educational Research Quarterly	1
More ▼

Wise, Steven L.	3
Badger, Elizabeth	2
Bridgeman, Brent	2
Clarke, S. C. T.	2
Clauser, Brian E.	2
De Ayala, R. J.	2
Ellis, Barbara B.	2
Hughes, Carolyn	2
Lissitz, Robert W.	2
Little, Todd D.	2
Nandakumar, Ratna	2
Palmer, Susan B.	2
Plake, Barbara S.	2
Ryan, Katherine E.	2
Seo, Hyojeong	2
Shogren, Karrie A.	2
Sykes, Robert C.	2
Thomas, Brenda	2
Thompson, James R.	2
Trevisan, Michael S.	2
Wainer, Howard	2
Wehmeyer, Michael L.	2
Welch, Catherine J.	2
Agus Santoso	1
More ▼

SAT (College Admission Test)	6
Graduate Record Examinations	5
National Assessment of…	5
Trends in International…	5
California Achievement Tests	3
Advanced Placement…	2
Iowa Tests of Basic Skills	2
Program for International…	2
Progress in International…	2
Wechsler Intelligence Scale…	2
ACT Assessment	1
Alabama High School…	1
Beck Depression Inventory	1
Behavior Assessment System…	1
California Test of Mental…	1
College Board Achievement…	1
Comprehensive Tests of Basic…	1
Embedded Figures Test	1
Gates MacGinitie Reading Tests	1
General Educational…	1
Kaufman Assessment Battery…	1
Metropolitan Achievement Tests	1
Peabody Picture Vocabulary…	1
Raven Progressive Matrices	1
Stanford Achievement Tests	1
More ▼