| Publication Date | Results |
| --- | --- |
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 2 |
| Since 2017 (last 10 years) | 3 |
| Since 2007 (last 20 years) | 29 |

| Author | Results |
| --- | --- |
| Wise, Steven L. | 3 |
| Badger, Elizabeth | 2 |
| Bridgeman, Brent | 2 |
| Clarke, S. C. T. | 2 |
| Clauser, Brian E. | 2 |
| De Ayala, R. J. | 2 |
| Ellis, Barbara B. | 2 |
| Hughes, Carolyn | 2 |
| Lissitz, Robert W. | 2 |
| Little, Todd D. | 2 |
| Nandakumar, Ratna | 2 |

| Education Level | Results |
| --- | --- |
| Higher Education | 11 |
| Elementary Secondary Education | 10 |
| Postsecondary Education | 7 |
| Elementary Education | 6 |
| Grade 8 | 5 |
| Grade 4 | 4 |
| Secondary Education | 4 |
| Grade 3 | 3 |
| Early Childhood Education | 2 |
| Grade 5 | 1 |
| Grade 7 | 1 |

| Audience | Results |
| --- | --- |
| Researchers | 6 |
| Practitioners | 1 |
| Teachers | 1 |

| Location | Results |
| --- | --- |
| United States | 7 |
| Canada | 6 |
| Germany | 3 |
| Israel | 3 |
| Australia | 2 |
| China | 2 |
| South Africa | 2 |
| United Kingdom (England) | 2 |
| Alabama | 1 |
| Canada (Edmonton) | 1 |
| France | 1 |

| Laws, Policies, & Programs | Results |
| --- | --- |
| No Child Left Behind Act 2001 | 1 |

National Center for Education Statistics, 2013
The 2011 NAEP-TIMSS linking study conducted by the National Center for Education Statistics (NCES) was designed to predict Trends in International Mathematics and Science Study (TIMSS) scores for the U.S. states that participated in the 2011 National Assessment of Educational Progress (NAEP) mathematics and science assessment of eighth-grade students.…
Descriptors: Grade 8, Research Methodology, Research Design, Trend Analysis
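
The snippet does not say which linking method NCES used, but statistical projection is one standard approach: fit a regression of TIMSS on NAEP for jurisdictions with both scores, then predict TIMSS for NAEP-only states. A minimal sketch with invented numbers:

```python
import numpy as np

# Toy illustration of score projection: fit TIMSS ~ NAEP on states that
# took both assessments, then predict TIMSS for NAEP-only states.
# All numbers below are invented for illustration.
naep_both = np.array([278.0, 285.0, 291.0, 270.0, 299.0])   # states with both scores
timss_both = np.array([505.0, 520.0, 531.0, 492.0, 545.0])

slope, intercept = np.polyfit(naep_both, timss_both, 1)      # least-squares line

naep_only = np.array([282.0, 294.0])                         # states with NAEP only
predicted_timss = intercept + slope * naep_only
print(predicted_timss)
```
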
Foy, Pierre, Ed.; Arora, Alka, Ed.; Stanco, Gabrielle M., Ed. – International Association for the Evaluation of Educational Achievement, 2013
The TIMSS 2011 International Database includes data for all questionnaires administered as part of the TIMSS 2011 assessment. This supplement contains the international version of the TIMSS 2011 background questionnaires and curriculum questionnaires in the following 10 sections: (1) Fourth Grade Student Questionnaire; (2) Fourth Grade Home…
Descriptors: Background, Questionnaires, Test Items, Grade 4
Sparfeldt, Jörn R.; Kimmel, Rumena; Löwenkamp, Lena; Steingräber, Antje; Rost, Detlef H. – Educational Assessment, 2012
Multiple-choice (MC) reading comprehension test items comprise three components: text passage, questions about the text, and MC answers. The construct validity of this format has been repeatedly criticized. In three between-subjects experiments, fourth graders (N₁ = 230, N₂ = 340, N₃ = 194) worked on three…
Descriptors: Test Items, Reading Comprehension, Construct Validity, Grade 4
Sinharay, Sandip; Holland, Paul W. – Journal of Educational Measurement, 2007
It is a widely held belief that anchor tests should be miniature versions (i.e., "minitests"), with respect to content and statistical characteristics, of the tests being equated. This article examines the foundations for this belief regarding statistical characteristics. It examines the requirement of statistical representativeness of…
Descriptors: Test Items, Comparative Testing
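
A quick way to check the "minitest" property the abstract questions is to compare the anchor's classical difficulty profile against the full test's. A sketch assuming a 0/1 scored response matrix and a hypothetical anchor:

```python
import numpy as np

# Compare the difficulty profile of an anchor to the full test using
# classical p-values (proportion correct). Invented 0/1 response data.
rng = np.random.default_rng(0)
responses = rng.integers(0, 2, size=(200, 40))   # 200 examinees x 40 items
anchor_items = range(0, 40, 4)                   # hypothetical 10-item anchor

p_all = responses.mean(axis=0)
p_anchor = p_all[list(anchor_items)]

# A "minitest" anchor should roughly match both moments of the full test.
print(f"full test: mean p = {p_all.mean():.3f}, sd = {p_all.std():.3f}")
print(f"anchor:    mean p = {p_anchor.mean():.3f}, sd = {p_anchor.std():.3f}")
```
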
Kim, Sooyeon; Walker, Michael E.; McHale, Frederick – Journal of Educational Measurement, 2010
In this study we examined variations of the nonequivalent groups equating design for tests containing both multiple-choice (MC) and constructed-response (CR) items to determine which design was most effective in producing equivalent scores across the two tests to be equated. Using data from a large-scale exam, this study investigated the use of…
Descriptors: Measures (Individuals), Scoring, Equated Scores, Test Bias
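
For context, one textbook method for the nonequivalent-groups anchor test (NEAT) design mentioned here is chained linear equating; the abstract does not say whether it is among the variations the authors compared. A sketch with invented summary statistics:

```python
def chained_linear_equate(x, mx_x, sx_x, mx_v1, sx_v1, my_v2, sy_v2, my_y, sy_y):
    """Chained linear equating for a NEAT design.

    Link form X to the anchor V in population 1, then the anchor to
    form Y in population 2. All means/SDs below are hypothetical.
    """
    v = mx_v1 + (sx_v1 / sx_x) * (x - mx_x)      # X score -> anchor scale (pop 1)
    return my_y + (sy_y / sy_v2) * (v - my_v2)   # anchor scale -> Y score (pop 2)

# Invented summary statistics for illustration.
print(chained_linear_equate(x=30, mx_x=28, sx_x=6, mx_v1=14, sx_v1=3,
                            my_v2=13, sy_v2=3.2, my_y=27, sy_y=6.5))
```
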
Wang, Jianjun – School Science and Mathematics, 2011
As the largest international study ever conducted, the Trends in International Mathematics and Science Study (TIMSS) has been held up as a benchmark for measuring U.S. student performance in a global context. In-depth analyses of the TIMSS project are conducted in this study to examine key issues of the comparative investigation: (1) item flaws in mathematics…
Descriptors: Test Items, Figurative Language, Item Response Theory, Benchmarking
Lissitz, Robert W.; Hou, Xiaodong; Slater, Sharon Cadman – Journal of Applied Testing Technology, 2012
This article investigates several questions regarding the impact of different item formats on measurement characteristics. Constructed response (CR) items and multiple choice (MC) items obviously differ in their formats and in the resources needed to score them. As such, they have been the subject of considerable discussion regarding the impact of…
Descriptors: Computer Assisted Testing, Scoring, Evaluation Problems, Psychometrics
Schulz, Wolfram; Fraillon, Julian – Educational Research and Evaluation, 2011
When comparing data derived from tests or questionnaires in cross-national studies, researchers commonly assume measurement invariance in their underlying scaling models. However, different cultural contexts, languages, and curricula can have powerful effects on how students respond in different countries. This article illustrates how the…
Descriptors: Citizenship Education, International Studies, Item Response Theory, International Education
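
As a crude stand-in for the article's IRT-based analysis, one can screen for non-invariant items by removing overall country and item effects from a country-by-item difficulty matrix and flagging large residuals. Purely illustrative, with invented data and an arbitrary threshold:

```python
import numpy as np

# Flag potentially non-invariant items: after removing each country's
# overall level, an item whose country-specific difficulty departs far
# from the pooled item difficulty is a candidate for country-level DIF.
# p[c, i] = proportion correct for country c on item i (invented data).
rng = np.random.default_rng(1)
p = rng.uniform(0.3, 0.9, size=(6, 20))

country_effect = p.mean(axis=1, keepdims=True)   # each country's mean level
item_effect = p.mean(axis=0, keepdims=True)      # pooled item difficulty
residual = p - country_effect - item_effect + p.mean()

flags = np.argwhere(np.abs(residual) > 0.15)     # arbitrary screening threshold
for c, i in flags:
    print(f"country {c}, item {i}: residual {residual[c, i]:+.2f}")
```
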
Raymond, Mark R.; Neustel, Sandra; Anderson, Dan – Educational Measurement: Issues and Practice, 2009
Examinees who take high-stakes assessments are usually given an opportunity to repeat the test if they are unsuccessful on their initial attempt. To prevent examinees from obtaining unfair score increases by memorizing the content of specific test items, testing agencies usually assign a different test form to repeat examinees. The use of multiple…
Descriptors: Test Results, Test Items, Testing, Aptitude Tests
Kato, Kentaro; Moen, Ross E.; Thurlow, Martha L. – Educational Measurement: Issues and Practice, 2009
Large data sets from a state reading assessment for third and fifth graders were analyzed to examine differential item functioning (DIF), differential distractor functioning (DDF), and differential omission frequency (DOF) between students with particular categories of disabilities (speech/language impairments, learning disabilities, and emotional…
Descriptors: Learning Disabilities, Language Impairments, Behavior Disorders, Affective Behavior
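
Of the three indices, differential omission frequency (DOF) is the most direct to compute: compare per-item omission rates between a focal disability group and a reference group. A sketch with invented responses, using NaN to mark omitted items:

```python
import numpy as np

# Differential omission frequency (DOF), sketched: per-item omission
# rates for a focal group vs. a reference group. Responses are coded
# np.nan when omitted; all data invented.
rng = np.random.default_rng(2)
ref = rng.choice([0.0, 1.0, np.nan], p=[0.35, 0.60, 0.05], size=(500, 10))
foc = rng.choice([0.0, 1.0, np.nan], p=[0.35, 0.55, 0.10], size=(120, 10))

omit_ref = np.isnan(ref).mean(axis=0)
omit_foc = np.isnan(foc).mean(axis=0)
dof = omit_foc - omit_ref                 # positive -> focal group omits more
print(np.round(dof, 3))
```
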
Coe, Robert – Oxford Review of Education, 2008
The comparability of examinations in different subjects has been a controversial topic for many years and a number of criticisms have been made of statistical approaches to estimating the "difficulties" of achieving particular grades in different subjects. This paper argues that if comparability is understood in terms of a linking…
Descriptors: Test Items, Grades (Scholastic), Foreign Countries, Test Bias
Ferdous, Abdullah A.; Plake, Barbara S. – Educational and Psychological Measurement, 2007
In an Angoff standard setting procedure, judges estimate the probability that a hypothetical randomly selected minimally competent candidate will answer correctly each item in the test. In many cases, these item performance estimates are made twice, with information shared with the panelists between estimates. Especially for long tests, this…
Descriptors: Test Items, Probability, Item Analysis, Standard Setting (Scoring)
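
In an Angoff procedure like the one described, the recommended raw cut score is conventionally the sum, across items, of the judges' mean probability estimates; a second round lets panelists revise after shared feedback. A sketch with invented ratings:

```python
import numpy as np

# Angoff standard setting, sketched: each judge estimates, per item, the
# probability that a minimally competent candidate answers correctly.
# The raw cut score is the sum of item means. Ratings are invented.
round1 = np.array([                 # judges x items
    [0.6, 0.7, 0.4, 0.8, 0.5],
    [0.5, 0.8, 0.5, 0.7, 0.6],
    [0.7, 0.6, 0.3, 0.9, 0.5],
])
round2 = round1 + 0.02              # pretend estimates shifted after feedback

for label, ratings in (("round 1", round1), ("round 2", round2)):
    cut = ratings.mean(axis=0).sum()
    print(f"{label}: recommended raw cut score = {cut:.2f}")
```
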
Frisbie, David A. – 1981
The relative difficulty ratio (RDR) is used as a method of representing test difficulty. The RDR is the ratio of a test mean to the ideal mean, the point midway between the perfect score and the mean chance score for the test. The RDR transformation is a linear scale conversion method but not a linear equating method in the classical sense. The…
Descriptors: Comparative Testing, Difficulty Level, Evaluation Methods, Raw Scores
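
The abstract gives the RDR definition outright: the test mean divided by the ideal mean, where the ideal mean is halfway between the perfect score and the mean chance score. A direct transcription:

```python
def relative_difficulty_ratio(mean_score, perfect_score, n_items, n_options):
    """Relative difficulty ratio (RDR) as described in the abstract:
    the test mean divided by the 'ideal mean', the point midway between
    the perfect score and the mean chance score."""
    chance_mean = n_items / n_options           # expected score from guessing
    ideal_mean = (perfect_score + chance_mean) / 2
    return mean_score / ideal_mean

# 40 four-option MC items: chance mean = 10, so the ideal mean is 25.
print(relative_difficulty_ratio(mean_score=28.0, perfect_score=40,
                                n_items=40, n_options=4))
```
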
Melnick, Steven A.; Gable, Robert K. – Educational Research Quarterly, 1990
By administering an attitude survey to 3,328 parents of elementary school students, the use of positive and negative Likert item stems was analyzed. Respondents who consistently answered positive/negative item pairs that were parallel in meaning were compared with those who answered inconsistently. Implications for construction of affective measures…
Descriptors: Affective Measures, Comparative Testing, Elementary Education, Likert Scales
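
One way to run the consistency comparison described here is to reverse-code the negatively worded member of each parallel pair and flag respondents whose paired answers disagree by more than some tolerance. A sketch with invented 5-point responses and an arbitrary 1-point tolerance:

```python
import numpy as np

# Screen for inconsistent responding on positively/negatively worded
# Likert pairs: reverse-code the negative stem on a 1-5 scale, then
# flag respondents whose paired answers differ by more than 1 point.
rng = np.random.default_rng(3)
pos = rng.integers(1, 6, size=(8, 3))      # responses to positive stems
neg = rng.integers(1, 6, size=(8, 3))      # responses to matched negative stems

neg_reversed = 6 - neg                     # 1<->5, 2<->4 on a 5-point scale
inconsistent = np.abs(pos - neg_reversed) > 1
print("flagged respondents:", np.where(inconsistent.any(axis=1))[0])
```
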
Clauser, Brian E.; And Others – 1991
Item bias has been a major concern for test developers during recent years. The Mantel-Haenszel statistic has been among the preferred methods for identifying biased items. The statistic's performance in identifying uniform bias in simulated data modeled by producing various levels of difference in the (item difficulty) b-parameter for reference…
Descriptors: Comparative Testing, Difficulty Level, Item Bias, Item Response Theory
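
The Mantel-Haenszel procedure referenced here aggregates 2x2 (group by correct/incorrect) tables across matched total-score strata; its common odds ratio is the usual index of uniform DIF. A sketch with invented strata:

```python
def mantel_haenszel_odds_ratio(tables):
    """Mantel-Haenszel common odds ratio across score strata.

    Each table is (a, b, c, d): reference right/wrong, focal right/wrong.
    Values near 1.0 suggest little uniform DIF on the studied item.
    """
    num = sum(a * d / (a + b + c + d) for a, b, c, d in tables)
    den = sum(b * c / (a + b + c + d) for a, b, c, d in tables)
    return num / den

# Invented strata (grouped by matched total score) for one item.
strata = [(30, 10, 25, 15), (40, 20, 35, 25), (50, 30, 45, 35)]
print(f"MH odds ratio: {mantel_haenszel_odds_ratio(strata):.2f}")
```
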