Publication Date
| In 2026 | 0 |
| Since 2025 | 197 |
| Since 2022 (last 5 years) | 1067 |
| Since 2017 (last 10 years) | 2577 |
| Since 2007 (last 20 years) | 4938 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 653 |
| Teachers | 563 |
| Researchers | 250 |
| Students | 201 |
| Administrators | 81 |
| Policymakers | 22 |
| Parents | 17 |
| Counselors | 8 |
| Community | 7 |
| Support Staff | 3 |
| Media Staff | 1 |
| More ▼ | |
Location
| Turkey | 225 |
| Canada | 223 |
| Australia | 155 |
| Germany | 116 |
| United States | 99 |
| China | 90 |
| Florida | 86 |
| Indonesia | 82 |
| Taiwan | 78 |
| United Kingdom | 73 |
| California | 65 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 4 |
| Meets WWC Standards with or without Reservations | 4 |
| Does not meet standards | 1 |
Peer reviewedPreece, Peter F. W.; Skinner, Nigel G.; Riall, Robin A. H. – International Journal of Science Education, 1999
Describes a study of gender differences in science achievement in England and Wales. Finds that the most pronounced gender differences in favor of males occurred in the higher-level papers taken by more able students, especially in physics. Also, more discriminating questions exhibited larger gender gaps in favor of males. Contains 21 references.…
Descriptors: Achievement, Foreign Countries, Grade 9, National Competency Tests
Peer reviewedKing, Chris; Brooks, Mike; Gill, Robin; Rhodes, Alan; Thompson, David – School Science Review, 1999
Finds variable coverage of Earth Science topics in the United Kingdom among General Certificate of Secondary Education (GCSE) double-award science syllabuses and examination papers. Concludes that the levels of error in the examination papers were high and that Earth Science questions showed lower levels of demand and higher levels of recall than…
Descriptors: British National Curriculum, Earth Science, Foreign Countries, National Competency Tests
Peer reviewedShymansky, James A.; Chidsey, Jennifer L.; Henriques, Laura; Enger, Sandra; Yore, Larry D.; Wolfe, Edward W.; Jorgensen, Margaret – School Science and Mathematics, 1997
Describes the design of four science-performance tasks for grade 9 students and the relationship between their performance on those tasks and multiple-choice items on the Iowa Tests of Educational Development. The students and schools used to develop the tasks were not included in the verification sample. Contains 22 references. (Author/ASK)
Descriptors: Academic Achievement, Grade 9, High Schools, Multiple Choice Tests
Peer reviewedLane, Suzanne; Parke, Carol S.; Stone, Clement A. – Educational Measurement: Issues and Practice, 1998
Provides a general framework for examining the consequences of assessment programs, especially statewide programs that intend to improve student learning by holding schools accountable. The framework is intended for use with programs using performance-based tasks but can be used with programs using traditional item formats as well. (SLD)
Descriptors: Accountability, Educational Assessment, Elementary Secondary Education, Performance Based Assessment
Peer reviewedMeijer, Rob R.; And Others – Applied Psychological Measurement, 1995
Three methods based on the nonparametric item response theory (IRT) of R. J. Mokken for the estimation of the reliability of single dichotomous test items are discussed. Analytical and Monte Carlo studies show that one method, designated "MS," is superior because of smaller bias and smaller sampling variance. (SLD)
Descriptors: Estimation (Mathematics), Item Response Theory, Monte Carlo Methods, Nonparametric Statistics
Peer reviewedSireci, Stephen G.; Berberoglu, Giray – Applied Measurement in Education, 2000
Studied a method for investigating the equivalence of translated-adapted items using bilingual test takers through item response theory. Results from an English-Turkish course evaluation form completed by 688 Turkish students indicate that the methodology is effective in flagging items that function differentially across languages and informing…
Descriptors: Bilingualism, College Students, Evaluation Methods, Higher Education
Peer reviewedPiotrowski, Chris; Perdue, Bob – Behavioral & Social Sciences Librarian, 1999
Presents an overview of the major contemporary reference sources (print, online, and electronic) for scholarly information about psychological/educational tests. Stresses books and compendia that will assist reference librarians, and includes a compilation of texts that provide actual test items. Contains 53 references. (Author/LRW)
Descriptors: Educational Testing, Electronic Text, Library Materials, Online Searching
Peer reviewedAllalouf, Avi; Hambleton, Ronald K.; Sireci, Stephen G. – Journal of Educational Measurement, 1999
Focused on whether differential item functioning (DIF) is related to item type in translated test items and the causes of DIF using data from an Israeli college entrance test in Hebrew and a Russian translation. Results from 24,304 college applicants indicate that 34% of items functioned differently across items. (SLD)
Descriptors: College Applicants, College Entrance Examinations, Foreign Countries, Hebrew
Peer reviewedMcInerney, Dennis M.; Yeung, Alexander Seeshing; McInerney, Valentina – Journal of Applied Measurement, 2001
Validated the Motivation Orientation scales of the Inventory of School Motivation (ISM) (M. Maher, 1984) across Navajo (n=760) and Anglo (n=1,012) students. Findings show that even though the ISM motivation orientation scales are applicable to students of different cultural backgrounds, meaningful cross-cultural comparisons should use the 30 items…
Descriptors: American Indians, Anglo Americans, Comparative Analysis, Cross Cultural Studies
Monahan, Patrick O.; Ankenmann, Robert D. – Journal of Educational Measurement, 2005
Empirical studies demonstrated Type-I error (TIE) inflation (especially for highly discriminating easy items) of the Mantel-Haenszel chi-square test for differential item functioning (DIF), when data conformed to item response theory (IRT) models more complex than Rasch, and when IRT proficiency distributions differed only in means. However, no…
Descriptors: Sample Size, Item Response Theory, Test Items, Test Bias
Kulm, Gerald; Dager Wilson, Linda; Kitchen, Richard – Educational Assessment, 2005
Alignment has taken on increased importance given the current high-stakes nature of assessment. To make well-informed decisions about student learning on the basis of test results, assessment items need to be well aligned with standards. Project 2061 of the American Association for the Advancement of Science (AAAS) has developed a procedure for…
Descriptors: Test Results, Test Validity, Evaluation Methods, Mathematics Instruction
DiBattista, David; Mitterer, John O.; Gosse, Leanne – Teaching in Higher Education, 2004
Undergraduates completed a questionnaire after using the Immediate Feedback Assessment Technique (IFAT), a commercially available answer form for multiple-choice (MC) testing that can be used easily and conveniently with large classes. This simple new technique for MC testing provides immediate feedback for each item in an answer-until-correct…
Descriptors: Multiple Choice Tests, Testing, Feedback, Guessing (Tests)
Bridgeman, Brent; Cline, Frederick – Journal of Educational Measurement, 2004
Time limits on some computer-adaptive tests (CATs) are such that many examinees have difficulty finishing, and some examinees may be administered tests with more time-consuming items than others. Results from over 100,000 examinees suggested that about half of the examinees must guess on the final six questions of the analytical section of the…
Descriptors: Guessing (Tests), Timed Tests, Adaptive Testing, Computer Assisted Testing
Kim, Jee-Seon – Journal of Educational Measurement, 2006
Simulation and real data studies are used to investigate the value of modeling multiple-choice distractors on item response theory linking. Using the characteristic curve linking procedure for Bock's (1972) nominal response model presented by Kim and Hanson (2002), all-category linking (i.e., a linking based on all category characteristic curves…
Descriptors: Multiple Choice Tests, Test Items, Item Response Theory, Simulation
DiStefano, Christine; Motl, Robert W. – Structural Equation Modeling: A Multidisciplinary Journal, 2006
This article used multitrait-multimethod methodology and covariance modeling for an investigation of the presence and correlates of method effects associated with negatively worded items on the Rosenberg Self-Esteem (RSE) scale (Rosenberg, 1989) using a sample of 757 adults. Results showed that method effects associated with negative item phrasing…
Descriptors: Adults, Correlation, Self Esteem, Surveys

Direct link
