Publication Date
| In 2026 | 0 |
| Since 2025 | 215 |
| Since 2022 (last 5 years) | 1084 |
| Since 2017 (last 10 years) | 2594 |
| Since 2007 (last 20 years) | 4955 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 653 |
| Teachers | 563 |
| Researchers | 250 |
| Students | 201 |
| Administrators | 81 |
| Policymakers | 22 |
| Parents | 17 |
| Counselors | 8 |
| Community | 7 |
| Support Staff | 3 |
| Media Staff | 1 |
| More ▼ | |
Location
| Turkey | 226 |
| Canada | 223 |
| Australia | 155 |
| Germany | 116 |
| United States | 99 |
| China | 90 |
| Florida | 86 |
| Indonesia | 82 |
| Taiwan | 78 |
| United Kingdom | 73 |
| California | 66 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 4 |
| Meets WWC Standards with or without Reservations | 4 |
| Does not meet standards | 1 |
Peer reviewedHsu, Tse-chi; Yu, Lifa – Educational Measurement: Issues and Practice, 1989
How computers are used to analyze item data is reviewed, and the information that existing item-analysis programs provide is described. Summaries of studies comparing the performance of some of these packages reveal some of their current limitations. Emphasis is on the usefulness to educational practice of these packages. (SLD)
Descriptors: Computer Assisted Testing, Computer Software, Computer Software Reviews, Computer Uses in Education
Peer reviewedSmith, Richard M.; And Others – Journal of Dental Education, 1989
A study of gender bias in the Dental Admission Test's mathematics test and its validity in predicting dental school success found no significant difference between male and female performance and no significant difference in the predictive validity of items favoring males or females. (Author/MSE)
Descriptors: College Entrance Examinations, Dental Schools, Higher Education, Logical Thinking
Peer reviewedGohmann, Stephan F.; Spector, Lee C. – Journal of Economic Education, 1989
Compares the effect of content ordering and scrambled ordering on examinations in courses, such as economics, that require quantitative skills. Empirical results suggest that students do no better if they are given a content-ordered rather than a scrambled examination as student performance is not adversely affected by scrambled ordered…
Descriptors: Cheating, Economics Education, Educational Research, Grading
Peer reviewedWay, Walter D.; And Others – Applied Measurement in Education, 1989
The effects of using item response theory (IRT) ability estimates based on customized tests formed by selecting areas from a nationally standardized achievement test were examined. For some populations, in some conditions, IRT ability estimates can be equivalent to scores based on full-length tests. (SLD)
Descriptors: Achievement Tests, Adaptive Testing, Content Validity, Elementary Education
Peer reviewedMcKinley, Robert L. – Journal of Educational Measurement, 1988
Six procedures for combining sets of item response theory (IRT) item parameter estimates from different samples were evaluated using real and simulated response data. Results support use of covariance matrix-weighted averaging and a procedure using sample-size-weighted averaging of estimated item characteristic curves at the center of the ability…
Descriptors: College Entrance Examinations, Comparative Analysis, Computer Simulation, Estimation (Mathematics)
Peer reviewedReynolds, Trudy; And Others – Language Testing, 1994
Presents a study conducted to provide a comparative analysis of five item analysis indices using both IRT and non-IRT indices to describe the characteristics of flagged items and to investigate the appropriateness of logistic regression as an item analysis technique for further studies. The performance of five item analysis indices was examined.…
Descriptors: College Students, Comparative Analysis, English (Second Language), Item Analysis
Peer reviewedBordage, Georges; And Others – Academic Medicine, 1995
Three related Canadian studies assessed the content validity of 59 clinical problems designed as part of a test of medical decision-making skills. Focus was on the key features, i.e., the critical or essential steps in identification and management of the clinical problem. Results support content validity of the key features. (MSE)
Descriptors: Clinical Teaching (Health Professions), Content Validity, Decision Making, Foreign Countries
Peer reviewedKorashy, Abdel-Fattah El- – Educational and Psychological Measurement, 1995
The Rasch model was applied to selection of items for an Arabic version of the Otis-Lennon Mental Ability Test using a sample of 599 male and female Kuwaiti secondary school and university students. Results indicated that the test is suitable for the range of ability intended to be measured. (SLD)
Descriptors: Arabic, Cognitive Ability, College Students, Foreign Countries
Peer reviewedBlack, Paul – Studies in Educational Evaluation, 1995
The role of assessment in science education is explored, focusing on summative assessment in British public certificate examinations. Examples of test items are presented to illustrate difficulties in making valid and reliable assessments, and issues with implications for formative assessment are discussed. (SLD)
Descriptors: Educational Assessment, Feedback, Foreign Countries, Formative Evaluation
Peer reviewedDornyei, Zoltan; Katona, Lucy – Language Testing, 1992
A total of 102 university English majors were administered 4 different language tests to form a General Language Proficiency measure against which the C-test was evaluated. Results confirmed its reliability and validity and also provided data on text difficulty/appropriateness, word structure, content, and different scoring methods. (13…
Descriptors: College Students, English (Second Language), Higher Education, Language Proficiency
Peer reviewedTalmir, Pinchas – Biochemical Education, 1991
Describes how multiple-choice items can be designed and used as an effective diagnostic tool by avoiding their pitfalls and by taking advantage of their potential benefits. The following issues are discussed: correct' versus best answers; construction of diagnostic multiple-choice items; the problem of guessing; the use of justifications of…
Descriptors: Biochemistry, Educational Research, Evaluation, Higher Education
Peer reviewedRomberg, Thomas A.; Wilson, Linda D. – Arithmetic Teacher, 1992
Examined 6 widely used grade-8 standardized tests for content, required processes, and level to determine their alignment with the 5-8 NCTM "Curriculum and Evaluation Standards." Concluded that these tests inadequately covered the 5-8 standards. A follow-up study examined items from newly developed and foreign tests to demonstrate the…
Descriptors: Achievement Tests, Educational Change, Elementary Education, Mathematics Achievement
Peer reviewedKim, Seock-Ho; Cohen, Allan S. – Applied Psychological Measurement, 1991
The exact and closed-interval area measures for detecting differential item functioning are compared for actual data from 1,000 African-American and 1,000 white college students taking a vocabulary test with items intentionally constructed to favor 1 set of examinees. No real differences in detection of biased items were found. (SLD)
Descriptors: Black Students, College Students, Comparative Testing, Equations (Mathematics)
Peer reviewedHerron, Carol – Modern Language Journal, 1994
Using 38 beginning-level university students of French, this study confirmed that student listening comprehension of a foreign language video would be facilitated by the use of an advance organizer consisting of several short sentences, written in French, that summarized chronologically the events in the video. Sample test items and answers are…
Descriptors: Advance Organizers, College Students, French, Higher Education
Peer reviewedCohen, Allan S.; Kim, Seock-Ho – Applied Measurement in Education, 1992
Studied effects of students' use of calculators with 2 experimental forms of a university mathematics test taken by 765 and 725 college students, respectively. Calculator effects are not found for overall scores but are seen for some individual items. Analysis at the item level makes the actual impact apparent. (SLD)
Descriptors: Calculators, College Students, Educational Technology, Equations (Mathematics)


