Publication Date
| In 2026 | 0 |
| Since 2025 | 197 |
| Since 2022 (last 5 years) | 1067 |
| Since 2017 (last 10 years) | 2577 |
| Since 2007 (last 20 years) | 4938 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 653 |
| Teachers | 563 |
| Researchers | 250 |
| Students | 201 |
| Administrators | 81 |
| Policymakers | 22 |
| Parents | 17 |
| Counselors | 8 |
| Community | 7 |
| Support Staff | 3 |
| Media Staff | 1 |
| More ▼ | |
Location
| Turkey | 225 |
| Canada | 223 |
| Australia | 155 |
| Germany | 116 |
| United States | 99 |
| China | 90 |
| Florida | 86 |
| Indonesia | 82 |
| Taiwan | 78 |
| United Kingdom | 73 |
| California | 65 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 4 |
| Meets WWC Standards with or without Reservations | 4 |
| Does not meet standards | 1 |
Peer reviewedDodd, Barbara G. – Applied Psychological Measurement, 1990
Using one simulated and two real data sets, the effects of the systematic variation of the item-selection procedure and the stepsize method on the operating characteristics of computerized adaptive testing (CAT) for instruments with polychotomously scored rating scale items were studied. The six rating scale CAT procedures used performed well.…
Descriptors: Adaptive Testing, Attitude Measures, Comparative Analysis, Computer Assisted Testing
Peer reviewedKoch, William R.; And Others – Measurement and Evaluation in Counseling and Development, 1990
Implemented computerized adaptive testing (CAT) to measure students' attitudes toward alcohol. Administered a paper-and-pencil version and a CAT version of an attitudes toward alcohol scale to 113 undergraduates enrolled in health education classes. Findings showed a high correlation between scores from the CAT and the paper-and-pencil versions.…
Descriptors: Adaptive Testing, College Students, Computer Assisted Testing, Drinking
Peer reviewedCase, Susan M.; Swanson, David B. – Teaching and Learning in Medicine, 1993
Extended matching, a test item format used currently in medical licensing examinations, is described. Procedures for writing and reviewing such test items are outlined, test development and psychometric advantages are discussed, and issues in test administration and scoring are examined. The extended matching form is also seen as having uses for…
Descriptors: Clinical Diagnosis, Decision Making, Higher Education, Licensing Examinations (Professions)
Peer reviewedBudescu, David; Bar-Hillel, Maya – Journal of Educational Measurement, 1993
Test taking and scoring are examined from the normative and descriptive perspectives of judgment and decision theory. The number-right scoring rule is endorsed because it discourages omissions and is robust against variability in respondent motivations, item vagaries, and limitations in judgments of uncertainty. (SLD)
Descriptors: Elementary Secondary Education, Guessing (Tests), Knowledge Level, Multiple Choice Tests
Peer reviewedBridgeman, Brent; Rock, Donald A. – Journal of Educational Measurement, 1993
Exploratory and confirmatory factor analyses were used to explore relationships among existing item types and three new computer-administered item types for the analytical scale of the Graduate Record Examination General Test. Results with 349 students indicate constructs the item types are measuring. (SLD)
Descriptors: College Entrance Examinations, College Students, Comparative Testing, Computer Assisted Testing
Peer reviewedDeMars, Christine E. – Applied Measurement in Education, 1998
Scores from mathematics (tested at 102 schools) and science (tested at 99 schools) sections of pilot forms of the Michigan High School Proficiency Test were examined for interaction between gender and response format (multiple choice or constructed response). Overall, neither males nor females seemed to be disadvantaged by item format. (SLD)
Descriptors: Constructed Response, High School Students, High Schools, Mathematics Tests
Peer reviewedFerrara, Steven; Huynh, Huynh; Michaels, Hillary – Journal of Educational Measurement, 1999
Provides hypothesized explanations for local item dependence (LID) in a large-scale hands-on science performance assessment involving approximately 55,000 students each at grades 3, 5, and 8. Items that appear to elicit locally dependent responses require examinees to answer and explain their answers or to use given or generalized information to…
Descriptors: Context Effect, Elementary Education, Hands on Science, Junior High Schools
Peer reviewedJafarpur, A. – System, 1999
Examines whether a defect of the C-test can be avoided by constructing a C-test with five texts and 126 items. The test was tried with 146 Iranian English majors. On the basis of item analysis, a tailored C-test with 100 items was developed and tried with 60 other subjects. Results show no gains were made with the classical item analysis.…
Descriptors: College Students, English (Second Language), Higher Education, Item Analysis
Peer reviewedHolahan, John M.; Saunders, T. Clark – Bulletin of the Council for Research in Music Education, 1997
Investigates two problems: (1) do learning effects accrue in accuracy or response time when computerized tests are administered in two sessions? and (2) what are the effects of tonal pattern order and contour types on average item difficulty and length of response time for children with different levels of achievement? (DSK)
Descriptors: Auditory Perception, Children, Cognitive Processes, Computer Assisted Testing
Peer reviewedImpara, James C.; Plake, Barbara S. – Journal of Educational Measurement, 1998
Sixth-grade teachers (n=26) estimated item performance for their students (724 total students) on a 50-item district-wide science test. Teachers were more accurate in estimating performance of the total group than of the borderline group, but in neither case was their accuracy high. Estimating proportion-correct values using the Angoff standard…
Descriptors: Difficulty Level, Elementary School Teachers, Grade 6, Intermediate Grades
Peer reviewedSchwarz, Richard D. – Applied Measurement in Education, 1998
Referral, placement, and retention decisions were analyzed using item response theory (IRT) to study whether classification decisions could be placed on the latent continuum of ability normally associated with test items and to study the existence of classification differential item functioning. Results with 352 kindergarten children demonstrate…
Descriptors: Ability, Classification, Decision Making, Grade Repetition
Peer reviewedBonbright, Jane M.; McGreevy-Nichols, Susan – Arts Education Policy Review, 1999
Reports on the data gleaned from the survey on dance education administered simultaneously with the 1997 National Assessment of Educational Progress (NAEP) arts assessments. Presents the process and problems of developing and implementing assessments in dance. Considers the value of the assessments to dance and offers recommendations for the…
Descriptors: Advocacy, Art Education, Dance Education, Educational Testing
Peer reviewedFan, Xitao; And Others – Educational and Psychological Measurement, 1996
Applying 2 different models of test construction to test results for a pool of more than 190,000 high school students found no systematic bias against groups with smaller or no representation in the test standardization sample. These results support the integrity of widely used sampling and item selection procedures. (SLD)
Descriptors: Culture Fair Tests, Ethnic Groups, High School Students, High Schools
Peer reviewedSnetzler, Suzi; Qualls, Audrey L. – Educational and Psychological Measurement, 2000
Examined the incidence of differential item functioning (DIF) on 3 subtests of the Iowa Tests of Basic Skills using test scores for 2,867 Alaskan students, characterized as "Native" or White at fourth and sixth grades or sixth and eighth grades. Effect size differences favoring whites were larger when students of equal English…
Descriptors: Achievement Tests, Alaska Natives, Item Bias, Limited English Speaking
Peer reviewedGarden, Robert A. – Studies in Educational Evaluation, 1999
Describes the development of the performance assessment tasks of the Third International Mathematics and Science Study. The challenge was to produce tasks that would measure the achievement of curricular objectives while being sufficiently reliable to allow comparisons between countries and of groups within countries. (SLD)
Descriptors: Comparative Analysis, Elementary Secondary Education, Foreign Countries, International Education


