Slater, Sharon C.; Schaeffer, Gary A. – 1996
The General Computer Adaptive Test (CAT) of the Graduate Record Examinations (GRE) includes three operational sections that are separately timed and scored. A "no score" is reported if the examinee answers fewer than 80% of the items or if the examinee does not answer all of the items and leaves the section before time expires. The 80%…
Descriptors: Adaptive Testing, College Students, Computer Assisted Testing, Equal Education
Green, Bert F. – 1995
Setting performance standards is an area that different constituencies see quite differently. The choices of elements for a particular standard depend to a large extent on the purposes the standard is intended to serve. Standards can be used in certification, as predictors, as descriptors, and as motivators. While performance standards indicate…
Descriptors: Certification, Course Content, Cutting Scores, Elementary Secondary Education
Crooks, Terry – 1996
A recently developed model of validation (T. J. Crooks, M. T. Kane, and A. S. Cohen, 1996) is briefly outlined. It conceptualizes assessment as divided into a chain of eight linked stages: (1) administration; (2) scoring; (3) aggregation; (4) generalization; (5) extrapolation; (6) evaluation; (7) decision; and (8) impact. The model is then used to…
Descriptors: Decision Making, Educational Assessment, Foreign Countries, Models
Kirsch, Irwin S.; Mosenthal, Peter B. – 1988
Critical variables that underlie the performance of a national sample of young adults on a diverse set of document literacy tasks were identified. The final sample was 3,618 adults. The identification of these variables provides an important first step toward building a theoretical model that would systematically account for the constructs of…
Descriptors: Blacks, Cognitive Processes, Educational Attainment, Ethnic Groups
Gao, Xiaohong – 1996
The use of the Work Keys Listening and Writing Assessment, part of an assessment system of the generic employability skills of individuals, needs to be accompanied by systematic evaluation of its technical qualities. This study examined sampling variability and generalizability of Listening and Writing scores when multiple forms, raters, and…
Descriptors: Adults, Difficulty Level, Generalization, Job Skills
Northwest Regional Educational Lab., Portland, OR. Test Center. – 1996
This bibliography includes papers about grading and reporting, sample report card formats, training materials on grading and reporting, and first-person narratives from educators who have tried to reform the ways they grade students. The first section of this annotated bibliography is a listing of the articles in alphabetical order by primary…
Descriptors: Annotated Bibliographies, Educational Assessment, Elementary Secondary Education, Grades (Scholastic)
Deville, Craig W.; Chalhoub-Deville, Micheline – 1993
A study demonstrated the utility of item analyses to investigate which items function well or poorly in a second language reading recall protocol instrument. Data were drawn from a larger study of 56 learners of German as a second language at various proficiency levels. Pausal units of scored recall protocols were analyzed using both classical…
Descriptors: German, Item Analysis, Reading Comprehension, Reading Tests
De Champlain, Andre F.; Margolis, Melissa J.; Ross, Linette P.; Macmillan, Mary K.; Klass, Daniel J. – 1998
The purpose of the present investigation was to address several critical issues relating to setting a performance standard on a nationally administered standardized patient examination (SPX). The specific goals of the study were to: (1) compare pass/fail rates from this exercise to those of past studies undertaken with the same examination; (2)…
Descriptors: Clinical Experience, Higher Education, Interrater Reliability, Medical Education
Bay, Luz; Loomis, Susan Cooper; Wang, Tianyou – 1995
This study examines the effects on the National Assessment of Educational Progress (NAEP) achievement levels of using item response theory (IRT) models that have nominal missing-response parameters. It compares cutpoints based on item parameters that were fitted using two different models. The first set of cutpoints was based on parameters for…
Descriptors: Academic Achievement, Comparative Analysis, Cutting Scores, High School Seniors
Bay, Luz; Nering, Michael L. – 1998
The use of person-fit methods to determine the extent to which a panelist's ratings fit the item response theory (IRT) models used in the National Assessment of Educational Progress (NAEP) is demonstrated. Person-fit methods are statistical methods that allow the identification of nonfitting response vectors. To determine whether panelists'…
Descriptors: Academic Achievement, Geography, Goodness of Fit, High School Seniors
Glas, Cees A. W.; Beguin, Anton A. – 1996
Recently, L. Zeng and M. J. Kolen (1995) have introduced item response theory (IRT) observed score (OS) equating of number-correct (NC) scores for equating different forms of a test. In this paper, IRT-OS-NC equating is adapted to equating the cut-off scores of examinations. Next, the differences between results obtained using a Rasch model for…
Descriptors: Achievement Tests, Cutting Scores, Equated Scores, Foreign Countries
Riley, Stanley – 1992
The Riley Inventory of Basic Learning Skills (RIBLS) is a group test that assesses 12 learning process skills in 5 general modalities (visual, auditory, verbal, kinesthetic, and abstract). The RIBLS is intended for administration to groups or individuals from age 6 to adulthood in two levels: a lower level for those who cannot read or are under…
Descriptors: Adolescents, Adults, Basic Skills, Children
Zin, Than Than; Williams, John – 1991
Brief explanations are presented of some of the different methods used to score multiple-choice tests, and some studies of partial information, guessing strategies, and test-taking behaviors are reviewed. Studies are grouped in three categories of effort to improve scoring: (1) those that require extra effort from the examinee to answer…
Descriptors: Educational Research, Estimation (Mathematics), Guessing (Tests), Literature Reviews
Ankenmann, Robert D.; Stone, Clement A. – 1992
Effects of test length, sample size, and assumed ability distribution were investigated in a multiple replication Monte Carlo study under the 1-parameter (1P) and 2-parameter (2P) logistic graded model with five score levels. Accuracy and variability of item parameter and ability estimates were examined. Monte Carlo methods were used to evaluate…
Descriptors: Computer Simulation, Estimation (Mathematics), Item Bias, Mathematical Models
Goodison, Jules M. – 1991
Procedures used in securing the cooperation of states and schools for the 1990 Trial State Assessment Program (TSAP) of the National Assessment of Educational Progress (NAEP) are described. These include procedures for: (1) the 1989 Field Test; (2) the training of local administrators and quality control monitors; and (3) scoring and processing…
Descriptors: Academic Achievement, Data Collection, Grade 8, Junior High Schools


