Showing 196 to 210 of 505 results
van der Linden, Wim J. – 1981
It has often been argued that all techniques of standard setting are arbitrary and likely to yield different results for different techniques or persons. This paper deals with a related but hitherto ignored aspect of standard setting, namely, the possibility that Angoff or Nedelsky judges misspecify the probabilities of the borderline student's…
Descriptors: Error of Measurement, Evaluators, Foreign Countries, Latent Trait Theory
Poggio, John P.; And Others – 1982
Alternative group judgment approaches to setting minimum competency standards were compared. Replication of results was possible for eight different tests (reading and mathematics, across four grade levels). The Kansas Competency Based Tests in reading and mathematics were administered statewide to students in grades two, four, six, and eight.…
Descriptors: Academic Standards, Basic Skills, Elementary Education, Minimum Competencies
Hambleton, Ronald K.; And Others – 1979
Issues involved in standard setting along with methods for standard setting are reviewed, with specific reference to their relevance for criterion referenced testing. Definitions are given of continuum and state models, and traditional and normative standard setting procedures. Since continuum models are considered more appropriate for criterion…
Descriptors: Academic Standards, Criterion Referenced Tests, Cutting Scores, Elementary Secondary Education
Peer reviewed
Andrew, Barbara J.; Hecht, James T. – Educational and Psychological Measurement, 1976
Results suggest that different groups of judges do set similar examination standards when using the same procedure, and that the average of individual judgments does not differ significantly from group consensus judgments. Significant differences were found, however, between the standards set by the two procedures employed. (RC)
Descriptors: Comparative Analysis, Cutting Scores, Multiple Choice Tests, Pass Fail Grading
Peer reviewed
Beuk, Cees H. – Journal of Educational Measurement, 1984
A systematic method for compromise between absolute and relative examination standards is proposed. The passing score is assumed to be related to expected pass rate through a simple linear function. Results define a function relating the percentage of successful candidates given a specified passing score to the passing score. (Author/DWH)
Descriptors: Achievement Tests, Cutting Scores, Foreign Countries, Mathematical Models
Huynh, Huynh – 2000
Item mappings are widely used in educational assessment for applications such as test administration (through test form assembly and computer assisted testing) and for criterion-referenced (CR) interpretation of test scores or scale anchoring. Item mappings are also used to construct ordered item booklets in the CTB/McGraw Hill Bookmark standard…
Descriptors: Bayesian Statistics, Criterion Referenced Tests, Selection, Standard Setting (Scoring)
Huynh, Huynh – 2000
By noting that a Rasch or two parameter logistic (2PL) item belongs to the exponential family of random variables and that the probability density function (pdf) of the correct response (X=1) and the incorrect response (X=0) are symmetric with respect to the vertical line at the item location, it is shown that the conjugate prior for ability is…
Descriptors: Bayesian Statistics, Criterion Referenced Tests, Selection, Standard Setting (Scoring)
Verhelst, N. D.; Kaftandjieva, F. – 1999
A new method is proposed to set multiple standards in performance tests. The method combines three sources of information coming from three different data collections. The first is an empirical definition of mastery of an item; the second consists of parameter estimates of the items in an Item Response Theory (IRT) model, and the third source is a…
Descriptors: Cutting Scores, Data Collection, Foreign Countries, Item Response Theory
Peer reviewed
Smith, Richard M.; Gross, Leon J. – Journal of Outcome Measurement, 1997
Five forms of a basic science examination administered over three years in a national board testing program were analyzed to determine the stability of judged cut scores. Results indicate that cut scores derived from the modified Nedelsky procedure were within equating error of the Rasch equated cut scores over the five administrations. (SLD)
Descriptors: Cutting Scores, Equated Scores, Licensing Examinations (Professions), Science Tests
Peer reviewed
Longford, Nicholas T. – Journal of Educational and Behavioral Statistics, 1996
Data from two standard-setting exercises were analyzed using the logistic regression model that assumes no variation in severity of raters, and results were compared with those obtained by logistic regression that allowed for severity variation. Results illustrate the importance of taking between-rater differences into account. (SLD)
Descriptors: Cutting Scores, Decision Making, Evaluators, Individual Differences
Peer reviewed
Archbald, Doug – International Journal of Educational Reform, 1997
Clarifies key propositions of the central-curriculum-control model and presents teachers' beliefs about effects of centralized policies on practice, based on interviews with teachers from three urban districts. Curriculum-control policies contribute to content standardization and standard setting. They control what is taught, rather than how it is…
Descriptors: Centralization, Curriculum, Educational Policy, High Schools
Peer reviewed
Chang, Lei – Applied Measurement in Education, 1999
Compared the Nedelsky (L. Nedelsky, 1954) and Angoff (W. Angoff, 1971) standard-setting methods in three studies involving 80 graduate students as judges. Nedelsky cutscores were significantly lower than Angoff cutscores. Suggests that combining the strong features of both methods would make a better standard-setting procedure. (SLD)
Descriptors: Comparative Analysis, Cutting Scores, Graduate Students, Graduate Study
Peer reviewed
Engelhard, George, Jr.; Stone, Gregory E. – Educational and Psychological Measurement, 1998
A new approach based on Rasch measurement theory is described for examining the quality of ratings from standard-setting judges. Ratings of nine judges for 213 items on a nursing examination show that judges vary in their views of the essential items for nursing certification, with statistically significant variability in the judged essentiality…
Descriptors: Certification, Evaluation Methods, Item Response Theory, Judges
Peer reviewed
Erwin, T. Dary; Wise, Steven L. – New Directions for Institutional Research, 2001
Provides techniques for identifying a particular score, or standard, that differentiates student competence from non-competence. Outlines explicit procedures for developing appropriate standards that can assist institutions in resisting legal challenges when the results of testing are contested. (EV)
Descriptors: College Students, Competence, Competency Based Education, Court Litigation
Peer reviewed
Radwan, Nizam; Rogers, W. Todd – Alberta Journal of Educational Research, 2006
The recent increase in the use of constructed-response items in educational assessment and the dissatisfaction with the nature of the decision that the judges must make using traditional standard-setting methods created a need to develop new and effective standard-setting procedures for tests that include both multiple-choice and…
Descriptors: Criticism, Cutting Scores, Educational Assessment, Standard Setting (Scoring)