ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	3
Since 2016 (last 10 years)	3
Since 2006 (last 20 years)	18

Source

Educational and Psychological…

Publication Type

Journal Articles	35
Reports - Research	24
Reports - Evaluative	11
Speeches/Meeting Papers	3
Reports - Descriptive	2
Opinion Papers	1

Education Level

Elementary Secondary Education	3
Elementary Education	2
Higher Education	2
Secondary Education	2
Grade 11	1
Grade 3	1
Grade 4	1
Grade 5	1
Grade 7	1
Grade 8	1
High Schools	1
Intermediate Grades	1
Junior High Schools	1
Middle Schools	1
Postsecondary Education	1
Primary Education	1
More ▼

Audience

Location

Australia	1
Canada	1

Laws, Policies, & Programs

No Child Left Behind Act 2001

Assessments and Surveys

Advanced Placement…	1
National Assessment of…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 42 results Save | Export

The Importance of Thinking Multivariately When Setting Subscale Cutoff Scores

Peer reviewed

Direct link

Kroc, Edward; Olvera Astivia, Oscar L. – Educational and Psychological Measurement, 2022

Setting cutoff scores is one of the most common practices when using scales to aid in classification purposes. This process is usually done univariately where each optimal cutoff value is decided sequentially, subscale by subscale. While it is widely known that this process necessarily reduces the probability of "passing" such a test,…

Descriptors: Multivariate Analysis, Cutting Scores, Classification, Measurement

The Response Vector for Mastery Method of Standard Setting

Peer reviewed

Direct link

Dimitrov, Dimiter M. – Educational and Psychological Measurement, 2022

Proposed is a new method of standard setting referred to as response vector for mastery (RVM) method. Under the RVM method, the task of panelists that participate in the standard setting process does not involve conceptualization of a borderline examinee and probability judgments as it is the case with the Angoff and bookmark methods. Also, the…

Descriptors: Standard Setting (Scoring), Cutting Scores, Computation, Mastery Learning

The Poor Fit of Model Fit for Selecting Number of Factors in Exploratory Factor Analysis for Scale Evaluation

Peer reviewed

Direct link

Montoya, Amanda K.; Edwards, Michael C. – Educational and Psychological Measurement, 2021

Model fit indices are being increasingly recommended and used to select the number of factors in an exploratory factor analysis. Growing evidence suggests that the recommended cutoff values for common model fit indices are not appropriate for use in an exploratory factor analysis context. A particularly prominent problem in scale evaluation is the…

Descriptors: Goodness of Fit, Factor Analysis, Cutting Scores, Correlation

Examining Rounding Rules in Angoff-Type Standard-Setting Methods

Peer reviewed

Direct link

Wyse, Adam E.; Reckase, Mark D. – Educational and Psychological Measurement, 2012

This study investigates how different rounding rules and ways of providing Angoff standard-setting judgments affect cut-scores. A simulation design based on data from the National Assessment of Education Progress was used to investigate how rounding judgments to the nearest whole number (e.g., 0, 1, 2, etc.), nearest 0.05, or nearest two decimal…

Descriptors: Standard Setting, Cutting Scores, Statistical Bias, Numbers

A Body of Work Standard-Setting Method with Construct Maps

Peer reviewed

Direct link

Wyse, Adam E.; Bunch, Michael B.; Deville, Craig; Viger, Steven G. – Educational and Psychological Measurement, 2014

This article describes a novel variation of the Body of Work method that uses construct maps to overcome problems of transparency, rater inconsistency, and scores gaps commonly occurring with the Body of Work method. The Body of Work method with construct maps was implemented to set cut-scores for two separate K-12 assessment programs in a large…

Descriptors: Standard Setting (Scoring), Educational Assessment, Elementary Secondary Education, Measurement

Using the Many-Faceted Rasch Model to Evaluate Standard Setting Judgments: An Illustration with the Advanced Placement Environmental Science Exam

Peer reviewed

Direct link

Kaliski, Pamela K.; Wind, Stefanie A.; Engelhard, George, Jr.; Morgan, Deanna L.; Plake, Barbara S.; Reshetar, Rosemary A. – Educational and Psychological Measurement, 2013

The many-faceted Rasch (MFR) model has been used to evaluate the quality of ratings on constructed response assessments; however, it can also be used to evaluate the quality of judgments from panel-based standard setting procedures. The current study illustrates the use of the MFR model for examining the quality of ratings obtained from a standard…

Descriptors: Item Response Theory, Models, Standard Setting (Scoring), Science Tests

Polytomous Adaptive Classification Testing: Effects of Item Pool Size, Test Termination Criterion, and Number of Cutscores

Peer reviewed

Direct link

Gnambs, Timo; Batinic, Bernad – Educational and Psychological Measurement, 2011

Computer-adaptive classification tests focus on classifying respondents in different proficiency groups (e.g., for pass/fail decisions). To date, adaptive classification testing has been dominated by research on dichotomous response formats and classifications in two groups. This article extends this line of research to polytomous classification…

Descriptors: Test Length, Computer Assisted Testing, Classification, Test Items

Reducing the Cognitive Complexity Associated with Standard Setting: A Comparison of the Single-Passage Bookmark and Yes/No Methods

Peer reviewed

Direct link

Skaggs, Gary; Hein, Serge F. – Educational and Psychological Measurement, 2011

Judgmental standard setting methods have been criticized for the cognitive complexity of the judgment task that panelists are asked to complete. This study compared two methods designed to reduce this complexity: the yes/no method and the single-passage bookmark method. Two mock standard setting panel meetings were convened, one for each method,…

Descriptors: Standard Setting (Scoring), Methods, Cutting Scores, Experienced Teachers

Computerized Classification Testing under the Generalized Graded Unfolding Model

Peer reviewed

Direct link

Wang, Wen-Chung; Liu, Chen-Wei – Educational and Psychological Measurement, 2011

The generalized graded unfolding model (GGUM) has been recently developed to describe item responses to Likert items (agree-disagree) in attitude measurement. In this study, the authors (a) developed two item selection methods in computerized classification testing under the GGUM, the current estimate/ability confidence interval method and the cut…

Descriptors: Computer Assisted Testing, Adaptive Testing, Classification, Item Response Theory

Computerized Classification Testing under the One-Parameter Logistic Response Model with Ability-Based Guessing

Peer reviewed

Direct link

Wang, Wen-Chung; Huang, Sheng-Yun – Educational and Psychological Measurement, 2011

The one-parameter logistic model with ability-based guessing (1PL-AG) has been recently developed to account for effect of ability on guessing behavior in multiple-choice items. In this study, the authors developed algorithms for computerized classification testing under the 1PL-AG and conducted a series of simulations to evaluate their…

Descriptors: Computer Assisted Testing, Classification, Item Analysis, Probability

Peer reviewed

Direct link

Wyse, Adam E. – Educational and Psychological Measurement, 2011

Standard setting is a method used to set cut scores on large-scale assessments. One of the most popular standard setting methods is the Bookmark method. In the Bookmark method, panelists are asked to envision a response probability (RP) criterion and move through a booklet of ordered items based on a RP criterion. This study investigates whether…

Descriptors: Testing Programs, Standard Setting (Scoring), Cutting Scores, Probability

Comparing Construct Definition in the Angoff and Objective Standard Setting Models: Playing in a House of Cards without a Full Deck

Peer reviewed

Direct link

Stone, Gregory Ethan; Koskey, Kristin L. K.; Sondergeld, Toni A. – Educational and Psychological Measurement, 2011

Typical validation studies on standard setting models, most notably the Angoff and modified Angoff models, have ignored construct development, a critical aspect associated with all conceptualizations of measurement processes. Stone compared the Angoff and objective standard setting (OSS) models and found that Angoff failed to define a legitimate…

Descriptors: Cutting Scores, Standard Setting (Scoring), Models, Construct Validity

Item Selection in Computerized Classification Testing

Peer reviewed

Direct link

Thompson, Nathan A. – Educational and Psychological Measurement, 2009

Several alternatives for item selection algorithms based on item response theory in computerized classification testing (CCT) have been suggested, with no conclusive evidence on the substantial superiority of a single method. It is argued that the lack of sizable effect is because some of the methods actually assess items very similarly through…

Descriptors: Item Response Theory, Psychoeducational Methods, Cutting Scores, Simulation

Estimating Trends from Censored Assessment Data under No Child Left Behind

Peer reviewed

Direct link

Furgol, Katherine E.; Ho, Andrew D.; Zimmerman, Dale L. – Educational and Psychological Measurement, 2010

Under the No Child Left Behind Act, large-scale test score trend analyses are widespread. These analyses often gloss over interesting changes in test score distributions and involve unrealistic assumptions. Further complications arise from analyses of unanchored, censored assessment data, or proportions of students lying within performance levels…

Descriptors: Trend Analysis, Sample Size, Federal Legislation, Simulation

Estimating Standard Errors of Cut Scores for Item Rating and Mapmark Procedures: A Generalizability Theory Approach

Peer reviewed

Direct link

Yin, Ping; Sconing, James – Educational and Psychological Measurement, 2008

Standard-setting methods are widely used to determine cut scores on a test that examinees must meet for a certain performance standard. Because standard setting is a measurement procedure, it is important to evaluate variability of cut scores resulting from the standard-setting process. Generalizability theory is used in this study to estimate…

Descriptors: Generalizability Theory, Standard Setting, Cutting Scores, Test Items

Previous Page | Next Page »

Pages: 1 | 2 | 3

Wyse, Adam E.	3
Behuniak, Peter, Jr.	2
Hurtz, Gregory M.	2
Plake, Barbara S.	2
Wang, Wen-Chung	2
Andrew, Barbara J.	1
Andrulis, Richard S.	1
Auerbach, Meredith A.	1
Batinic, Bernad	1
Brennan, Robert L.	1
Bunch, Michael B.	1
Clarke, David M.	1
Cooper-Hakim, Amy	1
Deville, Craig	1
Dimitrov, Dimiter M.	1
Edwards, Michael C.	1
Engelhard, George, Jr.	1
Fehrmann, Melinda L.	1
Ferdous, Abdullah A.	1
Furgol, Katherine E.	1
Gnambs, Timo	1
Halpin, Gerald	1
Hambleton, Ronald K.	1
Harasym, P. H.	1
More ▼

Cutting Scores	42
Standard Setting (Scoring)	15
Comparative Analysis	8
Classification	7
Criterion Referenced Tests	7
Item Response Theory	7
Test Items	7
Error of Measurement	6
Computer Assisted Testing	5
Interrater Reliability	5
Multiple Choice Tests	5
Probability	5
Simulation	5
Test Reliability	5
Generalizability Theory	4
Item Analysis	4
Monte Carlo Methods	4
Reading Tests	4
Scores	4
Scoring	4
Scoring Formulas	4
Standard Setting	4
Test Construction	4
Achievement Tests	3
Computer Programs	3
More ▼