Descriptor
Source
Applied Measurement in… | 10 |
Author
Klein, Stephen P. | 2 |
Bell, Robert M. | 1 |
Chang, Lucy | 1 |
Clauser, Brian E. | 1 |
Clyman, Stephen G. | 1 |
Comfort, Kathy | 1 |
Crocker, Linda | 1 |
Dutka, Sue | 1 |
El-Bayoumi, Gigi | 1 |
Fuchs, Douglas | 1 |
Fuchs, Lynn S. | 1 |
More ▼ |
Publication Type
Journal Articles | 10 |
Reports - Evaluative | 7 |
Reports - Research | 3 |
Information Analyses | 1 |
Speeches/Meeting Papers | 1 |
Education Level
Audience
Location
Vermont | 1 |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating

Wolfe, Edward W.; Gitomer, Drew H. – Applied Measurement in Education, 2001
Attempted to improve the measurement quality of a complex performance assessment through principled assessment design using the example of the National Board for Professional Teaching Standards Early Childhood/Generalist examination. All indexes examined improved after revisions were made. Results show the importance of attention to assessment…
Descriptors: Change, Performance Based Assessment, Psychometrics, Scores

Miller, M. David; Crocker, Linda – Applied Measurement in Education, 1990
This review of methods for validating writing assessments was conceptualized within a framework suggested by S. Messick (1989) that included five operational components of construct validation: (1) content representativeness; (2) structural fidelity; (3) nomological validity; (4) criterion-related validity; and (5) nomothetic span. (SLD)
Descriptors: Construct Validity, Content Validity, Elementary Secondary Education, Performance Based Assessment

Fuchs, Lynn S.; Fuchs, Douglas; Karns, Kathy; Hamlett, Carol L.; Dutka, Sue; Katzaroff, Michelle – Applied Measurement in Education, 2000
Examined the effects of providing students with background information about the structure and scoring of mathematics performance assessments (PA). Results for 187 elementary school students who had PA orientation and 182 who did not show the effects of test wiseness training for average and above-average students, but not for below-average…
Descriptors: Background, Elementary Education, Elementary School Students, Mathematics

Hardy, Roy A. – Applied Measurement in Education, 1995
Cost factors associated with the development, administration, and scoring of performance assessment tasks are examined in the context of a statewide or other large-scale assessment program. Resources of money, time, and expertise are discussed. (SLD)
Descriptors: Cost Estimates, Costs, Educational Assessment, Estimation (Mathematics)

Welch, Catherine; Hoover, H. D. – Applied Measurement in Education, 1993
Methodology is suggested for several statistical procedures to detect polytomously scored items that function differently for two subgroups of examinees. The 3 methods are alternative ways of combining the data from 2 x "k" tables. Simulation results demonstrate the superiority of two of the methods, designated HW1 and HW3. (SLD)
Descriptors: Computer Simulation, Effect Size, Equations (Mathematics), Estimation (Mathematics)

Klein, Stephen P.; And Others – Applied Measurement in Education, 1995
Portfolios are the centerpiece of Vermont's statewide assessment program in mathematics. Portfolio scores in the first two years were not reliable enough to permit the reporting of student-level results, but increasing the number of readers or the number of portfolio pieces is not operationally feasible. (SLD)
Descriptors: Educational Assessment, Elementary Secondary Education, Mathematics Tests, Performance Based Assessment

Klein, Stephen P.; Stecher, Brian M.; Shavelson, Richard J.; McCaffrey, Daniel; Ormseth, Tor; Bell, Robert M.; Comfort, Kathy; Othman, Abdul R. – Applied Measurement in Education, 1998
Two studies involving 368 elementary and high school students and 29 readers were conducted to investigate reader consistency, score reliability, and reader time requirements of three hands-on science performance tasks. Holistic scores were as reliable as analytic scores, and there was a high correlation between them after they were disattenuated…
Descriptors: Elementary School Students, Elementary Secondary Education, Hands on Science, High School Students

Clauser, Brian E.; Ross, Linette P.; Clyman, Stephen G.; Rose, Kathie M.; Margolis, Melissa J.; Nungester, Ronald J.; Piemme, Thomas E.; Chang, Lucy; El-Bayoumi, Gigi; Malakoff, Gary L.; Pincetl, Pierre S. – Applied Measurement in Education, 1997
Describes an automated scoring algorithm for a computer-based simulation examination of physicians' patient-management skills. Results with 280 medical students show that scores produced using this algorithm are highly correlated to actual clinician ratings. Scores were also effective in discriminating between case performance judged passing or…
Descriptors: Algorithms, Computer Assisted Testing, Computer Simulation, Evaluators

Hambleton, Ronald K.; Slater, Sharon C. – Applied Measurement in Education, 1997
A brief history of developments in the assessment of the reliability of credentialing examinations is presented, and some new results are outlined that highlight the interactions among scoring, standard setting, and the reliability and validity of pass-fail decisions. Decision consistency is an important concept in evaluating credentialing…
Descriptors: Certification, Credentials, Decision Making, Interaction

Goldberg, Gail Lynn; Kapinus, Barbara – Applied Measurement in Education, 1993
Using responses of 123 elementary school teachers, a battery of performance-assessment tasks designed to generate responses to reading tests was evaluated from task development and scoring perspectives. More than one dozen types of errors were identified. Practical outcomes of the study and improvement of task development and scoring are…
Descriptors: Educational Assessment, Educational Practices, Elementary Education, Elementary School Teachers