Showing 1 to 15 of 79 results
Peer reviewed
Lahner, Felicitas-Maria; Lörwald, Andrea Carolin; Bauer, Daniel; Nouns, Zineb Miriam; Krebs, René; Guttormsen, Sissel; Fischer, Martin R.; Huwendiek, Sören – Advances in Health Sciences Education, 2018
Multiple true-false (MTF) items are a widely used supplement to the commonly used single-best answer (Type A) multiple choice format. However, an optimal scoring algorithm for MTF items has not yet been established, as existing studies yielded conflicting results. Therefore, this study analyzes two questions: What is the optimal scoring algorithm…
Descriptors: Scoring Formulas, Scoring Rubrics, Objective Tests, Multiple Choice Tests
Peer reviewed
Seo, Hyojeong; Shogren, Karrie A.; Wehmeyer, Michael L.; Hughes, Carolyn; Thompson, James R.; Little, Todd D.; Palmer, Susan B. – Career Development and Transition for Exceptional Individuals, 2016
This study examined similarities and differences in measurement properties and score comparability of the "Supports Intensity Scale-Adult Version" (16-64 years) and the "Supports Intensity Scale-Children's Version" (5-16 years). Data were collected from 142 adolescents with intellectual disability with both versions of the…
Descriptors: Adolescents, Intellectual Disability, Special Needs Students, Transitional Programs
Peer reviewed
Murray, Keith B.; Zdravkovic, Srdan – Journal of Education for Business, 2016
Considerable debate continues regarding the efficacy of the website RateMyProfessors.com (RMP). To date, however, virtually no direct, experimental research has been reported which directly bears on questions relating to sampling adequacy or item adequacy in producing what favorable correlations have been reported. The authors compare the data…
Descriptors: Computer Assisted Testing, Computer Software Evaluation, Student Evaluation of Teacher Performance, Item Analysis
Seo, Hyojeong; Shogren, Karrie A.; Wehmeyer, Michael L.; Hughes, Carolyn; Thompson, James R.; Little, Todd D.; Palmer, Susan B. – Grantee Submission, 2016
This study examined similarities and differences in measurement properties and score comparability of the "Supports Intensity Scale-Adult Version" (16-64 years) and the "Supports Intensity Scale-Children's Version" (5-16 years). Data were collected from 142 adolescents with intellectual disability with both versions of the…
Descriptors: Adolescents, Intellectual Disability, Special Needs Students, Transitional Programs
Peer reviewed
Kaderavek, Joan N.; Guo, Ying; Justice, Laura M. – Journal of Research in Reading, 2014
The present study investigates the validity of a 4-point rating scale used to measure the level of preschool children's orientation to literacy during shared book reading. Validity was explored by (a) comparing the children's level of literacy orientation as measured with the "Children's Orientation to Book Reading Rating Scale" (COB)…
Descriptors: Reading Habits, Reading Interests, Rating Scales, Test Validity
Peer reviewed
Glass, Arnold Lewis; Sinha, Neha – Educational Psychology, 2013
In the context of an upper-level psychology course, even when students were given an opportunity to refer to text containing the answers and change their exam responses in order to improve their exam scores, their performance on these questions improved slightly or not at all. Four experiments evaluated competing explanations for the students'…
Descriptors: Academic Achievement, Item Analysis, Test Norms, Comparative Testing
Peer reviewed
Laprise, Shari L. – College Teaching, 2012
Successful exam composition can be a difficult task. Exams should not only assess student comprehension, but be learning tools in and of themselves. In a biotechnology course delivered to nonmajors at a business college, objective multiple-choice test questions often require students to choose the exception or "not true" choice. Anecdotal student…
Descriptors: Feedback (Response), Test Items, Multiple Choice Tests, Biotechnology
Peer reviewed
Taherbhai, Husein; Seo, Daeryong; Bowman, Trinell – British Educational Research Journal, 2012
Literature in the United States provides many examples of no difference in student achievement when measured against the mode of test administration, i.e., paper-pencil and online versions of the test. However, most of this research centres on "regular" students who do not require differential teaching methods or different evaluation…
Descriptors: Learning Disabilities, Statistical Analysis, Teaching Methods, Test Format
Peer reviewed
Sparfeldt, Jorn R.; Kimmel, Rumena; Lowenkamp, Lena; Steingraber, Antje; Rost, Detlef H. – Educational Assessment, 2012
Multiple-choice (MC) reading comprehension test items comprise three components: text passage, questions about the text, and MC answers. The construct validity of this format has been repeatedly criticized. In three between-subjects experiments, fourth graders (N₁ = 230, N₂ = 340, N₃ = 194) worked on three…
Descriptors: Test Items, Reading Comprehension, Construct Validity, Grade 4
Peer reviewed
Stone, Gregory Ethan; Koskey, Kristin L. K.; Sondergeld, Toni A. – Educational and Psychological Measurement, 2011
Typical validation studies on standard setting models, most notably the Angoff and modified Angoff models, have ignored construct development, a critical aspect associated with all conceptualizations of measurement processes. Stone compared the Angoff and objective standard setting (OSS) models and found that Angoff failed to define a legitimate…
Descriptors: Cutting Scores, Standard Setting (Scoring), Models, Construct Validity
Peer reviewed
Hatcher, Donald L. – New Directions for Institutional Research, 2011
In this article, after describing one approach for teaching critical thinking (CT) that was in place at Baker University from 1990 to 2008, the author describes their experience assessing CT using three standardized exams and shows why the choice of a standardized CT test can be problematic and the results misleading. These results can be…
Descriptors: Test Results, Essay Tests, Critical Thinking, Thinking Skills
Zhang, Bin – ProQuest LLC, 2012
Social scientists are usually more interested in consumers' dichotomous choices, such as whether to purchase a product or adopt a technology. However, to date, there is almost no model that can help solve the problem of multi-network effects comparison with a dichotomous dependent variable. Furthermore, the study of multi-network…
Descriptors: Social Networks, Network Analysis, Comparative Analysis, Population Groups
Anderson, Stephen A. – Online Submission, 2010
This paper summarizes an action research project to develop a math screening instrument that would be effective (valid and reliable) and efficient (time for administration). An instrument was developed after review of the mathematics assessment and mathematics disabilities literature. The instrument was administered to kindergarten, first, and…
Descriptors: Action Research, Achievement Tests, Kindergarten, Grade 2
Peer reviewed
Ricketts, Chris; Brice, Julie; Coombes, Lee – Advances in Health Sciences Education, 2010
The purpose of multiple choice tests of medical knowledge is to estimate as accurately as possible a candidate's level of knowledge. However, concern is sometimes expressed that multiple choice tests may also discriminate in undesirable and irrelevant ways, such as between minority ethnic groups or by sex of candidates. There is little literature…
Descriptors: Medical Students, Testing Accommodations, Ethnic Groups, Learning Disabilities
Peer reviewed
Kim, Sooyeon; Walker, Michael E.; McHale, Frederick – Journal of Educational Measurement, 2010
In this study we examined variations of the nonequivalent groups equating design for tests containing both multiple-choice (MC) and constructed-response (CR) items to determine which design was most effective in producing equivalent scores across the two tests to be equated. Using data from a large-scale exam, this study investigated the use of…
Descriptors: Measures (Individuals), Scoring, Equated Scores, Test Bias