Thayaamol Upapong; Apantee Poonputta – Educational Process: International Journal, 2025
Background/purpose: The purposes of this research are to develop a reliable and valid assessment tool for measuring systems thinking skills in upper primary students in Thailand and to establish a normative criterion for evaluating their systems thinking abilities based on educational standards. Materials/methods: The study followed a three-phase…
Descriptors: Thinking Skills, Elementary School Students, Measures (Individuals), Foreign Countries
Wind, Stefanie A.; Walker, A. Adrienne – Educational Measurement: Issues and Practice, 2021
Many large-scale performance assessments include score resolution procedures for resolving discrepancies in rater judgments. The goal of score resolution is conceptually similar to person fit analyses: To identify students for whom observed scores may not accurately reflect their achievement. Previously, researchers have observed that…
Descriptors: Goodness of Fit, Performance Based Assessment, Evaluators, Decision Making
Cesur, Kursat – Educational Policy Analysis and Strategic Research, 2019
Examinees' performances are assessed using a wide variety of different techniques. Multiple-choice (MC) tests are among the most frequently used ones. Nearly all standardized achievement tests make use of MC test items and there is a variety of ways to score these tests. The study compares number right and liberal scoring (SAC) methods. Mixed…
Descriptors: Multiple Choice Tests, Scoring, Evaluation Methods, Guessing (Tests)
Caspari-Sadeghi, Sima; Mille, Elena; Epperlein, Hella; Forster-Heinlein, Brigitte – Mathematics Teaching Research Journal, 2022
This collaborative action research highlights the need for developing students' evaluative competence and self-reflection by embedding self-and-peer assessment into online instruction. Over the course of a semester in an online master program in mathematics and computer sciences, students conducted research on assigned topics, held presentations,…
Descriptors: Graduate Students, Masters Programs, College Mathematics, Mathematics Education
Koskey, Kristin L. K.; Makki, Nidaa; Ahmed, Wondimu; Garafolo, Nicholas G.; Visco, Donald P., Jr. – School Science and Mathematics, 2020
Integrating engineering into the K-12 science curriculum continues to be a focus in national reform efforts in science education. Although there is an increasing interest in research in and practice of integrating engineering in K-12 science education, to date only a few studies have focused on the development of an assessment tool to measure…
Descriptors: Middle School Students, Engineering, Design, Science Education
Thompson, Andrew R.; O'Loughlin, Valerie D. – Anatomical Sciences Education, 2015
Bloom's taxonomy is a resource commonly used to assess the cognitive level associated with course assignments and examination questions. Although widely utilized in educational research, Bloom's taxonomy has received limited attention as an analytical tool in the anatomical sciences. Building on previous research, the Blooming Anatomy Tool (BAT)…
Descriptors: Anatomy, Classification, Scoring Rubrics, Multiple Choice Tests
Gorbunova, Tatiana N. – European Journal of Contemporary Education, 2017
The subject of the research is to build methodologies to evaluate the student knowledge by testing. The author points to the importance of feedback about the mastering level in the learning process. Testing is considered as a tool. The object of the study is to create the test system models for defence practice problems. Special attention is paid…
Descriptors: Testing, Evaluation Methods, Feedback (Response), Simulation
Mahmud, Jumailiyah; Sutikno, Muzayanah; Naga, Dali S. – Educational Research and Reviews, 2016
The aim of this study is to determine the variance difference between maximum likelihood and expected a posteriori estimation methods viewed from the number of test items of an aptitude test. The variance presents an accuracy generated by both maximum likelihood and Bayes estimation methods. The test consists of three subtests, each with 40 multiple-choice…
Descriptors: Maximum Likelihood Statistics, Computation, Item Response Theory, Test Items
Sparks, Jesse R.; Katz, Irvin R.; Beile, Penny M. – ETS Research Report Series, 2016
Digital information literacy (DIL)--generally defined as the ability to obtain, understand, evaluate, and use information in a variety of digital technology contexts--is a critically important skill deemed necessary for success in higher education as well as in the global networked economy. To determine whether college graduates possess the…
Descriptors: Technological Literacy, Information Literacy, Higher Education, Definitions
Wilcox, Bethany R.; Pollock, Steven J. – Physical Review Special Topics - Physics Education Research, 2015
Standardized conceptual assessment represents a widely used tool for educational researchers interested in student learning within the standard undergraduate physics curriculum. For example, these assessments are often used to measure student learning across educational contexts and instructional strategies. However, to support the large-scale…
Descriptors: Science Instruction, Scientific Concepts, College Science, Physics
Herman, Geoffrey L.; Zilles, Craig; Loui, Michael C. – Computer Science Education, 2014
Concept inventories hold tremendous promise for promoting the rigorous evaluation of teaching methods that might remedy common student misconceptions and promote deep learning. The measurements from concept inventories can be trusted only if the concept inventories are evaluated both by expert feedback and statistical scrutiny (psychometric…
Descriptors: Psychometrics, Concept Formation, Measures (Individuals), Teaching Methods
Barniol, Pablo; Zavala, Genaro – Physical Review Special Topics - Physics Education Research, 2014
In this article we discuss the findings of our research on students' understanding of vector concepts in problems without physical context. First, we develop a complete taxonomy of the most frequent errors made by university students when learning vector concepts. This study is based on the results of several test administrations of open-ended…
Descriptors: Multiple Choice Tests, Geometric Concepts, Algebra, Psychometrics
Karimi, Lotfollah; Mehrdad, Ali Gholami – Higher Education Studies, 2012
This study has attempted to investigate the administered written tests in the language department of Islamic Azad University of Hamedan, Iran from validity, practicality and reliability points of view. To this end two steps were taken. First, examining 112 tests, we found that the face validity of 50 tests had been threatened, 9 tests lacked…
Descriptors: Foreign Countries, English (Second Language), Second Language Instruction, Multiple Choice Tests
Kettler, Ryan J.; Elliott, Stephen N.; Kurz, Alexander; Zigmond, Naomi; Lemons, Christopher J.; Kloo, Amanda; Shrago, Jacqueline; Beddow, Peter A.; Williams, Leila; Bruen, Charles; Lupp, Lynda; Farmer, Jeanie; Mosiman, Melanie – Assessment for Effective Intervention, 2014
Motivated by the multiple-measures clause of recent federal policy regarding student eligibility for alternate assessments based on modified academic achievement standards (AA-MASs), this study examined how scores or combinations of scores from a diverse set of assessments predicted students' end-of-year proficiency status on statewide achievement…
Descriptors: Eligibility, Alternative Assessment, Academic Achievement, Predictive Validity
Clarke-Midura, Jody; Dede, Chris – Journal of Research on Technology in Education, 2010
Despite three decades of advances in information and communications technology (ICT) and a generation of research on cognition and new pedagogical strategies, the field of assessment has not progressed much beyond paper-and-pencil item-based tests. Research has shown these instruments are not valid measures of sophisticated intellectual…
Descriptors: Technology Integration, Computer Assisted Testing, Student Evaluation, Evaluation Methods

