Publication Date
In 2025 | 1 |
Since 2024 | 2 |
Since 2021 (last 5 years) | 5 |
Since 2016 (last 10 years) | 10 |
Since 2006 (last 20 years) | 28 |
Descriptor
Evaluation Methods | 57 |
Multiple Choice Tests | 57 |
Test Validity | 42 |
Test Reliability | 24 |
Student Evaluation | 22 |
Test Construction | 19 |
Test Items | 13 |
Foreign Countries | 12 |
Computer Assisted Testing | 10 |
Validity | 9 |
Correlation | 8 |
More ▼ |
Source
Author
Ahmed, Amer | 1 |
Ahmed, Wondimu | 1 |
Akarsu, Bayram | 1 |
Alemi, Minoo | 1 |
Alexander, Cara J. | 1 |
Amy K. Clark | 1 |
Apantee Poonputta | 1 |
Avsec, Stanislav | 1 |
Ball, Deborah Loewenberg | 1 |
Barniol, Pablo | 1 |
Beddow, Peter A. | 1 |
More ▼ |
Publication Type
Education Level
Higher Education | 12 |
Postsecondary Education | 9 |
Elementary Education | 7 |
High Schools | 5 |
Elementary Secondary Education | 4 |
Middle Schools | 4 |
Secondary Education | 4 |
Grade 10 | 3 |
Grade 8 | 3 |
Junior High Schools | 2 |
Grade 11 | 1 |
More ▼ |
Audience
Practitioners | 2 |
Teachers | 1 |
Location
Arizona | 2 |
California | 2 |
Iran | 2 |
Virginia | 2 |
Australia | 1 |
Canada | 1 |
Colorado | 1 |
Florida | 1 |
Japan | 1 |
Kentucky | 1 |
Massachusetts | 1 |
More ▼ |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
ACT Assessment | 1 |
Dynamic Indicators of Basic… | 1 |
Peabody Individual… | 1 |
Sequential Tests of… | 1 |
Social Skills Rating System | 1 |
Trends in International… | 1 |
Wechsler Intelligence Scale… | 1 |
What Works Clearinghouse Rating
Thayaamol Upapong; Apantee Poonputta – Educational Process: International Journal, 2025
Background/purpose: The purposes of this research are to develop a reliable and valid assessment tool for measuring systems thinking skills in upper primary students in Thailand and to establish a normative criterion for evaluating their systems thinking abilities based on educational standards. Materials/methods: The study followed a three-phase…
Descriptors: Thinking Skills, Elementary School Students, Measures (Individuals), Foreign Countries
Kelsey Nason; Christine E. DeMars – Research & Practice in Assessment, 2023
Universities administer assessments for accountability and program improvement. Student effort is low during assessments due to minimal perceived consequences. The effects of low effort are compounded by assessment context. This project investigates validity concerns caused by minimal effort and exacerbated by contextual factors. Systematic…
Descriptors: Test Validity, COVID-19, Pandemics, Environmental Influences
Meagan Karvonen; Russell Swinburne Romine; Amy K. Clark – Practical Assessment, Research & Evaluation, 2024
This paper describes methods and findings from student cognitive labs, teacher cognitive labs, and test administration observations as evidence evaluated in a validity argument for a computer-based alternate assessment for students with significant cognitive disabilities. Validity of score interpretations and uses for alternate assessments based…
Descriptors: Students with Disabilities, Intellectual Disability, Severe Disabilities, Student Evaluation
Thomas Bickerton, Robert; Sangwin, Chris J. – International Journal of Mathematical Education in Science and Technology, 2022
We discuss a practical method for assessing mathematical proof online. We examine the use of faded worked examples and reading comprehension questions to understand proof. By breaking down a given proof, we formulate a checklist that can be used to generate comprehension questions which can be assessed automatically online. We then provide some…
Descriptors: Mathematics Instruction, Validity, Mathematical Logic, Evaluation Methods
Avsec, Stanislav; Jamšek, Janez – International Journal of Technology and Design Education, 2016
Technological literacy is identified as a vital achievement of technology- and engineering-intensive education. It guides the design of technology and technical components of educational systems and defines competitive employment in technological society. Existing methods for measuring technological literacy are incomplete or complicated,…
Descriptors: Technological Literacy, Elementary School Students, Secondary School Students, Evaluation Methods
Smith, Mark; Breakstone, Joel; Wineburg, Sam – Cognition and Instruction, 2019
This article reports a validity study of History Assessments of Thinking (HATs), which are short, constructed-response assessments of historical thinking. In particular, this study focuses on aspects of cognitive validity, which is an examination of whether assessments tap the intended constructs. Think-aloud interviews with 26 high school…
Descriptors: History, History Instruction, Thinking Skills, Multiple Choice Tests
Lenchuk, Iryna; Ahmed, Amer – Arab World English Journal, 2021
This article describes the results of Action Research conducted in an ESP classroom of Dhofar University located in Oman. Following the call of Oman Vision 2040 to emphasize educational practices that promote the development of higher-order cognitive processes, this study raises the following question: Can an online multiple choice question (MCQ)…
Descriptors: Taxonomy, Thinking Skills, Cognitive Processes, Multiple Choice Tests
Koskey, Kristin L. K.; Makki, Nidaa; Ahmed, Wondimu; Garafolo, Nicholas G.; Visco, Donald P., Jr. – School Science and Mathematics, 2020
Integrating engineering into the K-12 science curriculum continues to be a focus in national reform efforts in science education. Although there is an increasing interest in research in and practice of integrating engineering in K-12 science education, to date only a few studies have focused on the development of an assessment tool to measure…
Descriptors: Middle School Students, Engineering, Design, Science Education
Mahmud, Jumailiyah; Sutikno, Muzayanah; Naga, Dali S. – Educational Research and Reviews, 2016
The aim of this study is to determine variance difference between maximum likelihood and expected A posteriori estimation methods viewed from number of test items of aptitude test. The variance presents an accuracy generated by both maximum likelihood and Bayes estimation methods. The test consists of three subtests, each with 40 multiple-choice…
Descriptors: Maximum Likelihood Statistics, Computation, Item Response Theory, Test Items
Sparks, Jesse R.; Katz, Irvin R.; Beile, Penny M. – ETS Research Report Series, 2016
Digital information literacy (DIL)--generally defined as the ability to obtain, understand, evaluate, and use information in a variety of digital technology contexts--is a critically important skill deemed necessary for success in higher education as well as in the global networked economy. To determine whether college graduates possess the…
Descriptors: Technological Literacy, Information Literacy, Higher Education, Definitions
Karimi, Lotfollah; Mehrdad, Ali Gholami – Higher Education Studies, 2012
This study has attempted to investigate the administered written tests in the language department of Islamic Azad University of Hamedan, Iran from validity, practicality and reliability points of view. To this end two steps were taken. First, examining 112 tests, we knew that the face validity of 50 tests had been threatened, 9 tests lacked…
Descriptors: Foreign Countries, English (Second Language), Second Language Instruction, Multiple Choice Tests
Wilcox, Bethany R.; Pollock, Steven J. – Physical Review Special Topics - Physics Education Research, 2015
Standardized conceptual assessment represents a widely used tool for educational researchers interested in student learning within the standard undergraduate physics curriculum. For example, these assessments are often used to measure student learning across educational contexts and instructional strategies. However, to support the large-scale…
Descriptors: Science Instruction, Scientific Concepts, College Science, Physics
Lee, Hee-Sun; Liu, Ou Lydia; Linn, Marcia C. – Applied Measurement in Education, 2011
This study explores measurement of a construct called knowledge integration in science using multiple-choice and explanation items. We use construct and instructional validity evidence to examine the role multiple-choice and explanation items plays in measuring students' knowledge integration ability. For construct validity, we analyze item…
Descriptors: Knowledge Level, Construct Validity, Validity, Scaffolding (Teaching Technique)
Herman, Geoffrey L.; Zilles, Craig; Loui, Michael C. – Computer Science Education, 2014
Concept inventories hold tremendous promise for promoting the rigorous evaluation of teaching methods that might remedy common student misconceptions and promote deep learning. The measurements from concept inventories can be trusted only if the concept inventories are evaluated both by expert feedback and statistical scrutiny (psychometric…
Descriptors: Psychometrics, Concept Formation, Measures (Individuals), Teaching Methods
Hadenfeldt, Jan C.; Bernholt, Sascha; Liu, Xiufeng; Neumann, Knut; Parchmann, Ilka – Journal of Chemical Education, 2013
Helping students develop a sound understanding of scientific concepts can be a major challenge. Lately, learning progressions have received increasing attention as a means to support students in developing understanding of core scientific concepts. At the center of a learning progression is a sequence of developmental levels reflecting an…
Descriptors: Elementary School Science, Secondary School Science, Science Instruction, Chemistry