Publication Date
| In 2026 | 0 |
| Since 2025 | 1 |
| Since 2022 (last 5 years) | 3 |
| Since 2017 (last 10 years) | 6 |
| Since 2007 (last 20 years) | 16 |
Descriptor
| Difficulty Level | 25 |
| Evaluation Methods | 25 |
| Test Validity | 25 |
| Test Items | 14 |
| Test Reliability | 9 |
| Foreign Countries | 6 |
| Item Analysis | 6 |
| Multiple Choice Tests | 6 |
| Psychometrics | 6 |
| Cognitive Processes | 5 |
| Test Construction | 5 |
| More ▼ | |
Source
Author
| Alexander, Patricia A. | 1 |
| Alysha Calleia | 1 |
| Ames, Wilbur S. | 1 |
| Anna V. Fisher | 1 |
| Aria Tsegai-Moore | 1 |
| Barniol, Pablo | 1 |
| Beifang Ma | 1 |
| Bejar, Isaac I. | 1 |
| Bernholt, Sascha | 1 |
| Bradley, John M. | 1 |
| Cassondra M. Eng | 1 |
| More ▼ | |
Publication Type
| Journal Articles | 17 |
| Reports - Research | 15 |
| Reports - Evaluative | 4 |
| Reports - Descriptive | 2 |
| Collected Works - Proceedings | 1 |
| Guides - Classroom - Teacher | 1 |
| Guides - General | 1 |
| Speeches/Meeting Papers | 1 |
Education Level
| Higher Education | 5 |
| Elementary Education | 4 |
| Postsecondary Education | 4 |
| Early Childhood Education | 1 |
| Elementary Secondary Education | 1 |
| Grade 10 | 1 |
| Grade 11 | 1 |
| Grade 12 | 1 |
| Grade 4 | 1 |
| Grade 6 | 1 |
| Grade 7 | 1 |
| More ▼ | |
Audience
Location
| United Kingdom (England) | 2 |
| United States | 2 |
| Colombia | 1 |
| Colorado | 1 |
| Dominica | 1 |
| Greece | 1 |
| Grenada | 1 |
| Mexico | 1 |
| Netherlands | 1 |
| Pennsylvania (Pittsburgh) | 1 |
| Saint Lucia | 1 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
| Flesch Kincaid Grade Level… | 1 |
| Fry Readability Formula | 1 |
| Hidden Figures Test | 1 |
| National Adult Literacy… | 1 |
| Peabody Picture Vocabulary… | 1 |
| Progress in International… | 1 |
| SAT (College Admission Test) | 1 |
What Works Clearinghouse Rating
Tia M. Fechter; Heeyeon Yoon – Language Testing, 2024
This study evaluated the efficacy of two proposed methods in an operational standard-setting study conducted for a high-stakes language proficiency test of the U.S. government. The goal was to seek low-cost modifications to the existing Yes/No Angoff method to increase the validity and reliability of the recommended cut scores using a convergent…
Descriptors: Standard Setting, Language Proficiency, Language Tests, Evaluation Methods
Beifang Ma; Maximilian Krötz; Viola Deutscher; Esther Winther – International Journal of Training and Development, 2025
The rapid digital transformation of vocational education and training (VET) has underscored the need to adapt traditional assessment methods to digital formats. However, when transitioning to digital modes, it is crucial to consider factors beyond mere technical implementation, particularly the potential impact of altered presentation formats on…
Descriptors: Job Skills, Competence, Test Format, Computer Assisted Testing
Cassondra M. Eng; Aria Tsegai-Moore; Anna V. Fisher – Grantee Submission, 2024
Computerized assessments and digital games have become more prevalent in childhood, necessitating a systematic investigation of the effects of gamified executive function assessments on performance and engagement. This study examined the feasibility of incorporating gamification and a machine learning algorithm that adapts task difficulty to…
Descriptors: Preschool Children, Preschool Curriculum, Preschool Education, Preschool Tests
Kerrigan, Sarah; Norton, Anderson; Ulrich, Catherine – North American Chapter of the International Group for the Psychology of Mathematics Education, 2020
We report on and validate a system for ranking the cognitive demand of mathematical tasks. In our framework, task rankings are determined by the sequences of units and unit transformations students might use to solve each task. Using this framework, we ranked a set of 10 fractions tasks. We then interviewed 12 pre-service teachers to assess the…
Descriptors: Cognitive Processes, Difficulty Level, Fractions, Evaluation Methods
Parry, James R. – Online Submission, 2020
This paper presents research and provides a method to ensure that parallel assessments, that are generated from a large test-item database, maintain equitable difficulty and content coverage each time the assessment is presented. To maintain fairness and validity it is important that all instances of an assessment, that is intended to test the…
Descriptors: Culture Fair Tests, Difficulty Level, Test Items, Test Validity
Steven J. Howard; Hana Burianová; Alysha Calleia; Samuel Fynes-Clinton; Lisa Kervin; Sahar Bokosmaty – npj Science of Learning, 2017
Standardised educational assessments are now widespread, yet their development has given comparatively more consideration to what to assess than how to optimally assess students' competencies. Existing evidence from behavioural studies with children and neuroscience studies with adults suggest that the method of assessment may affect neural…
Descriptors: Children, Standardized Tests, Spelling, Evaluation Methods
Wilcox, Bethany R.; Pollock, Steven J. – Physical Review Special Topics - Physics Education Research, 2015
Standardized conceptual assessment represents a widely used tool for educational researchers interested in student learning within the standard undergraduate physics curriculum. For example, these assessments are often used to measure student learning across educational contexts and instructional strategies. However, to support the large-scale…
Descriptors: Science Instruction, Scientific Concepts, College Science, Physics
Alexander, Patricia A.; Dumas, Denis; Grossnickle, Emily M.; List, Alexandra; Firetto, Carla M. – Journal of Experimental Education, 2016
Relational reasoning is the foundational cognitive ability to discern meaningful patterns within an informational stream, but its reliable and valid measurement remains problematic. In this investigation, the measurement of relational reasoning unfolded in three stages. Stage 1 entailed the establishment of a research-based conceptualization of…
Descriptors: Cognitive Ability, Logical Thinking, Thinking Skills, Cognitive Processes
Hadenfeldt, Jan C.; Bernholt, Sascha; Liu, Xiufeng; Neumann, Knut; Parchmann, Ilka – Journal of Chemical Education, 2013
Helping students develop a sound understanding of scientific concepts can be a major challenge. Lately, learning progressions have received increasing attention as a means to support students in developing understanding of core scientific concepts. At the center of a learning progression is a sequence of developmental levels reflecting an…
Descriptors: Elementary School Science, Secondary School Science, Science Instruction, Chemistry
Barniol, Pablo; Zavala, Genaro – Physical Review Special Topics - Physics Education Research, 2014
In this article we discuss the findings of our research on students' understanding of vector concepts in problems without physical context. First, we develop a complete taxonomy of the most frequent errors made by university students when learning vector concepts. This study is based on the results of several test administrations of open-ended…
Descriptors: Multiple Choice Tests, Geometric Concepts, Algebra, Psychometrics
Smith, Russell W.; Davis-Becker, Susan L.; O'Leary, Lisa S. – Journal of Applied Testing Technology, 2014
This article describes a hybrid standard setting method that combines characteristics of the Angoff (1971) and Bookmark (Mitzel, Lewis, Patz & Green, 2001) methods. The proposed approach utilizes strengths of each method while addressing weaknesses. An ordered item booklet, with items sorted based on item difficulty, is used in combination…
Descriptors: Standard Setting, Difficulty Level, Test Items, Rating Scales
Henning, Grant – English Teaching Forum, 2012
To some extent, good testing procedure, like good language use, can be achieved through avoidance of errors. Almost any language-instruction program requires the preparation and administration of tests, and it is only to the extent that certain common testing mistakes have been avoided that such tests can be said to be worthwhile selection,…
Descriptors: Testing, English (Second Language), Testing Problems, Student Evaluation
Sandilands, Debra; Oliveri, Maria Elena; Zumbo, Bruno D.; Ercikan, Kadriye – International Journal of Testing, 2013
International large-scale assessments of achievement often have a large degree of differential item functioning (DIF) between countries, which can threaten score equivalence and reduce the validity of inferences based on comparisons of group performances. It is important to understand potential sources of DIF to improve the validity of future…
Descriptors: Validity, Measures (Individuals), International Studies, Foreign Countries
Simos, Panagiotis G.; Sideridis, Georgios D.; Protopapas, Athanassios; Mouzaki, Angeliki – Assessment for Effective Intervention, 2011
Assessment of lexical/semantic knowledge is performed with a variety of tests varying in response requirements. The present study exemplifies the application of modern statistical approaches in the adaptation and assessment of the psychometric properties of the "Peabody Picture Vocabulary Test--Revised" (PPVT-R) Greek. Confirmatory…
Descriptors: Elementary School Students, Reading Comprehension, Semantics, Educational Assessment
Kouame, Julien B. – Journal of MultiDisciplinary Evaluation, 2010
Background: Readability tests are indicators that measure how easy a document can be read and understood. Simple, but very often ignored, readability statistics cannot only provide information about the level of difficulty of the readability of particular documents but also can increase an evaluator's credibility. Purpose: The purpose of this…
Descriptors: Readability, Readability Formulas, Evaluation Methods, Literacy
Previous Page | Next Page »
Pages: 1 | 2
Peer reviewed
Direct link
