Publication Date
In 2025 | 1 |
Since 2024 | 4 |
Since 2021 (last 5 years) | 11 |
Since 2016 (last 10 years) | 26 |
Since 2006 (last 20 years) | 66 |
Descriptor
Evaluation Methods | 104 |
Test Items | 104 |
Test Validity | 104 |
Test Construction | 42 |
Test Reliability | 36 |
Student Evaluation | 25 |
Psychometrics | 21 |
Scores | 17 |
Item Analysis | 16 |
Foreign Countries | 15 |
Test Bias | 15 |
More ▼ |
Source
Author
Hill, Heather C. | 3 |
Abedi, Jamal | 2 |
Blunk, Merrie | 2 |
Bowles, Ryan P. | 2 |
Goffney, Imani Masters | 2 |
Goodwin, Sarah | 2 |
Hambleton, Ronald K. | 2 |
Konishi, Haruka | 2 |
Skibbe, Lori E. | 2 |
Troia, Gary A. | 2 |
Ahmed, Wondimu | 1 |
More ▼ |
Publication Type
Education Level
Laws, Policies, & Programs
Every Student Succeeds Act… | 3 |
Individuals with Disabilities… | 3 |
Rehabilitation Act 1973… | 3 |
No Child Left Behind Act 2001 | 2 |
Assessments and Surveys
What Works Clearinghouse Rating
Tia M. Fechter; Heeyeon Yoon – Language Testing, 2024
This study evaluated the efficacy of two proposed methods in an operational standard-setting study conducted for a high-stakes language proficiency test of the U.S. government. The goal was to seek low-cost modifications to the existing Yes/No Angoff method to increase the validity and reliability of the recommended cut scores using a convergent…
Descriptors: Standard Setting, Language Proficiency, Language Tests, Evaluation Methods
Fu Chen; Ying Cui; Alina Lutsyk-King; Yizhu Gao; Xiaoxiao Liu; Maria Cutumisu; Jacqueline P. Leighton – Education and Information Technologies, 2024
Post-secondary data literacy education is critical to students' academic and career success. However, the literature has not adequately addressed the conceptualization and assessment of data literacy for post-secondary students. In this study, we introduced a novel digital performance-based assessment for teaching and evaluating post-secondary…
Descriptors: Performance Based Assessment, College Students, Information Literacy, Evaluation Methods
Guher Gorgun; Okan Bulut – Educational Measurement: Issues and Practice, 2025
Automatic item generation may supply many items instantly and efficiently to assessment and learning environments. Yet, the evaluation of item quality persists to be a bottleneck for deploying generated items in learning and assessment settings. In this study, we investigated the utility of using large-language models, specifically Llama 3-8B, for…
Descriptors: Artificial Intelligence, Quality Control, Technology Uses in Education, Automation
An, Lily Shiao; Ho, Andrew Dean; Davis, Laurie Laughlin – Educational Measurement: Issues and Practice, 2022
Technical documentation for educational tests focuses primarily on properties of individual scores at single points in time. Reliability, standard errors of measurement, item parameter estimates, fit statistics, and linking constants are standard technical features that external stakeholders use to evaluate items and individual scale scores.…
Descriptors: Documentation, Scores, Evaluation Methods, Longitudinal Studies
Philomina Abena Anyidoho; Rebecca Berenbon; Bridget McHugh – International Journal of Training and Development, 2024
Many workforce development training programmes use learning gains as a measure of programme effectiveness. However, research on K-12 education suggests that posttest scores may be influenced by pretesting effects. Pretesting may improve posttest performance by giving learners preknowledge of posttest content. Alternatively, pretesting may enhance…
Descriptors: Trainees, Trainers, Labor Force Development, High Stakes Tests
Meng, Yaru; Fu, Hua – Modern Language Journal, 2023
The distinguishing feature of dynamic assessment (DA) is the dialectical integration of assessment and instruction. However, how to design the targeted instruction or mediation has been relatively underexplored. To address this gap, this study proposes the attribute-based mediation model (AMM), an English-as-a-foreign-language listening mediation…
Descriptors: Evaluation Methods, Teaching Methods, Models, English (Second Language)
Toker, Turker – International Journal of Curriculum and Instruction, 2023
Achievement tests are among the most widely used data collection tools to measure the knowledge and skill levels of individuals. For this reason, the existence of valid and reliable achievement tests that can perfectly reveal the competencies that a person should have in any discipline is of great importance. The purpose of this research is to…
Descriptors: Basic Skills, Evaluation Methods, Test Items, Test Validity
Parry, James R. – Online Submission, 2020
This paper presents research and provides a method to ensure that parallel assessments, that are generated from a large test-item database, maintain equitable difficulty and content coverage each time the assessment is presented. To maintain fairness and validity it is important that all instances of an assessment, that is intended to test the…
Descriptors: Culture Fair Tests, Difficulty Level, Test Items, Test Validity
Thapelo Ncube Whitfield – ProQuest LLC, 2021
Student Experience surveys are used to measure student attitudes towards their campus as well as to initiate conversations for institutional change. Validity evidence to support the interpretations of these surveys' results, however, is lacking. The first purpose of this study was to compare three Differential Item Functioning (DIF) methods on…
Descriptors: College Students, Student Surveys, Student Experience, Student Attitudes
Lynch, Sarah – Practical Assessment, Research & Evaluation, 2022
In today's digital age, tests are increasingly being delivered on computers. Many of these computer-based tests (CBTs) have been adapted from paper-based tests (PBTs). However, this change in mode of test administration has the potential to introduce construct-irrelevant variance, affecting the validity of score interpretations. Because of this,…
Descriptors: Computer Assisted Testing, Tests, Scores, Scoring
Anani Sarab, Mohammad Reza; Rahmani, Simindokht – International Journal of Language Testing, 2023
Language testing and assessment have grown in popularity and gained significance in the last few decades, and there is a rising need for assessment literate stakeholders in the field of language education. As teachers play a major role in assessing students, there is a need to make sure they have the right level of assessment knowledge and skills…
Descriptors: Language Tests, Literacy, Second Language Learning, Factor Analysis
Mohammed, Aisha; Dawood, Abdul Kareem Shareef; Alghazali, Tawfeeq; Kadhim, Qasim Khlaif; Sabti, Ahmed Abdulateef; Sabit, Shaker Holh – International Journal of Language Testing, 2023
Cognitive diagnostic models (CDMs) have received much interest within the field of language testing over the last decade due to their great potential to provide diagnostic feedback to all stakeholders and ultimately improve language teaching and learning. A large number of studies have demonstrated the application of CDMs on advanced large-scale…
Descriptors: Reading Comprehension, Reading Tests, Language Tests, English (Second Language)
Clauser, Brian E.; Baldwin, Peter; Margolis, Melissa J.; Mee, Janet; Winward, Marcia – Journal of Educational Measurement, 2017
Validating performance standards is challenging and complex. Because of the difficulties associated with collecting evidence related to external criteria, validity arguments rely heavily on evidence related to internal criteria--especially evidence that expert judgments are internally consistent. Given its importance, it is somewhat surprising…
Descriptors: Evaluation Methods, Standard Setting, Cutting Scores, Expertise
Skibbe, Lori E.; Bowles, Ryan P.; Goodwin, Sarah; Troia, Gary A.; Konishi, Haruka – Language, Speech, and Hearing Services in Schools, 2020
Purpose: The Access to Literacy Assessment System--Phonological Awareness (ATLAS-PA) was developed for use with children with speech and/or language impairment. The subtests (Rhyming, Blending, and Segmenting) are appropriate for children who are 3-7 years of age. ATLAS-PA is composed entirely of receptive items, incorporates individualized levels…
Descriptors: Phonological Awareness, Speech Impairments, Language Impairments, Young Children
Skibbe, Lori E.; Bowles, Ryan P.; Goodwin, Sarah; Troia, Gary A.; Konishi, Haruka – Grantee Submission, 2020
Purpose: The Access to Literacy Assessment System--Phonological Awareness (ATLAS-PA) was developed for use with children with speech and/or language impairment. The subtests (rhyming, blending, segmenting) are appropriate for children who are 3 to 7 years of age. ATLAS-PA is comprised entirely of receptive items, incorporates individualized levels…
Descriptors: Phonological Awareness, Speech Impairments, Language Impairments, Young Children