Showing 151 to 165 of 3,713 results
Peer reviewed
Wesolowski, Brian C. – Music Educators Journal, 2020
Validity, reliability, and fairness are three prominent indicators for evaluating the quality of assessment processes. Each of the indicators is most often written about and applied in the context of large-scale assessment. As a result, the technical properties of these indicators make them limited in both their practicality and relevance for…
Descriptors: Music Education, Test Validity, Test Reliability, Student Evaluation
Peer reviewed
Kim, Sooyeon; Walker, Michael E. – ETS Research Report Series, 2021
Equating the scores from different forms of a test requires collecting data that link the forms. Problems arise when the test forms to be linked are given to groups that are not equivalent and the forms share no common items by which to measure or adjust for this group nonequivalence. We compared three approaches to adjusting for group…
Descriptors: Equated Scores, Weighted Scores, Sampling, Multiple Choice Tests
Peer reviewed
Hjärne, Marcus S. – Scandinavian Journal of Educational Research, 2021
Extended time is a commonly used test adaptation for standardised high-stakes tests. In this study, extended time provided for test-takers with dyslexia is examined. Data from standard versions of the Swedish Scholastic Aptitude Test (SweSAT) and data from test administrations where extra time is provided were used. Indications are that the…
Descriptors: Foreign Countries, College Entrance Examinations, Testing Accommodations, Dyslexia
Peer reviewed
Anastasia Dimiski – Educational Research for Policy and Practice, 2025
This paper addresses the influence of ethnic and gender disparities on educational outcomes and recognizes the significance of maternal education in shaping students' academic achievements. The study aims to evaluate the efficacy of policies aimed at integrating maternal involvement in education among 15-year-old students from OECD countries.…
Descriptors: Equal Education, Achievement Tests, Scores, Test Bias
Peer reviewed
Laila El-Hamamsy; María Zapata-Cáceres; Estefanía Martín-Barroso; Francesco Mondada; Jessica Dehler Zufferey; Barbara Bruno; Marcos Román-González – Technology, Knowledge and Learning, 2025
The introduction of computing education into curricula worldwide requires multi-year assessments to evaluate the long-term impact on learning. However, no single Computational Thinking (CT) assessment spans primary school, and no group of CT assessments provides a means of transitioning between instruments. This study therefore investigated…
Descriptors: Cognitive Tests, Computation, Thinking Skills, Test Validity
Meyer, J. Patrick; Dahlin, Michael – NWEA, 2022
The MAP® Growth™ theory of action describes key features of MAP Growth and its position in a comprehensive assessment system. The basic premise of the theory of action is that all students learn when MAP Growth is situated in a comprehensive assessment system and used for its intended purposes to yield information about student learning and enable…
Descriptors: Achievement Tests, Academic Achievement, Achievement Gains, Student Evaluation
Peer reviewed
Ronkin, Emily; Tully, Erin C.; Branum-Martin, Lee; Cohen, Lindsey L.; Hall, Christine; Dilly, Laura; Tone, Erin B. – Autism: The International Journal of Research and Practice, 2022
The Autism Diagnostic Observation Schedule, 2nd-edition (ADOS-2) Toddler Module is the current gold-standard measure of autism spectrum disorder (ASD), a neurodevelopmental condition more frequently diagnosed in toddler boys than girls. Some evidence suggests that behaviors assessed by the Toddler Module may capture an ASD phenotype that is more…
Descriptors: Diagnostic Tests, Autism Spectrum Disorders, Gender Differences, Interpersonal Communication
Peer reviewed
Sam Bamkin – Ethnography and Education, 2024
The iterative process of ethnography not only constructs theory, but its methodology should embody theory. Developing a theoretical framework often demands adjustments in methodology, to leverage previous work and to avoid assumptions compounding through the magnification of blind spots. New theory in policy-engaged ethnography has emphasised the…
Descriptors: Foreign Countries, Teachers, Ethnography, Sampling
Peer reviewed
Karina Mostert; Clarisse van Rensburg; Reitumetse Machaba – Journal of Applied Research in Higher Education, 2024
Purpose: This study examined the psychometric properties of intention to drop out and study satisfaction measures for first-year South African students. The factorial validity, item bias, measurement invariance and reliability were tested. Design/methodology/approach: A cross-sectional design was used. For the study on intention to drop out, 1,820…
Descriptors: Intention, Potential Dropouts, Student Satisfaction, Test Items
Peer reviewed
Xuelan Qiu; Jimmy de la Torre; You-Gan Wang; Jinran Wu – Educational Measurement: Issues and Practice, 2024
Multidimensional forced-choice (MFC) items have been found to be useful to reduce response biases in personality assessments. However, conventional scoring methods for the MFC items result in ipsative data, hindering the wider applications of the MFC format. In the last decade, a number of item response theory (IRT) models have been developed,…
Descriptors: Item Response Theory, Personality Traits, Personality Measures, Personality Assessment
Peer reviewed
Steven Lee; Matthew Schaelling – Society for Research on Educational Effectiveness, 2024
Background: Inequality along racial and economic dimensions is well-documented and widespread in educational contexts. Achievement gaps are observed among children as early as primary school and are especially notable in standardized testing (Fryer & Levitt, 2004; Fryer & Levitt, 2013; Bond & Lang 2013). In response, some observers and…
Descriptors: Elementary School Students, Middle School Students, Standardized Tests, Achievement Gap
Peer reviewed
Nishizawa, Hitoshi – Language Testing, 2023
In this study, I investigate the construct validity and fairness pertaining to the use of a variety of Englishes in listening test input. I obtained data from a post-entry English language placement test administered at a public university in the United States. In addition to expectedly familiar American English, the test features Hawai'i,…
Descriptors: Construct Validity, Listening Comprehension Tests, Language Tests, English (Second Language)
Peer reviewed
Shreya Sunderram – Journal of Curriculum Studies, 2023
Postcolonial studies have long identified history curriculum as a site of empire building. High stakes exams like the Global History Regents Exam in New York (NYGHR) undoubtedly impact curriculum but have yet to be examined through a postcolonial lens. This study evaluates to what extent, if at all, the NYGHR perpetuates eurocentrism as defined by…
Descriptors: Postcolonialism, Decolonization, History Instruction, High Stakes Tests
Peer reviewed
Walker, Cindy M.; Göçer Sahin, Sakine – Educational and Psychological Measurement, 2020
The purpose of this study was to investigate a new way of evaluating interrater reliability that can allow one to determine if two raters differ with respect to their rating on a polytomous rating scale or constructed response item. Specifically, differential item functioning (DIF) analyses were used to assess interrater reliability and compared…
Descriptors: Test Bias, Interrater Reliability, Responses, Correlation
Peer reviewed
Xue, Kang; Huggins-Manley, Anne Corinne; Leite, Walter – Grantee Submission, 2020
In data collected from virtual learning environments (VLEs), item response theory (IRT) models can be used to guide the ongoing measurement of student ability. However, such applications of IRT rely on unbiased item parameter estimates associated with test items in the VLE. Without formal piloting of the items, one can expect a large amount of…
Descriptors: Virtual Classrooms, Item Response Theory, Test Bias, Test Items