NotesFAQContact Us
Collection
Advanced
Search Tips
Showing 1,216 to 1,230 of 9,552 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Arslan, Burcu; Jiang, Yang; Keehner, Madeleine; Gong, Tao; Katz, Irvin R.; Yan, Fred – Educational Measurement: Issues and Practice, 2020
Computer-based educational assessments often include items that involve drag-and-drop responses. There are different ways that drag-and-drop items can be laid out and different choices that test developers can make when designing these items. Currently, these decisions are based on experts' professional judgments and design constraints, rather…
Descriptors: Test Items, Computer Assisted Testing, Test Format, Decision Making
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Gübes, Nese; Uyar, Seyma – International Journal of Progressive Education, 2020
This study aims to compare the performance of different small sample equating methods in the presence and absence of differential item functioning (DIF) in common items. In this research, Tucker linear equating, Levine linear equating, unsmoothed and pre-smoothed (C=4) chained equipercentile equating, and simplified circle arc equating methods…
Descriptors: Test Bias, Equated Scores, Test Items, Methods
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Kurnaz-Adibatmaz, Fatma Betül; Yildiz, Hüseyin – Journal of Theoretical Educational Science, 2020
In this study logistic regression and Lord's Chi Square methods were used to research the items that have DIF. The study utilized Peabody Picture Vocabulary Test (PPVT). The original form of the PPVT includes four options. Three different forms (A, B and C) were formed by removing one of the distractors respectively. The original form of PPVT was…
Descriptors: Item Analysis, Test Items, Vocabulary, Verbal Ability
Peer reviewed Peer reviewed
Direct linkDirect link
Sullivan, Alice – International Journal of Social Research Methodology, 2020
This article replies to the responses to my article on "Sex and the Census: Why surveys should not conflate sex and gender identity". Fugard conflates sex itself with the characteristics associated with sex, such as finger length ratios, leading to the erroneous implication that binary sex is not a useful explanatory variable. Hines…
Descriptors: Foreign Countries, National Surveys, Census Figures, Test Items
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Aborisade, Olatunbosun James; Fajobi, Olutoyin Olufunke – Educational Research and Reviews, 2020
West Africa Examination Council (WAEC) and National Examination Council (NECO) are the two major examination bodies saddled with the responsibility of awarding Senior Secondary School Certificate in Nigeria. This study examined the comparability of the psychometric properties of the items constructed by the two examination bodies using Item…
Descriptors: Foreign Countries, Mathematics Tests, Psychometrics, Test Items
Peer reviewed Peer reviewed
Direct linkDirect link
Harrison, George M. – Journal of Environmental Education, 2020
The Children's New Ecological Paradigms scale was originally developed for children ages 10-12 and was presented as valuable for comparing that age group with older participants. This study uses cognitive interviews and measurement invariance testing to investigate how well the scores maintain the same meaning between these two age groups. The…
Descriptors: Attitude Measures, Children, Test Validity, Middle School Students
Peer reviewed Peer reviewed
Direct linkDirect link
Kopp, Jason P.; Jones, Andrew T. – Applied Measurement in Education, 2020
Traditional psychometric guidelines suggest that at least several hundred respondents are needed to obtain accurate parameter estimates under the Rasch model. However, recent research indicates that Rasch equating results in accurate parameter estimates with sample sizes as small as 25. Item parameter drift under the Rasch model has been…
Descriptors: Item Response Theory, Psychometrics, Sample Size, Sampling
Peer reviewed Peer reviewed
Direct linkDirect link
Menold, Natalja – Field Methods, 2020
In randomized experiments, inventories with reverse-keyed items are compared with inventories in which all the items are either positively or negatively associated with the underlying concept. The results show that with reverse keying, a control of the potential bias was not sufficient; likewise, the factorial structure, reliability, and validity…
Descriptors: Test Items, Measures (Individuals), Eye Movements, Factor Structure
Peer reviewed Peer reviewed
Direct linkDirect link
Eshani N. Lee; MaryKay Orgill – Chemistry Education Research and Practice, 2025
Multilingual learners face significant challenges when navigating the linguistic complexities of chemistry assessments. This study, employing the Equitable Framework for Classroom Assessment, identified these specific challenging features in general chemistry assessment items on the topics of limiting reactant and percent yield. Through in-depth,…
Descriptors: Multilingualism, Second Language Learning, Language Proficiency, Syntax
Peer reviewed Peer reviewed
Direct linkDirect link
Mihyun Son; Minsu Ha – Education and Information Technologies, 2025
Digital literacy is essential for scientific literacy in a digital world. Although the NGSS Practices include many activities that require digital literacy, most studies have examined digital literacy from a generic perspective rather than a curricular context. This study aimed to develop a self-report tool to measure elements of digital literacy…
Descriptors: Test Construction, Measures (Individuals), Digital Literacy, Scientific Literacy
Peer reviewed Peer reviewed
Direct linkDirect link
Anlu Yang; Xiaofen D. Hamilton; Yongshun Wang; Peter Smolianov; Jose Castro-Piñero; Jianmin Guan; Tamara Dolmatova; Xin Zhang; Jiren Zhang; Enyan Zhan; Mark F. Hamilton – Journal of Teaching in Physical Education, 2025
Purpose: This study aimed to compare the result assessment approaches used in the widely implemented health-related fitness batteries in school-based physical education programs. Method: Fitness test batteries implemented in the European Union (Assessing Levels of Physical Activity and Fitness), China (China's National Physical Fitness Testing),…
Descriptors: Physical Fitness, Tests, Health Promotion, Physical Education
Peer reviewed Peer reviewed
Direct linkDirect link
Bianca Böhmer; Gabrielle Wills – Large-scale Assessments in Education, 2025
This paper examines the effect of COVID-19 on learning loss and learning inequality in South Africa using 2016 and 2021 Grade 4 PIRLS datasets. On average, South African Grade 4 reading achievement declined by 31 PIRLS points from 320 in 2016 to 288 in 2021, equivalent to a decline of 0.29 standard deviations or 50-60% of a year of learning. The…
Descriptors: COVID-19, Pandemics, Grade 4, Elementary School Students
Peer reviewed Peer reviewed
Direct linkDirect link
Luo, Xiao; Wang, Xinrui – International Journal of Testing, 2019
This study introduced dynamic multistage testing (dy-MST) as an improvement to existing adaptive testing methods. dy-MST combines the advantages of computerized adaptive testing (CAT) and computerized adaptive multistage testing (ca-MST) to create a highly efficient and regulated adaptive testing method. In the test construction phase, multistage…
Descriptors: Adaptive Testing, Computer Assisted Testing, Test Construction, Psychometrics
Peer reviewed Peer reviewed
Direct linkDirect link
Palermo, Corey; Bunch, Michael B.; Ridge, Kirk – Journal of Educational Measurement, 2019
Although much attention has been given to rater effects in rater-mediated assessment contexts, little research has examined the overall stability of leniency and severity effects over time. This study examined longitudinal scoring data collected during three consecutive administrations of a large-scale, multi-state summative assessment program.…
Descriptors: Scoring, Interrater Reliability, Measurement, Summative Evaluation
Peer reviewed Peer reviewed
Direct linkDirect link
Reed, Jessica J.; Raker, Jeffrey R.; Murphy, Kristen L. – Journal of Chemical Education, 2019
The ability to assess students' content knowledge and make meaningful comparisons of student performance is an important component of instruction. ACS exams have long served as tools for standardized assessment of students' chemistry knowledge. Because these exams are designed by committees of practitioners to cover a breadth of topics in the…
Descriptors: Science Tests, Standardized Tests, Chemistry, Student Evaluation
Pages: 1  |  ...  |  78  |  79  |  80  |  81  |  82  |  83  |  84  |  85  |  86  |  ...  |  637