Showing all 12 results
Peer reviewed
Brown, Ted – Journal of Occupational Therapy, Schools & Early Intervention, 2019
The Bruininks-Oseretsky Test of Motor Proficiency -- Second Edition (BOT-2) is a commonly used assessment of children's skills. It is important that assessments have validity evidence reported about them. The objective of the study was to investigate the structural validity of the BOT-2's eight subscales and four composite scales. A sample of 117…
Descriptors: Performance Tests, Psychomotor Skills, Test Validity, Children
Peer reviewed
Garn, Alex C.; Webster, E. Kipling – Measurement in Physical Education and Exercise Science, 2018
The Test of Gross Motor Development--Second Edition (TGMD-2) is a widely used evaluation tool of children's fundamental motor skills (FMS). This study illustrates how exploratory structural equation modeling (ESEM) addresses current limitations associated with TGMD-2 factor structure. Using the normative dataset from the TGMD-2 manual, we test…
Descriptors: Motor Development, Performance Tests, Factor Structure, Structural Equation Models
Peer reviewed
Jiang, Yu; Zhang, Jiahui; Xin, Tao – Journal of Educational and Behavioral Statistics, 2019
This article is an overview of the National Assessment of Education Quality (NAEQ) of China in reading, mathematics, sciences, arts, physical education, and moral education at Grades 4 and 8. After a review of the background and history of NAEQ, we present the assessment framework with students' holistic development at the core and the design for…
Descriptors: Foreign Countries, Educational Quality, Educational Improvement, National Competency Tests
Peer reviewed
Kluwe, Margret; Miyahara, Motohide; Heveldt, Kate – Physical Education and Sport Pedagogy, 2012
Background: Specificity and transfer of learning have been examined in experimental studies. However, their findings may not be relevant to practitioners because of the difference between the experiment conditions and teaching situations. This case study investigates the theoretical issue of specificity vs. transfer of learning by conducting…
Descriptors: Learning Disabilities, Performance Tests, Test Items, Teaching Methods
Peer reviewed
Zhu, Weimo; Fox, Connie; Park, Youngsik; Fisette, Jennifer L.; Dyson, Ben; Graber, Kim C.; Avery, Marybell; Franck, Marian; Placek, Judith H.; Rink, Judy; Raynes, De – Measurement in Physical Education and Exercise Science, 2011
The purpose of this study was to develop and calibrate an assessment system, or bank, using the latest measurement theories and methods to promote valid and reliable student assessment in physical education. Using an anchor-test equating design, a total of 30 items or assessments were administered to 5,021 (2,568 boys and 2,453 girls) students in…
Descriptors: Video Technology, Physical Education, Scoring Rubrics, Kindergarten
Peer reviewed
Aryadoust, Vahid – International Journal of Listening, 2012
This article investigates a version of the International English Language Testing System (IELTS) listening test for evidence of differential item functioning (DIF) based on gender, nationality, age, and degree of previous exposure to the test. Overall, the listening construct was found to be underrepresented, which is probably an important cause…
Descriptors: Evidence, Test Bias, Testing, Listening Comprehension Tests
Peer reviewed
Verheggen, M. M.; Muijtjens, A. M. M.; Os, J. Van; Schuwirth, L. W. T. – Advances in Health Sciences Education, 2008
Background: To establish credible, defensible and acceptable passing scores for written tests is a challenge for health profession educators. Angoff procedures are often used to establish pass/fail decisions for written and performance tests. In an Angoff procedure judges' expertise and professional skills are assumed to influence their ratings of…
Descriptors: Health Occupations, Performance Tests, Scoring, Item Response Theory
Peer reviewed
Zhu, Weimo; Rink, Judy; Placek, Judith H.; Graber, Kim C.; Fox, Connie; Fisette, Jennifer L.; Dyson, Ben; Park, Youngsik; Avery, Marybell; Franck, Marian; Raynes, De – Measurement in Physical Education and Exercise Science, 2011
New testing theories, concepts, and psychometric methods (e.g., item response theory, test equating, and item bank) developed during the past several decades have many advantages over previous theories and methods. In spite of their introduction to the field, they have not been fully accepted by physical educators. Further, the manner in which…
Descriptors: Physical Education, Quality Control, Psychometrics, Item Response Theory
Peer reviewed
Judd, Wallace – Practical Assessment, Research & Evaluation, 2009
Over the past twenty years in performance testing a specific item type with distinguishing characteristics has arisen time and time again. It's been invented independently by dozens of test development teams. And yet this item type is not recognized in the research literature. This article is an invitation to investigate the item type, evaluate…
Descriptors: Test Items, Test Format, Evaluation, Item Analysis
Peer reviewed
Fox, Connie; Zhu, Weimo; Park, Youngsik; Fisette, Jennifer L.; Graber, Kim C.; Dyson, Ben; Avery, Marybell; Franck, Marian; Placek, Judith H.; Rink, Judy; Raynes, De – Measurement in Physical Education and Exercise Science, 2011
In addition to validity and reliability evidence, other psychometric qualities of the PE Metrics assessments needed to be examined. This article describes how those critical psychometric issues were addressed during the PE Metrics assessment bank construction. Specifically, issues included (a) number of items or assessments needed, (b) training…
Descriptors: Measures (Individuals), Psychometrics, Interrater Reliability, Training
Peer reviewed
Huang, Chiungjung – Educational and Psychological Measurement, 2009
This study examined the percentage of task-sampling variability in performance assessment via a meta-analysis. In total, 50 studies containing 130 independent data sets were analyzed. Overall results indicate that the percentage of variance for (a) differential difficulty of task was roughly 12% and (b) examinee's differential performance of the…
Descriptors: Test Bias, Research Design, Performance Based Assessment, Performance Tests
Peer reviewed
Kim, Seock-Ho; Cohen, Allan S.; Alagoz, Cigdem; Kim, Sukwoo – Journal of Educational Measurement, 2007
Data from a large-scale performance assessment (N = 105,731) were analyzed with five differential item functioning (DIF) detection methods for polytomous items to examine the congruence among the DIF detection methods. Two different versions of the item response theory (IRT) model-based likelihood ratio test, the logistic regression likelihood…
Descriptors: Performance Based Assessment, Performance Tests, Item Response Theory, Test Bias