NotesFAQContact Us
Collection
Advanced
Search Tips
Audience
Researchers1
Laws, Policies, & Programs
No Child Left Behind Act 20011
Assessments and Surveys
Advanced Placement…1
Praxis Series1
What Works Clearinghouse Rating
Showing 1 to 15 of 24 results Save | Export
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Tack, Anaïs; Piech, Chris – International Educational Data Mining Society, 2022
How can we test whether state-of-the-art generative models, such as Blender and GPT-3, are good AI teachers, capable of replying to a student in an educational dialogue? Designing an AI teacher test is challenging: although evaluation methods are much-needed, there is no off-the-shelf solution to measuring pedagogical ability. This paper reports…
Descriptors: Artificial Intelligence, Dialogs (Language), Bayesian Statistics, Decision Making
Peer reviewed Peer reviewed
Direct linkDirect link
Bardhoshi, Gerta; Erford, Bradley T. – Measurement and Evaluation in Counseling and Development, 2017
Precision is a key facet of test development, with score reliability determined primarily according to the types of error one wants to approximate and demonstrate. This article identifies and discusses several primary forms of reliability estimation: internal consistency (i.e., split-half, KR-20, a), test-retest, alternate forms, interscorer, and…
Descriptors: Scores, Test Reliability, Accuracy, Pretests Posttests
Peer reviewed Peer reviewed
Direct linkDirect link
International Journal of Testing, 2019
These guidelines describe considerations relevant to the assessment of test takers in or across countries or regions that are linguistically or culturally diverse. The guidelines were developed by a committee of experts to help inform test developers, psychometricians, test users, and test administrators about fairness issues in support of the…
Descriptors: Test Bias, Student Diversity, Cultural Differences, Language Usage
Peer reviewed Peer reviewed
Direct linkDirect link
Berkhout, Louise; Hoekman, Joop; Goorhuis-Brouwer, Sieneke M. – Early Child Development and Care, 2012
The objective of this study was to develop an instrument to observe the play behaviour of a whole group of children from four to six years of age in a classroom setting on the basis of video recording. The instrument was developed in collaboration with experienced teachers and experts on play. Categories of play were derived from the literature…
Descriptors: Observation, Video Technology, Play, Test Construction
Peer reviewed Peer reviewed
Direct linkDirect link
Krukowski, Rebecca A.; Philyaw Perez, Amanda G.; Bursac, Zoran; Goodell, Melanie; Raczynski, James M.; Smith West, Delia; Phillips, Martha M. – Journal of School Health, 2011
Background: Foods provided in schools represent a substantial portion of US children's dietary intake; however, the school food environment has proven difficult to describe due to the lack of comprehensive, standardized, and validated measures. Methods: As part of the Arkansas Act 1220 evaluation project, we developed the School Cafeteria…
Descriptors: Health Promotion, Nutrition, Public Health, Interrater Reliability
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Sood, Vishal – Journal on Educational Psychology, 2013
For identifying children with four major kinds of verbal learning disabilities viz. reading disability, speech and language comprehension disability, writing disability and mathematics disability, the present task was undertaken to construct and standardize verbal learning disabilities checklist. This checklist was developed by keeping in view the…
Descriptors: Verbal Learning, Learning Disabilities, Children, Disability Identification
McIver, Kerry L.; Brown, William H.; Pfeiffer, Karin A.; Dowda, Marsha; Pate, Russell R. – Journal of Applied Behavior Analysis, 2009
The present study describes the development and pilot testing of the Observation System for Recording Physical Activity in Children-Home version. This system was developed to document physical activity and related physical and social contexts while children are at home. An analysis of interobserver agreement and a description of children's…
Descriptors: Physical Activities, Observation, Family Environment, Physical Activity Level
Peer reviewed Peer reviewed
Direct linkDirect link
Zhu, Weimo; Rink, Judy; Placek, Judith H.; Graber, Kim C.; Fox, Connie; Fisette, Jennifer L.; Dyson, Ben; Park, Youngsik; Avery, Marybell; Franck, Marian; Raynes, De – Measurement in Physical Education and Exercise Science, 2011
New testing theories, concepts, and psychometric methods (e.g., item response theory, test equating, and item bank) developed during the past several decades have many advantages over previous theories and methods. In spite of their introduction to the field, they have not been fully accepted by physical educators. Further, the manner in which…
Descriptors: Physical Education, Quality Control, Psychometrics, Item Response Theory
Gerlick, Robert Edward – ProQuest LLC, 2010
The research presented in this manuscript was focused on the development of assessments for engineering design outcomes. The primary goal was to support efforts by the Transferrable Integrated Design Engineering Education (TIDEE) consortium in developing assessment instruments for multidisciplinary engineering capstone courses. Research conducted…
Descriptors: Engineering Education, Student Evaluation, Formative Evaluation, Testing
Peer reviewed Peer reviewed
Direct linkDirect link
Nicastro, Gerilee; Moreton, Kyle M. – Assessment Update, 2008
Western Governors University (WGU) is an online competency-based university in which students demonstrate content competence through a series of assessments. Assessments most often are performance-based or objective assessments that are developed in accordance with specific content objectives. Objective assessments generally assess lower-level…
Descriptors: Evaluators, Performance Based Assessment, Interrater Reliability, Educational Objectives
Peer reviewed Peer reviewed
Direct linkDirect link
Griffin, Helen Louise; Beech, Anthony; Print, Bobbie; Bradshaw, Helen; Quayle, Jeremy – Journal of Sexual Aggression, 2008
This paper describes the AIM2 assessment framework and the process of its development and initial testing. AIM2 is used to assess areas of concerns and strengths of young people. Some preliminary analysis is described, including the correlation of assessment items, their ability to discriminate between cases, their inter-rater reliability and a…
Descriptors: Interrater Reliability, Aggression, At Risk Persons, Test Construction
Peer reviewed Peer reviewed
Direct linkDirect link
Banda, Sekelani S. – Anatomical Sciences Education, 2009
There are concerns in the literature that the use of case-based teaching of anatomy could be compromising the depth and scope of anatomy learned by students in a problem-based learning curriculum. Poor selection of clinical cases that are used as vehicles for teaching/learning anatomy may be the root problem because some clinical cases do not…
Descriptors: Problem Based Learning, Anatomy, Case Method (Teaching Technique), Evaluation Methods
Peer reviewed Peer reviewed
Direct linkDirect link
Linehan, Marsha M.; Comtois, Katherine Anne; Brown, Milton Z.; Heard, Heidi L.; Wagner, Amy – Psychological Assessment, 2006
The authors describe the development of the Suicide Attempt Self-Injury Interview (SASII), an instrument designed to assess the factors involved in nonfatal suicide attempts and intentional self-injury. Using 4 cohorts of participants, authors generated SASII items and evaluated them with factor and content analyses and internal consistency…
Descriptors: Interrater Reliability, Suicide, Evaluation Methods, Self Destructive Behavior
Szapocznik, Jose; And Others – 1987
Research showing psychodynamic child therapy to be less effective than other forms of child treatment have used outcome measures focusing on symptomatic and behavioral change rather than on psychodynamic processes. A child therapy assessment procedure than measures the psychological functioning of the child in a psychodynamically meaningful way is…
Descriptors: Child Development, Children, Counseling Effectiveness, Evaluation Methods
Peer reviewed Peer reviewed
Direct linkDirect link
Matson, Johnny L.; Laud, Rinita B.; Gonzalez, Melissa L.; Malone, Carrie J.; Swender, Stephen L. – Research in Developmental Disabilities: A Multidisciplinary Journal, 2005
The use of anti-epileptic medications (AEDs) is much higher in individuals with intellectual disabilities than in the general population. As many of these individuals rely on such medications, clinicians should consider psychometrically sound instruments for assessing adverse side effects of these medications as one aspect of routine clinical…
Descriptors: Evaluation Methods, Seizures, Epilepsy, Developmental Disabilities
Previous Page | Next Page »
Pages: 1  |  2