NotesFAQContact Us
Collection
Advanced
Search Tips
Audience
Laws, Policies, & Programs
What Works Clearinghouse Rating
Showing all 13 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Ayfer Sayin; Mark Gierl – Educational Measurement: Issues and Practice, 2024
The purpose of this study is to introduce and evaluate a method for generating reading comprehension items using template-based automatic item generation. To begin, we describe a new model for generating reading comprehension items called the text analysis cognitive model assessing inferential skills across different reading passages. Next, the…
Descriptors: Algorithms, Reading Comprehension, Item Analysis, Man Machine Systems
David Bamat – ProQuest LLC, 2021
The State NAEP program only reports the mean achievement estimate of a subgroup within a given state if it samples at least 62 students who identify with the subgroup. Since some subgroups of students constitute small proportions of certain states' general student populations, these low-incidence groups of students are seldom sufficiently sampled…
Descriptors: National Competency Tests, Mathematics Education, Academic Achievement, Evaluation Research
Peer reviewed Peer reviewed
Direct linkDirect link
Rutkowski, David; Rutkowski, Leslie; Flores, Charity – Educational Assessment, 2022
As more states move to universal computer-based assessments, an emergent issue concerns the effect that device type might have on student results. Although, several research studies have explored device effects, most of these studies focused on the differences between tablets and desktops/laptops. In the current study, we distinguish between…
Descriptors: Computer Assisted Testing, Computers, Laptop Computers, Handheld Devices
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Dogan, Enis – Practical Assessment, Research & Evaluation, 2018
Several large scale assessments include student, teacher, and school background questionnaires. Results from such questionnaires can be reported for each item separately, or as indices based on aggregation of multiple items into a scale. Interpreting scale scores is not always an easy task though. In disseminating results of achievement tests, one…
Descriptors: Rating Scales, Benchmarking, Questionnaires, Achievement Tests
Samosa, Resty C. – Online Submission, 2022
Due to the unprecedented COVID-19 incident, basic education institutions have faced different challenges in their teaching-learning activities. Particularly conducting assessments remotely during COVID-19 has posed extraordinary challenges for basic education institutions owing to lack of preparation superimposed with the inherent problems of…
Descriptors: Educational Change, COVID-19, Pandemics, Teaching Methods
Hayes, Ann Milligan – ProQuest LLC, 2015
Over the last two decades, there has been renewed interest in formative assessment, in large part due to the increasing pressures and prevalence of "high stakes" summative assessments. As states try to meet the requirements of the No Child Left Behind law, teachers and administrators are realizing that formative assessment offers an…
Descriptors: Middle School Teachers, Language Arts, Computer Assisted Testing, Formative Evaluation
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Öztürk-Gübes, Nese; Kelecioglu, Hülya – Educational Sciences: Theory and Practice, 2016
The purpose of this study was to examine the impact of dimensionality, common-item set format, and different scale linking methods on preserving equity property with mixed-format test equating. Item response theory (IRT) true-score equating (TSE) and IRT observed-score equating (OSE) methods were used under common-item nonequivalent groups design.…
Descriptors: Test Format, Item Response Theory, True Scores, Equated Scores
Harris, Douglas N.; Anderson, Andrew – Carnegie Foundation for the Advancement of Teaching, 2013
There is a growing body of research on the validity and reliability of value-added measures, but most of this research has focused on elementary grades. Driven by several federal initiatives such as Race to the Top, Teacher Incentive Fund, and ESEA waivers, however, many states have incorporated value-added measures into the evaluations not only…
Descriptors: Teacher Effectiveness, Teacher Evaluation, Evaluation Methods, Evaluation Research
Magno, Carlo – Online Submission, 2009
The present report demonstrates the difference between classical test theory (CTT) and item response theory (IRT) approach using an actual test data for chemistry junior high school students. The CTT and IRT were compared across two samples and two forms of test on their item difficulty, internal consistency, and measurement errors. The specific…
Descriptors: Private Schools, Measurement, Error of Measurement, Foreign Countries
Peer reviewed Peer reviewed
PDF on ERIC Download full text
National Center for Education Research, 2009
Since 2002, the Institute of Education Sciences (IES) has funded more than 400 research grants through the National Center for Education Research. This document lists the publications that have resulted from these projects. Publications from IES grantees include articles intended for scientific audiences, as well as articles written for general…
Descriptors: Audiences, Grants, Educational Research, Publications
Peer reviewed Peer reviewed
Direct linkDirect link
Pegg, Phillip O.; Plybon, Laura E. – Journal of Early Adolescence, 2005
The purpose of this study was to examine the psychometric qualities of two theoretical subscales of the Multigroup Ethnic Identity Measure (MEIM), Ethnic Identity Exploration and Ethnic Identity Commitment, that have been supported in research with early adolescent samples. The study was conducted to further validate the MEIM as a two-factor…
Descriptors: Early Adolescents, Psychometrics, Ethnicity, African Americans
Peer reviewed Peer reviewed
Direct linkDirect link
Chang, Shun-Wen – Educational and Psychological Measurement, 2006
This study evaluates the effects of employing the linear, normalizing, and arcsine transformation methods for constructing scale scores on the Basic Competence Test (BCTEST). Tests in three subject areas (Chinese, English, and Mathematics) were studied using the data of test administrations from 2001 to 2003. The resulting scale scores for each…
Descriptors: Standardized Tests, Achievement Tests, Test Theory, True Scores
Peer reviewed Peer reviewed
Direct linkDirect link
Manswell-Butty, Jo-Anne L.; Reid, Malva Daniel; LaPoint, Velma – New Directions for Evaluation, 2004
Program evaluation has long been used to reveal program characteristics, merits, and challenges. While providing information about program effectiveness, evaluations can also ensure understanding of program outcomes, efficiency, and quality. Furthermore, evaluations can analyze and examine a program's political and social environment as well as…
Descriptors: Urban Schools, Evaluation Research, Evaluators, Intervention