Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 3 |
Since 2006 (last 20 years) | 7 |
Descriptor
Source
Educational Measurement:… | 14 |
Author
An, Chen | 1 |
Bandalos, Deborah L. | 1 |
Braun, Henry | 1 |
Burroughs, Susie | 1 |
Cawthon, Stephanie W. | 1 |
Chavez, Carlos | 1 |
Cui, Zhongmin | 1 |
Davidson, Anne H. | 1 |
Eckhout, Teresa J. | 1 |
Fennessey, James | 1 |
Ferrara, Steve | 1 |
More ▼ |
Publication Type
Journal Articles | 14 |
Reports - Descriptive | 6 |
Reports - Research | 5 |
Opinion Papers | 4 |
Reports - Evaluative | 2 |
Information Analyses | 1 |
Education Level
Elementary Secondary Education | 6 |
Elementary Education | 2 |
Middle Schools | 2 |
Adult Education | 1 |
Grade 3 | 1 |
Grade 4 | 1 |
Grade 5 | 1 |
Higher Education | 1 |
Junior High Schools | 1 |
Secondary Education | 1 |
Audience
Teachers | 1 |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 2 |
Assessments and Surveys
National Assessment of… | 1 |
What Works Clearinghouse Rating
Cui, Zhongmin – Educational Measurement: Issues and Practice, 2021
Commonly used machine learning applications seem to relate to big data. This article provides a gentle review of machine learning and shows why machine learning can be applied to small data too. An example of applying machine learning to screen irregularity reports is presented. In the example, the support vector machine and multinomial naïve…
Descriptors: Artificial Intelligence, Man Machine Systems, Data, Bayesian Statistics
Rios, Joseph A.; Ihlenfeldt, Samuel D.; Chavez, Carlos – Educational Measurement: Issues and Practice, 2020
The objectives of this two-part study were to: (a) investigate English learner (EL) accommodation practices on state accountability assessments of reading/English language arts and mathematics in grades 3-8, and (b) conduct a meta-analysis of EL accommodation effectiveness on improving test performance. Across all distinct testing programs, we…
Descriptors: Testing Accommodations, English Language Learners, Program Effectiveness, Evidence Based Practice
An, Chen; Braun, Henry; Walsh, Mary E. – Educational Measurement: Issues and Practice, 2018
Making causal inferences from a quasi-experiment is difficult. Sensitivity analysis approaches to address hidden selection bias thus have gained popularity. This study serves as an introduction to a simple but practical form of sensitivity analysis using Monte Carlo simulation procedures. We examine estimated treatment effects for a school-based…
Descriptors: Statistical Inference, Intervention, Program Effectiveness, Quasiexperimental Design
Ferrara, Steve; Svetina, Dubravka; Skucha, Sylvia; Davidson, Anne H. – Educational Measurement: Issues and Practice, 2011
Items on test score scales located at and below the Proficient cut score define the content area knowledge and skills required to achieve proficiency. Alternately, examinees who perform at the Proficient level on a test can be expected to be able to demonstrate that they have mastered most of the knowledge and skills represented by the items at…
Descriptors: Knowledge Level, Mathematics Tests, Program Effectiveness, Inferences
Wu, Margaret – Educational Measurement: Issues and Practice, 2010
In large-scale assessments, such as state-wide testing programs, national sample-based assessments, and international comparative studies, there are many steps involved in the measurement and reporting of student achievement. There are always sources of inaccuracies in each of the steps. It is of interest to identify the source and magnitude of…
Descriptors: Testing Programs, Educational Assessment, Measures (Individuals), Program Effectiveness
Kingston, Neal; Nash, Brooke – Educational Measurement: Issues and Practice, 2011
An effect size of about 0.70 (or 0.40-0.70) is often claimed for the efficacy of formative assessment, but is not supported by the existing research base. More than 300 studies that appeared to address the efficacy of formative assessment in grades K-12 were reviewed. Many of the studies had severely flawed research designs yielding…
Descriptors: Elementary Secondary Education, Formative Evaluation, Program Effectiveness, Effect Size
Cawthon, Stephanie W. – Educational Measurement: Issues and Practice, 2009
Students who are deaf or hard of hearing (SDHH) often use test accommodations when they participate in large-scale, standardized assessments. The purpose of this article is to present findings from the "Third Annual Survey of Assessment and Accommodations for Students who are Deaf or Hard of Hearing". The "big five" accommodations were reported by…
Descriptors: Standardized Tests, Testing Accommodations, Measures (Individuals), Partial Hearing

Koretz, Daniel; Stecher, Brian; Klein, Stephen; McCaffrey, Daniel – Educational Measurement: Issues and Practice, 1994
Reports on an ongoing evaluation of the Vermont portfolio assessment program. Indicates that the positive news about the instructional effects of the assessment program are in contrast with the empirical findings about the quality of the data the program has yielded. (SLD)
Descriptors: Accountability, Elementary Secondary Education, Performance Based Assessment, Portfolio Assessment

Educational Measurement: Issues and Practice, 1982
Recommendations from the Wirtz/Lapointe report, "Measuring The Quality of Education" (ED 213 769), which evaluates National Assessment of Educational Progress (NAEP), are presented. Six educators respond to the concern that NAEP's reports and material are not widely known or as useful to educators as they should be. (CM)
Descriptors: Educational Assessment, Educational Quality, Elementary Secondary Education, Information Utilization

Fennessey, James; Salganik, Laura Hersh – Educational Measurement: Issues and Practice, 1983
A conceptual analysis of gain scores and a technically sound procedure for using such scores to compare different instructional programs are presented. The procedure proposed makes use of an index called Rescaled and Adjusted Gains within Strata (RAGS), together with a flexible but explicit procedure for score analysis and reporting. (LC)
Descriptors: Achievement Gains, Comparative Analysis, Elementary Secondary Education, Outcomes of Education
Burroughs, Susie; Groce, Eric; Webeck, Mary Lee – Educational Measurement: Issues and Practice, 2005
With 3 years and counting since its inception, the scope and impact of "No Child Left Behind" is now being felt in classrooms across the nation. Although some successes have been identified, concerns about the implementation and expectations of the legislation are emerging. As a result of the legislation's emphasis on the development of…
Descriptors: Program Effectiveness, Federal Legislation, Testing, Accountability
Lukin, Leslie E.; Bandalos, Deborah L.; Eckhout, Teresa J.; Mickelson, Kristine – Educational Measurement: Issues and Practice, 2004
When STARS reform efforts were launched in 2000, teacher training in assessment was seen as crucial to the success of the program. The STARS reform efforts focus on both supporting the implementation of quality classroom assessment practices and implementing a district-based accountability system. The training programs described in this article…
Descriptors: Program Effectiveness, Accountability, Evaluation Methods, Teacher Competencies
Porter, Andrew C.; Linn, Robert L.; Trimble, C. Scott – Educational Measurement: Issues and Practice, 2005
The No Child Left Behind Act allows states to vary (a) the trajectories they select to move from the baseline percent proficient or above in 2002 to the 100% proficient goal in 2014, (b) the minimum number of students required for reporting of disaggregated subgroup results, and (c) whether or not they will use confidence intervals when…
Descriptors: Federal Legislation, Educational Improvement, Educational Policy, State Legislation

Guthrie, John T.; Lissitz, Robert W. – Educational Measurement: Issues and Practice, 1985
This reaction paper stresses the importance of matching the qualitatively different types of educational decisions with appropriate types of tests. Reading instruction is used to illustrate that while standardized tests are helpful for student classification and program accountability, they are hazardous to instructional process decisions which…
Descriptors: Decision Making, Educational Assessment, Educational Improvement, Educational Testing