Publication Date
In 2025 | 1 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 2 |
Since 2016 (last 10 years) | 4 |
Since 2006 (last 20 years) | 15 |
Descriptor
Source
Author
Publication Type
Education Level
Elementary Secondary Education | 4 |
Higher Education | 4 |
Postsecondary Education | 3 |
Secondary Education | 2 |
Adult Education | 1 |
Elementary Education | 1 |
Grade 3 | 1 |
Grade 4 | 1 |
Grade 5 | 1 |
High Schools | 1 |
Junior High Schools | 1 |
More ▼ |
Audience
Researchers | 7 |
Practitioners | 2 |
Policymakers | 1 |
Teachers | 1 |
Location
Canada | 4 |
New York | 2 |
California | 1 |
California (Los Angeles) | 1 |
Maine | 1 |
Michigan | 1 |
Netherlands | 1 |
New Hampshire | 1 |
Nigeria | 1 |
Oregon | 1 |
United Kingdom (England) | 1 |
More ▼ |
Laws, Policies, & Programs
Education Consolidation… | 2 |
Education for All Handicapped… | 1 |
Elementary and Secondary… | 1 |
Hawkins Stafford Act 1988 | 1 |
No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
Advanced Placement… | 1 |
Minnesota Multiphasic… | 1 |
National Assessment of… | 1 |
Stanford Achievement Tests | 1 |
Wechsler Adult Intelligence… | 1 |
What Works Clearinghouse Rating
Ken O'Connor; Matt Townsley – Phi Delta Kappan, 2025
Decisions about assessment are often built on myths about teacher professional judgment and subjectivity that prioritize standardized assessment over classroom assessment. Ken O'Connor and Matt Townsley discuss some of the most common myths and explain how to dispel them by developing clear guidelines in which teachers can exercise their judgment,…
Descriptors: Decision Making, Student Evaluation, Standardized Tests, Testing Problems
van der Linden, Wim J. – Journal of Educational and Behavioral Statistics, 2019
Lord's (1980) equity theorem claims observed-score equating to be possible only when two test forms are perfectly reliable or strictly parallel. An analysis of its proof reveals use of an incorrect statistical assumption. The assumption does not invalidate the theorem itself though, which can be shown to follow directly from the discrete nature of…
Descriptors: Equated Scores, Testing Problems, Item Response Theory, Evaluation Methods
Judith Enriquez – Prism: Casting New Light on Learning, Theory & Practice, 2020
Documents are usually circulated as carriers of transparent information. They can serve as evidence of accountability. In fact, they embody the most desired value of managerialism, where the culture of audit and compliance is fully served and delivered in written and textual form. This article explores assessment by attending to its principal…
Descriptors: Colleges, Evaluation Methods, Documentation, Outcome Based Education
Allehaiby, Wid Hasen; Al-Bahlani, Sara – Arab World English Journal, 2021
One of the main challenges higher educational institutions encounter amid the recent COVID-19 crisis is transferring assessment approaches from the traditional face-to-face form to the online Emergency Remote Teaching approach. A set of language assessment principles, practicality, reliability, validity, authenticity, and washback, which can be…
Descriptors: Barriers, Distance Education, Evaluation Methods, Teaching Methods
Brown, Nathaniel J.; Afflerbach, Peter P.; Croninger, Robert G. – Educational Psychology Review, 2014
National policy and standards documents, including the National Assessment of Educational Progress frameworks, the "Common Core State Standards" and the "Next Generation Science Standards," assert the need to assess critical-analytic thinking (CAT) across subject areas. However, assessment of CAT poses several challenges for…
Descriptors: Critical Thinking, Thinking Skills, National Standards, National Competency Tests
Orrill, Chandra Hawley; Kim, Ok-Kyeong; Peters, Susan A.; Lischka, Alyson E.; Jong, Cindy; Sanchez, Wendy B.; Eli, Jennifer A. – Mathematics Teacher Education and Development, 2015
Developing and writing assessment items that measure teachers' knowledge is an intricate and complex undertaking. In this paper, we begin with an overview of what is known about measuring teacher knowledge. We then highlight the challenges inherent in creating assessment items that focus specifically on measuring teachers' specialised knowledge…
Descriptors: Specialization, Knowledge Base for Teaching, Educational Strategies, Testing Problems
Osuji, U. S. A. – Turkish Online Journal of Distance Education, 2012
The rapid development of the information and communication technologies has brought about the modernization of teaching and learning across the globe. Today, teaching and learning are made easy through the use of computer technologies and/or the electronic and other multi-media elements. It is widely accepted that electronic learning naturally…
Descriptors: Foreign Countries, Electronic Learning, Electronic Equipment, Computer Assisted Testing
Schifter, Catherine C.; Carey, Martha – International Association for Development of the Information Society, 2014
The No Child Left Behind (NCLB) legislation spawned a plethora of standardized testing services for all the high stakes testing required by the law. We argue that one-size-fits all assessments disadvantage students who are English Language Learners, in the USA, as well as students with limited economic resources, special needs, and not reading on…
Descriptors: Standardized Tests, Models, Evaluation Methods, Educational Legislation
Sturgis, Chris – International Association for K-12 Online Learning, 2014
This paper is part of a series investigating the implementation of competency education. The purpose of the paper is to explore how districts and schools can redesign grading systems to best help students to excel in academics and to gain the skills that are needed to be successful in college, the community, and the workplace. In order to make the…
Descriptors: Grading, Competency Based Education, Evaluation Methods, Evaluation Research
Camilli, Gregory – Educational Research and Evaluation, 2013
In the attempt to identify or prevent unfair tests, both quantitative analyses and logical evaluation are often used. For the most part, fairness evaluation is a pragmatic attempt at determining whether procedural or substantive due process has been accorded to either a group of test takers or an individual. In both the individual and comparative…
Descriptors: Alternative Assessment, Test Bias, Test Content, Test Format
de La Torre, Jimmy; Karelitz, Tzur M. – Journal of Educational Measurement, 2009
Compared to unidimensional item response models (IRMs), cognitive diagnostic models (CDMs) based on latent classes represent examinees' knowledge and item requirements using discrete structures. This study systematically examines the viability of retrofitting CDMs to IRM-based data with a linear attribute structure. The study utilizes a procedure…
Descriptors: Simulation, Item Response Theory, Psychometrics, Evaluation Methods
Papay, John P. – American Educational Research Journal, 2011
Recently, educational researchers and practitioners have turned to value-added models to evaluate teacher performance. Although value-added estimates depend on the assessment used to measure student achievement, the importance of outcome selection has received scant attention in the literature. Using data from a large, urban school district, I…
Descriptors: Urban Schools, Teacher Effectiveness, Reading Achievement, Achievement Tests
Laurie, Robert – Education Canada, 2009
The practice of handing out excellent grades to students who don't deserve them (grade inflation) is not a new phenomenon. Indeed grade inflation is among the oldest and most difficult issues to address in higher education. The author first studied the impact of grade inflation on student performance on standardized tests at the high school level…
Descriptors: Grade Inflation, Standardized Tests, Academic Achievement, Correlation
Cui, Ying; Leighton, Jacqueline P. – Journal of Educational Measurement, 2009
In this article, we introduce a person-fit statistic called the hierarchy consistency index (HCI) to help detect misfitting item response vectors for tests developed and analyzed based on a cognitive model. The HCI ranges from -1.0 to 1.0, with values close to -1.0 indicating that students respond unexpectedly or differently from the responses…
Descriptors: Test Length, Simulation, Correlation, Research Methodology
van der Linden, Wim J.; Breithaupt, Krista; Chuah, Siang Chee; Zhang, Yanwei – Journal of Educational Measurement, 2007
A potential undesirable effect of multistage testing is differential speededness, which happens if some of the test takers run out of time because they receive subtests with items that are more time intensive than others. This article shows how a probabilistic response-time model can be used for estimating differences in time intensities and speed…
Descriptors: Adaptive Testing, Evaluation Methods, Test Items, Reaction Time