ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	2
Since 2006 (last 20 years)	7

Descriptor

Testing Problems	31
Scores	23
Standardized Tests	10
Test Construction	9
Elementary Secondary Education	8
Academic Achievement	6
Student Evaluation	6
Test Validity	6
Accountability	5
Computer Assisted Testing	5
Equated Scores	5
Error of Measurement	5
Test Interpretation	5
Test Reliability	5
Testing	5
Evaluation Methods	4
Secondary Education	4
Test Results	4
Achievement Tests	3
Cutting Scores	3
Educational Policy	3
Higher Education	3
Latent Trait Theory	3
Measurement Techniques	3
Program Evaluation	3
More ▼

Source

Educational Assessment	2
Evaluation and the Health…	2
Journal of Educational and…	2
Language, Speech, and Hearing…	2
Applied Measurement in…	1
Assessment in Education:…	1
Center for Assessment and…	1
Communique	1
Education Week	1
Educational Leadership	1
International Journal of…	1
NASSP Bulletin	1
National Center for Research…	1
New Directions for Testing…	1
Phi Delta Kappan	1
Policy Analysis for…	1
Studies in Educational…	1
More ▼

Publication Type

Reports - Descriptive	31
Journal Articles	18
Speeches/Meeting Papers	8
Guides - Non-Classroom	2
Reports - Evaluative	2
Opinion Papers	1
Reports - Research	1

Education Level

Elementary Secondary Education	2
Secondary Education	1

Audience

Researchers	4
Community	1
Parents	1
Practitioners	1

Location

Australia	1
California	1
Georgia (Atlanta)	1
Netherlands	1
Virginia	1

Laws, Policies, & Programs

Elementary and Secondary…	1
No Child Left Behind Act 2001	1

Assessments and Surveys

Iowa Tests of Basic Skills	1
Peabody Picture Vocabulary…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 31 results Save | Export

Item Pool Quality Control in Educational Testing: Change Point Model, Compound Risk, and Sequential Detection

Peer reviewed

Direct link

Chen, Yunxiao; Lee, Yi-Hsuan; Li, Xiaoou – Journal of Educational and Behavioral Statistics, 2022

In standardized educational testing, test items are reused in multiple test administrations. To ensure the validity of test scores, the psychometric properties of items should remain unchanged over time. In this article, we consider the sequential monitoring of test items, in particular, the detection of abrupt changes to their psychometric…

Descriptors: Standardized Tests, Test Items, Test Validity, Scores

Bad Questions: An Essay Involving Item Response Theory

Peer reviewed

Direct link

Thissen, David – Journal of Educational and Behavioral Statistics, 2016

David Thissen, a professor in the Department of Psychology and Neuroscience, Quantitative Program at the University of North Carolina, has consulted and served on technical advisory committees for assessment programs that use item response theory (IRT) over the past couple decades. He has come to the conclusion that there are usually two purposes…

Descriptors: Item Response Theory, Test Construction, Testing Problems, Student Evaluation

Prevention, Detection, and Follow-Up of High-Stakes Testing Irregularities

Direct link

Looser, Joshua – Communique, 2013

Since the passage of No Child Left Behind (NCLB), the education system has seen immense shifts in its approach to schooling. Previously, students were taught using an extant curriculum with the instructional methods of the teachers at the school; there was little systematic modification to curriculum and methods; and the variable underlying…

Descriptors: Prevention, High Stakes Tests, Teaching Methods, Scores

Educational Measurement Issues and Implications of High Stakes Decision Making in Final Examinations in Secondary Education in the Netherlands

Peer reviewed

Direct link

van Rijn, P. W.; Beguin, A. A.; Verstralen, H. H. F. M. – Assessment in Education: Principles, Policy & Practice, 2012

While measurement precision is relatively easy to establish for single tests and assessments, it is much more difficult to determine for decision making with multiple tests on different subjects. This latter is the situation in the system of final examinations for secondary education in the Netherlands and is used as an example in this paper. This…

Descriptors: Secondary Education, Tests, Foreign Countries, Decision Making

Fixing the Academic Performance Index. Policy Brief 13-1

Download full text

Polikoff, Morgan S.; McEachin, Andrew – Policy Analysis for California Education, PACE, 2013

The Academic Performance Index (API) is the centerpiece of California's state assessment and accountability system. With the recent passage of SB1458 and the pending reauthorization of both state and federal accountability legislation, there is now an unprecedented opportunity to improve the API for next generation accountability in California. In…

Descriptors: Academic Achievement, Standardized Tests, Accountability, State Legislation

Testing Officials Again Tackle Accommodations and Exclusions for Special Student Populations

Direct link

Cavanagh, Sean – Education Week, 2008

Perhaps no topic has as thoroughly vexed officials who oversee the nation's leading test of academic progress as the wide variation among states and cities in the proportion of students with disabilities and limited English proficiency whom they exclude from taking the exam or provide with special accommodations for it. The board that sets policy…

Descriptors: National Competency Tests, Testing Accommodations, Special Needs Students, Individualized Education Programs

The Truth about Scores Children Achieve on Tests.

Peer reviewed

Brown, Jonathan R. – Language, Speech, and Hearing Services in Schools, 1989

The importance of using the standard error of measurement (SEm) in determining reliability in test scores is emphasized. The SEm is compared to the hypothetical true score for standardized tests, and procedures for calculation of the SEm are explained. (JDD)

Descriptors: Elementary Secondary Education, Error of Measurement, Scores, Standardized Tests

Basic Concepts in Classical Test Theory: Tests Aren't Reliable, the Nature of Alpha, and Reliability Generalization as a Meta-analytic Method.

Download full text

Helms, LuAnn Sherbeck – 1999

This paper discusses the fact that reliability is about scores and not tests and how reliability limits effect sizes. The paper also explores the classical reliability coefficients of stability, equivalence, and internal consistency. Stability is concerned with how stable test scores will be over time, while equivalence addresses the relationship…

Descriptors: Effect Size, Meta Analysis, Reliability, Scores

What Did the Massachusetts Teacher Tests Say about American Education?

Fowler, R. Clarke – Phi Delta Kappan, 2001

Research says the school-improvement mechanisms favored by policymakers-more certification tests (like the Massachusetts Educator Certification Test that 59 percent of candidates failed in 1998), higher cut scores, and severe penalties for institutions not meeting pass rates-are unlikely to deliver increased accountability and better teachers.…

Descriptors: Cutting Scores, Elementary Secondary Education, Instructional Improvement, Mass Media

Score Equating and Item Response Theory: Some Practical Considerations.

Download full text

Cook, Linda L.; Eignor, Daniel R. – 1981

The purposes of this paper are five-fold to discuss: (1) when item response theory (IRT) equating methods should provide better results than traditional methods; (2) which IRT model, the three-parameter logistic or the one-parameter logistic (Rasch), is the most reasonable to use; (3) what unique contributions IRT methods can offer the equating…

Descriptors: Equated Scores, Latent Trait Theory, Mathematical Models, Test Construction

Reliability Estimation When a Test Is Split into Two Parts of Unknown Effective Length.

Peer reviewed

Feldt, Leonard S. – Applied Measurement in Education, 2002

Considers the situation in which content or administrative considerations limit the way in which a test can be partitioned to estimate the internal consistency reliability of the total test score. Demonstrates that a single-valued estimate of the total score reliability is possible only if an assumption is made about the comparative size of the…

Descriptors: Error of Measurement, Reliability, Scores, Test Construction

Setting Passing Scores on Competency Tests.

Peer reviewed

Berk, Ronald A. – NASSP Bulletin, 1987

Reviews a dozen basic questions about passing scores on state-mandated competency testing programs for students, teachers, administrators, and other education personnel. Condemns the "cardiac" approach (or traditonal 80 percent standard) in favor of procedures that systematically incorporate judgment with a variety of performance data.…

Descriptors: Scores, Secondary Education, Standardized Tests, Student Evaluation

Effects of State Testing. Assessment Brief. Number 2

Download full text

Dietel, Ron – Center for Assessment and Evaluation of Student Learning (CAESL) at WestEd, 2004

The increase in large-scale standardized testing in the nation's schools is one major result of demands for accountability. Such testing also impacts many aspects of school and family life and culture. For some, certain unintended impacts may be like side effects of medicines that need to be tolerated; for others they raise larger questions about…

Descriptors: Testing, Standardized Tests, Scores, Accountability

Equating Achievement Across Subjects: Is It Possible? The Queensland Experience.

Peer reviewed

Nuyen, N. A. – Studies in Educational Evaluation, 1986

This paper looks at some of the problems arising from the procedures used by the Queensland, Australia, Board of Secondary School Studies to compile percentile scores known as Tertiary Entrance Scores. Although efforts to equate across schools and across subjects are being challenged, at present there is no alternative. (LMO)

Descriptors: Equated Scores, Foreign Countries, Norm Referenced Tests, Scaling

Threats to Score Comparability with Applications to Performance Assessments and Computerized Adaptive Tests

Peer reviewed

Direct link

Kolen, Michael J. – Educational Assessment, 1999

Develops a conceptual framework that addresses score comparability for performance assessments, adaptive tests, paper-and-pencil tests, and alternate item pools for computerized tests. Outlines testing situation aspects that might threaten score comparability and describes procedures for evaluating the degree of score comparability. Suggests ways…

Descriptors: Adaptive Testing, Comparative Analysis, Computer Assisted Testing, Performance Based Assessment

Previous Page | Next Page »

Pages: 1 | 2 | 3

Baker, Eva L.	1
Beguin, A. A.	1
Berk, Ronald A.	1
Brown, Jonathan R.	1
Cavanagh, Sean	1
Chang, Moon K.	1
Chen, Yunxiao	1
Cook, Linda L.	1
Dietel, Ron	1
Dyer, Henry S.	1
Echternacht, Gary	1
Eignor, Daniel R.	1
Feldt, Leonard S.	1
Foster, Jeff L.	1
Fowler, R. Clarke	1
Goldberg, Gail Lynn	1
Hambleton, Ronald K	1
Hambleton, Ronald K.	1
Heaney, Kevin J.	1
Helms, LuAnn Sherbeck	1
Holmes, Susan E.	1
Kolen, Michael J.	1
Koretz, Daniel	1
Lee, Yi-Hsuan	1
More ▼