Showing 1 to 15 of 77 results
Peer reviewed
Atteberry, Allison; Mangan, Daniel – Educational Researcher, 2020
Papay (2011) noticed that teacher value-added measures (VAMs) from a statistical model using the most common pre/post testing timeframe--current-year spring relative to previous spring (SS)--are essentially unrelated to those same teachers' VAMs when instead using next-fall relative to current-fall (FF). This is concerning since this choice--made…
Descriptors: Correlation, Value Added Models, Pretests Posttests, Decision Making
Peer reviewed
Macqueen, Susy; Knoch, Ute; Wigglesworth, Gillian; Nordlinger, Rachel; Singer, Ruth; McNamara, Tim; Brickle, Rhianna – Language Testing, 2019
All educational testing is intended to have consequences, which are assumed to be beneficial, but tests may also have unintended, negative consequences (Messick, 1989). The issue is particularly important in the case of large-scale standardized tests, such as Australia's "National Assessment Program--Literacy and Numeracy" (NAPLAN), the…
Descriptors: Numeracy, Standardized Tests, National Curriculum, Testing Programs
Peer reviewed
Davis-Becker, Susan L.; Buckendahl, Chad W. – International Journal of Testing, 2013
A critical component of the standard setting process is collecting evidence to evaluate the recommended cut scores and their use for making decisions and classifying students based on test performance. Kane (1994, 2001) proposed a framework by which practitioners can identify and evaluate evidence of the results of the standard setting from (1)…
Descriptors: Standard Setting (Scoring), Evidence, Validity, Cutting Scores
Peer reviewed
Chow, Kui Foon; Kennedy, Kerry John – Educational Research and Evaluation, 2014
International large-scale assessments are now part of the educational landscape in many countries and often feed into major policy decisions. Yet, such assessments also provide data sets for secondary analysis that can address key issues of concern to educators and policymakers alike. Traditionally, such secondary analyses have been based on a…
Descriptors: Measurement, Data Analysis, Educational Assessment, Multivariate Analysis
Peer reviewed
Qi, Sen; Mitchell, Ross E. – Journal of Deaf Studies and Deaf Education, 2012
The first large-scale, nationwide academic achievement testing program using the Stanford Achievement Test (Stanford) for deaf and hard-of-hearing children in the United States started in 1969. Over the past three decades, the Stanford has served as a benchmark in the field of deaf education for assessing student academic achievement. However, the…
Descriptors: Testing Programs, Educational Testing, Deafness, Academic Achievement
Peer reviewed
Lane, Suzanne – Measurement: Interdisciplinary Research and Perspectives, 2012
Considering consequences in the evaluation of validity is not new, although it is still debated by Paul E. Newton and others. The argument-based approach to validity entails an interpretative argument that explicitly identifies the proposed interpretations and uses of test scores and a validity argument that provides a structure for evaluating the…
Descriptors: Educational Opportunities, Accountability, Validity, Inferences
Peer reviewed
PDF on ERIC
Creagh, Sue – TESOL in Context, 2014
Teachers are now experiencing the age of quantitative test-driven assessment, in which there is little weight accorded to teacher-based judgement about student progress. In the Australian context, the NAPLaN test has become a driving force in school and teacher accountability. The language of NAPLaN is one of bands and numerical scores and…
Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Student Evaluation
Peer reviewed
Buckendahl, Chad W.; Plake, Barbara S.; Davis, Susan L. – Applied Measurement in Education, 2009
The National Assessment of Educational Progress (NAEP) program is a series of periodic assessments administered nationally to samples of students and designed to measure different content areas. This article describes a multi-year study that focused on the breadth of the development, administration, maintenance, and renewal of the assessments in…
Descriptors: National Competency Tests, Audits (Verification), Testing Programs, Program Evaluation
Wang, Huan – ProQuest LLC, 2010
Multiple uses of the same assessment may present challenges for both the design and use of an assessment. Little advice, however, has been given to assessment developers as to how to understand the phenomena of multiple assessment use and meet the challenges these present. Particularly problematic is the case in which an assessment is used for…
Descriptors: Test Use, Testing Programs, Program Effectiveness, Test Construction
Peer reviewed
Wang, Shudong; Jiao, Hong – Educational and Psychological Measurement, 2009
In practice, vertical scales have long been used to measure students' achievement progress across several grade levels, yet constructing them is considered a very challenging psychometric procedure. Recently, such practices have drawn considerable criticism. The major criticisms focus on dimensionality and construct equivalence of the latent trait or…
Descriptors: Reading Comprehension, Elementary Secondary Education, Measures (Individuals), Psychometrics
Jamgochian, Elisa; Park, Bitnara Jasmine; Nese, Joseph F. T.; Lai, Cheng-Fei; Saez, Leilani; Anderson, Daniel; Alonzo, Julie; Tindal, Gerald – Behavioral Research and Teaching, 2010
In this technical report, we provide reliability and validity evidence for the easyCBM[R] Reading measures for grade 2 (word and passage reading fluency and multiple choice reading comprehension). Evidence for reliability includes internal consistency and item invariance. Evidence for validity includes concurrent, predictive, and construct…
Descriptors: Grade 2, Reading Comprehension, Testing Programs, Reading Fluency
Peer reviewed
Jones, Terry; Cason, Carolyn L.; Mancini, Mary E. – Journal of Professional Nursing, 2002
Registered nurses (n=368) participated in a skills recredentialing program in which competencies were assessed by a knowledge test and a performance test under simulated conditions, and by evaluator ratings in actual patient-care situations. The absence of significant differences in results between the simulated and actual conditions supports the validity of the…
Descriptors: Competence, Credentials, Interrater Reliability, Nurses
Saez, Leilani; Park, Bitnara; Nese, Joseph F. T.; Jamgochian, Elisa; Lai, Cheng-Fei; Anderson, Daniel; Kamata, Akihito; Alonzo, Julie; Tindal, Gerald – Behavioral Research and Teaching, 2010
In this series of studies, we investigated the technical adequacy of three curriculum-based measures used as benchmarks and for monitoring progress in three critical reading- related skills: fluency, reading comprehension, and vocabulary. In particular, we examined the following easyCBM measurement across grades 3-7 at fall, winter, and spring…
Descriptors: Elementary School Students, Middle School Students, Vocabulary, Reading Comprehension
Peer reviewed
Guskey, Thomas R.; Kifer, Edward W. – Educational Measurement: Issues and Practice, 1990
How state educational authorities in Kentucky use statewide test data to rank the state's 178 school districts was studied, using data from the "Kentucky Essential Skills Test: Statewide Testing Results" (1987). The methods used, means of refining those methods, the fairness/accuracy/validity of resulting interpretations, and problems…
Descriptors: School Districts, School Effectiveness, State Programs, Test Results
Nering, Michael L.; Bay, Luz G.; Meijer, Rob R. – 2000
In state assessment programs in which performance has no real immediate consequence for the individual examinee, the issue of examinee motivation arises. Some examinees may respond to questions in ways that do not reflect their real knowledge of the test domain. In this study, a new approach was developed to identify students who have responded to…
Descriptors: Elementary Secondary Education, Responses, Scores, State Programs