ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	2
Since 2006 (last 20 years)	10

Source

Educational Measurement:…	8
Applied Measurement in…	5
American Journal of Education	1
Education Policy Analysis…	1
Educational Assessment	1
Educational Research and…	1
Educational Researcher	1
Educational and Psychological…	1
International Journal of…	1
Journal of Deaf Studies and…	1
Journal of Personnel…	1
Journal of Professional…	1
Language Testing	1
Measurement:…	1
Practical Assessment,…	1
TESOL in Context	1
Yearbook of the National…	1
More ▼

Publication Type

Journal Articles	28
Reports - Descriptive	12
Reports - Research	10
Reports - Evaluative	4
Speeches/Meeting Papers	3
Book/Product Reviews	1
Legal/Legislative/Regulatory…	1
Opinion Papers	1
Tests/Questionnaires	1

Education Level

Elementary Education	3
Elementary Secondary Education	3
Secondary Education	3
Grade 8	2
Junior High Schools	2
Middle Schools	2
Grade 10	1
Grade 3	1
Grade 4	1
Grade 5	1
Grade 6	1
Grade 7	1
Grade 9	1
More ▼

Audience

Location

Australia	2
United States	2
Hong Kong	1
Indonesia	1
Kentucky	1
South Korea	1
Taiwan	1
Texas	1
Thailand	1

Laws, Policies, & Programs

No Child Left Behind Act 2001

Assessments and Surveys

Texas Assessment of Academic…	5
National Assessment of…	2
Early Childhood Longitudinal…	1
SAT (College Admission Test)	1
Stanford Achievement Tests	1

What Works Clearinghouse Rating

Showing 1 to 15 of 28 results Save | Export

The Sensitivity of Teacher Value-Added Scores to the Use of Fall or Spring Test Scores

Peer reviewed

Direct link

Atteberry, Allison; Mangan, Daniel – Educational Researcher, 2020

Papay (2011) noticed that teacher value-added measures (VAMs) from a statistical model using the most common pre/post testing timeframe--current-year spring relative to previous spring (SS)--are essentially unrelated to those same teachers' VAMs when instead using next-fall relative to current-fall (FF). This is concerning since this choice--made…

Descriptors: Correlation, Value Added Models, Pretests Posttests, Decision Making

The Impact of National Standardized Literacy and Numeracy Testing on Children and Teaching Staff in Remote Australian Indigenous Communities

Peer reviewed

Direct link

Macqueen, Susy; Knoch, Ute; Wigglesworth, Gillian; Nordlinger, Rachel; Singer, Ruth; McNamara, Tim; Brickle, Rhianna – Language Testing, 2019

All educational testing is intended to have consequences, which are assumed to be beneficial, but tests may also have unintended, negative consequences (Messick, 1989). The issue is particularly important in the case of large-scale standardized tests, such as Australia's "National Assessment Program--Literacy and Numeracy" (NAPLAN), the…

Descriptors: Numeracy, Standardized Tests, National Curriculum, Testing Programs

Identifying and Evaluating External Validity Evidence for Passing Scores

Peer reviewed

Direct link

Davis-Becker, Susan L.; Buckendahl, Chad W. – International Journal of Testing, 2013

A critical component of the standard setting process is collecting evidence to evaluate the recommended cut scores and their use for making decisions and classifying students based on test performance. Kane (1994, 2001) proposed a framework by which practitioners can identify and evaluate evidence of the results of the standard setting from (1)…

Descriptors: Standard Setting (Scoring), Evidence, Validity, Cutting Scores

Secondary Analysis of Large-Scale Assessment Data: An Alternative to Variable-Centred Analysis

Peer reviewed

Direct link

Chow, Kui Foon; Kennedy, Kerry John – Educational Research and Evaluation, 2014

International large-scale assessments are now part of the educational landscape in many countries and often feed into major policy decisions. Yet, such assessments also provide data sets for secondary analysis that can address key issues of concern to educators and policymakers alike. Traditionally, such secondary analyses have been based on a…

Descriptors: Measurement, Data Analysis, Educational Assessment, Multivariate Analysis

Large-Scale Academic Achievement Testing of Deaf and Hard-of-Hearing Students: Past, Present, and Future

Peer reviewed

Direct link

Qi, Sen; Mitchell, Ross E. – Journal of Deaf Studies and Deaf Education, 2012

The first large-scale, nationwide academic achievement testing program using Stanford Achievement Test (Stanford) for deaf and hard-of-hearing children in the United States started in 1969. Over the past three decades, the Stanford has served as a benchmark in the field of deaf education for assessing student academic achievement. However, the…

Descriptors: Testing Programs, Educational Testing, Deafness, Academic Achievement

Consequences of Assessment and Accountability Systems Are Integral to the Argument-Based Approach to Validity

Peer reviewed

Direct link

Lane, Suzanne – Measurement: Interdisciplinary Research and Perspectives, 2012

Considering consequences in the evaluation of validity is not new although it is still debated by Paul E. Newton and others. The argument-based approach to validity entails an interpretative argument that explicitly identifies the proposed interpretations and uses of test scores and a validity argument that provides a structure for evaluating the…

Descriptors: Educational Opportunities, Accountability, Validity, Inferences

NAPLaN Test Data, ESL Bandscales and the Validity of EAL/D Teacher Judgement of Student Performance

Peer reviewed
PDF on ERIC

Download full text

Creagh, Sue – TESOL in Context, 2014

Teachers are now experiencing the age of quantitative test-driven assessment, in which there is little weight accorded to teacher-based judgement about student progress. In the Australian context, the NAPLaN test has become a driving force in school and teacher accountability. The language of NAPLaN is one of bands and numerical scores and…

Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Student Evaluation

Conducting a Lifecycle Audit of the National Assessment of Educational Progress

Peer reviewed

Direct link

Buckendahl, Chad W.; Plake, Barbara S.; Davis, Susan L. – Applied Measurement in Education, 2009

The National Assessment of Educational Progress (NAEP) program is a series of periodic assessments administered nationally to samples of students and designed to measure different content areas. This article describes a multi-year study that focused on the breadth of the development, administration, maintenance, and renewal of the assessments in…

Descriptors: National Competency Tests, Audits (Verification), Testing Programs, Program Evaluation

Construct Equivalence across Grades in a Vertical Scale for a K-12 Large-Scale Reading Assessment

Peer reviewed

Direct link

Wang, Shudong; Jiao, Hong – Educational and Psychological Measurement, 2009

In practice, vertical scales have been continually used to measure students' achievement progress across several grade levels and have been considered very challenging psychometric procedures. Recently, such practices have been drawing many criticisms. The major criticisms focus on dimensionality and construct equivalence of the latent trait or…

Descriptors: Reading Comprehension, Elementary Secondary Education, Measures (Individuals), Psychometrics

Evaluating Nurse Competency: Evidence of Validity for a Skills Recredentialing Program.

Peer reviewed

Jones, Terry; Cason, Carolyn L.; Mancini, Mary E. – Journal of Professional Nursing, 2002

Registered nurses (n=368) participated in a skills recredentialing program in which competencies were assessed by a knowledge test and performance test under simulated conditions and evaluator ratings in actual patient-care situations. No significant differences in results between the simulated and actual conditions support the validity of the…

Descriptors: Competence, Credentials, Interrater Reliability, Nurses

Ranking School Districts on the Basis of Statewide Test Results: Is It Meaningful or Misleading?

Peer reviewed

Guskey, Thomas R.; Kifer, Edward W. – Educational Measurement: Issues and Practice, 1990

How state educational authorities in Kentucky use statewide test data to rank the state's 178 school districts was studied, using data from the "Kentucky Essential Skills Test: Statewide Testing Results" (1987). The methods used, means of refining those methods, the fairness/accuracy/validity of resulting interpretations, and problems…

Descriptors: School Districts, School Effectiveness, State Programs, Test Results

Assessment Validation in the Context of High-Stakes Assessment.

Peer reviewed

Ryan, Katherine – Educational Measurement: Issues and Practice, 2002

Proposes a process approach to validity that addresses assessment validation in the context of high-stakes assessment. This approach includes a test evaluator or validator who considers the perspectives of five stakeholder groups at four different stages of assessment maturity in relation to six aspects of construct validity. Illustrates each…

Descriptors: Educational Assessment, Elementary Secondary Education, Evaluators, High Stakes Tests

Defending a State Graduation Test: "GI Forum v. Texas Education Agency." Measurement Perspectives from an External Evaluator.

Peer reviewed

Mehrens, William A. – Applied Measurement in Education, 2000

Presents conclusions of an independent measurement expert that the Texas Assessment of Academic Skills (TAAS) was constructed according to acceptable professional standards and tests curricular material considered by the Texas Board of Education important for graduates to have mastered. Also supports the validity and reliability of the TAAS and…

Descriptors: Curriculum, Psychometrics, Reliability, Standards

Stakeholders in Comprehensive Validation of Standards-Based Assessments: A Commentary.

Peer reviewed

Crocker, Linda – Educational Measurement: Issues and Practice, 2002

Introduces the articles of this theme issue focusing on the involvement of key stakeholder groups in the validation of large-scale high-stakes assessments. Each makes a unique but complementary contribution to the understanding of the demands of a comprehensive validation effort. (SLD)

Descriptors: Elementary Secondary Education, High Stakes Tests, Performance Based Assessment, Stakeholders

Validating High-Stakes Testing Programs.

Peer reviewed

Kane, Michael – Educational Measurement: Issues and Practice, 2002

Makes the point that the interpretations and use of high-stakes test scores rely on policy assumptions about what should be taught and the content standards and performance standards that should be applied. The assumptions built into an assessment need to be subjected to scrutiny and criticism if a strong case is to be made for the validity of the…

Descriptors: Educational Policy, Elementary Secondary Education, High Stakes Tests, Scores

Previous Page | Next Page »

Pages: 1 | 2

Testing Programs	28
Validity	28
Elementary Secondary Education	13
State Programs	10
Test Use	9
Accountability	8
Standards	7
Academic Achievement	6
Educational Assessment	6
High Stakes Tests	6
Reliability	6
Scores	6
Foreign Countries	4
Performance Based Assessment	4
Program Evaluation	4
Achievement Gains	3
Achievement Tests	3
Court Litigation	3
Educational Change	3
Educational Policy	3
Evaluation Methods	3
High School Students	3
High Schools	3
National Competency Tests	3
Stakeholders	3
More ▼

Haertel, Edward H.	3
Lane, Suzanne	3
Buckendahl, Chad W.	2
Stone, Clement A.	2
Atteberry, Allison	1
Brickle, Rhianna	1
Brookhart, Susan M.	1
Cason, Carolyn L.	1
Chow, Kui Foon	1
Creagh, Sue	1
Crocker, Linda	1
Davis, Susan L.	1
Davis-Becker, Susan L.	1
Guskey, Thomas R.	1
Haney, Walt	1
Heller, Joan I.	1
Herman, Joan L.	1
Jiao, Hong	1
Jones, Terry	1
Kane, Michael	1
Kennedy, Kerry John	1
Kifer, Edward W.	1
Knoch, Ute	1
Loadman, William E.	1
Macqueen, Susy	1
More ▼