Publication Date
In 2025 | 0 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 5 |
Since 2016 (last 10 years) | 7 |
Since 2006 (last 20 years) | 20 |
Descriptor
Student Evaluation | 35 |
Test Content | 35 |
Test Validity | 25 |
Test Construction | 18 |
Evaluation Methods | 14 |
Test Reliability | 12 |
Test Items | 9 |
Testing | 9 |
Scores | 7 |
Test Format | 7 |
Elementary Secondary Education | 6 |
More ▼ |
Source
Author
Johnson, Bil | 2 |
Abdullah, Nor Athiyah | 1 |
Ackerman, Debra J. | 1 |
Adkins, Deborah | 1 |
Ahmed, S. | 1 |
Amit Sevak | 1 |
Baxter, G. P. | 1 |
Bello, Hassan | 1 |
Breakstone, Joel | 1 |
Brown, Richard S. | 1 |
Bruce, Bertram C. | 1 |
More ▼ |
Publication Type
Education Level
Audience
Practitioners | 5 |
Teachers | 5 |
Policymakers | 1 |
Researchers | 1 |
Location
Delaware | 2 |
Illinois | 2 |
Maryland | 2 |
Ohio | 2 |
Washington | 2 |
Arizona | 1 |
California | 1 |
Colorado | 1 |
Florida | 1 |
Idaho | 1 |
Indiana | 1 |
More ▼ |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 3 |
Individuals with Disabilities… | 2 |
Every Student Succeeds Act… | 1 |
Assessments and Surveys
SAT (College Admission Test) | 2 |
Delaware Student Testing… | 1 |
Florida Comprehensive… | 1 |
Language Assessment Scales | 1 |
National Assessment of… | 1 |
Purdue Spatial Visualization… | 1 |
Woodcock Language Proficiency… | 1 |
What Works Clearinghouse Rating
Practices in Instrument Use and Development in "Chemistry Education Research and Practice" 2010-2021
Lazenby, Katherine; Tenney, Kristin; Marcroft, Tina A.; Komperda, Regis – Chemistry Education Research and Practice, 2023
Assessment instruments that generate quantitative data on attributes (cognitive, affective, behavioral, "etc.") of participants are commonly used in the chemistry education community to draw conclusions in research studies or inform practice. Recently, articles and editorials have stressed the importance of providing evidence for the…
Descriptors: Chemistry, Periodicals, Journal Articles, Science Education
Kristin Bartlett – ProQuest LLC, 2023
At the highest level, this dissertation is a case study on how bias can become encoded into the tools used to measure a construct and into the very definition of the construct itself. In this case, the construct is spatial ability. This dissertation focuses on the validity and accuracy of spatial tests and illuminates gender bias that is…
Descriptors: Spatial Ability, Student Evaluation, Measures (Individuals), Validity
Dadey, Nathan; Gong, Brian – Smarter Balanced Assessment Consortium, 2023
This document is written primarily for policy makers and state department of education staff who are considering through-year assessments, as well as consultants and contractors state departments rely on. The document identifies essential things to consider when designing or evaluating a through-year assessment program. The paper is organized into…
Descriptors: Student Evaluation, Progress Monitoring, Summative Evaluation, Standardized Tests
Patrick C. Kyllonen; Amit Sevak; Teresa Ober; Ikkyu Choi; Jesse Sparks; Daniel Fishtein – ETS Research Institute, 2024
Assessment refers to a broad array of approaches for measuring or evaluating a person's (or group of persons') skills, behaviors, dispositions, or other attributes. Assessments range from standardized tests used in admissions, employee selection, licensure examinations, and domestic and international largescale assessments of cognitive and…
Descriptors: Performance Based Assessment, Evaluation Criteria, Evaluation Methods, Test Bias
Mattern, Krista – ACT, Inc., 2019
A great deal has been written on the topic of test validity. Guiding our work at ACT are "The Standards for Educational and Psychological Testing" (2014), which outlines best practices in test development and validation. As ACT transitions from an assessment company to a learning, measurement, and navigation organization, a framework for…
Descriptors: Test Validity, Measurement Techniques, Evidence, Test Content
Bello, Hassan; Abdullah, Nor Athiyah – Electronic Journal of e-Learning, 2021
Computer-based assessment or e-assessment system is an e-learning system where information communication technology is utilized for examination activity, grading, and recording of responses of the examinees. It includes the entire assessment process from the examinees, teachers, institutions, examination agencies, and the public. E-assessment…
Descriptors: Evaluation Methods, Computer Assisted Testing, Technology Uses in Education, Program Effectiveness
Ackerman, Debra J. – ETS Research Report Series, 2018
Kindergarten entry assessments (KEAs) have increasingly been incorporated into state education policies over the past 5 years, with much of this interest stemming from Race to the Top--Early Learning Challenge (RTT-ELC) awards, Enhanced Assessment Grants, and nationwide efforts to develop common K-12 state learning standards. Drawing on…
Descriptors: Screening Tests, Kindergarten, Test Validity, Test Reliability
Fives, Helenrose; DiDonato-Barnes, Nicole – Practical Assessment, Research & Evaluation, 2013
Classroom tests provide teachers with essential information used to make decisions about instruction and student grades. A table of specification (TOS) can be used to help teachers frame the decision making process of test construction and improve the validity of teachers' evaluations based on tests constructed for classroom use. In this article…
Descriptors: Student Evaluation, Test Construction, Test Content, Teacher Made Tests
Breakstone, Joel – Theory and Research in Social Education, 2014
This article considers the design process for new formative history assessments. Over the course of 3 years, my colleagues from the Stanford History Education Group and I designed, piloted, and revised dozens of "History Assessments of Thinking" (HATs). As we created HATs, we sought to gather information about their cognitive validity,…
Descriptors: History Instruction, Formative Evaluation, Tests, Correlation
Nunan, Anna – Language Learning in Higher Education, 2014
The Applied Language Centre at University College Dublin offers foreign language modules to students in ten languages at CEFR [Common European Framework of Reference for Languages] levels ranging from A1 to B2. Efforts have been underway in the Centre to standardise the assessment components across languages to ensure parity between module credits…
Descriptors: Second Language Learning, Second Language Instruction, College Students, Standards
Camilli, Gregory – Educational Research and Evaluation, 2013
In the attempt to identify or prevent unfair tests, both quantitative analyses and logical evaluation are often used. For the most part, fairness evaluation is a pragmatic attempt at determining whether procedural or substantive due process has been accorded to either a group of test takers or an individual. In both the individual and comparative…
Descriptors: Alternative Assessment, Test Bias, Test Content, Test Format
Wiliam, Dylan – Review of Research in Education, 2010
The idea that validity should be considered a property of inferences, rather than of assessments, has developed slowly over the past century. In early writings about the validity of educational assessments, validity was defined as a property of an assessment. The most common definition was that an assessment was valid to the extent that it…
Descriptors: Educational Assessment, Validity, Inferences, Construct Validity
Sadler, D. Royce – Assessment & Evaluation in Higher Education, 2010
If a grade is to be trusted as an authentic representation of a student's level of academic achievement, one of the requirements is that all the elements that contribute to that grade must qualify as achievement, and not be something else. The implications of taking this proposition literally turn out to be far reaching. Many elements that are…
Descriptors: Student Evaluation, Academic Achievement, Integrity, Credits
Gorin, Joanna S. – Educational Researcher, 2007
Lissitz and Samuelsen (2007) propose a new framework for validity theory and terminology, emphasizing a shift in theory and practice toward issues of test content rather than constructs. The author of this article argues that several of Lissitz and Samuelsen's critiques of validity theory focus on previously considered, but subsequently discarded,…
Descriptors: Test Content, Test Validity, Construct Validity, Test Construction
Brown, Richard S.; Coughlin, Ed – Regional Educational Laboratory Mid-Atlantic, 2007
This report examines the availability and quality of predictive validity data for a selection of benchmark assessments identified by state and district personnel as in use within Mid-Atlantic Region jurisdictions. Based on a review of practices within the school districts in the region, this report details the benchmark assessments being used, in…
Descriptors: Test Content, Academic Achievement, Predictive Validity, Program Effectiveness