Publication Date
In 2025 | 0 |
Since 2024 | 2 |
Since 2021 (last 5 years) | 2 |
Since 2016 (last 10 years) | 3 |
Since 2006 (last 20 years) | 8 |
Descriptor
Test Construction | 38 |
Test Use | 38 |
Validity | 38 |
Evaluation Methods | 13 |
Reliability | 13 |
Elementary Secondary Education | 12 |
Educational Assessment | 10 |
Student Evaluation | 8 |
Testing Programs | 8 |
State Programs | 7 |
Educational Testing | 6 |
More ▼ |
Source
Author
Publication Type
Education Level
Audience
Practitioners | 4 |
Teachers | 3 |
Administrators | 2 |
Researchers | 2 |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
Texas Assessment of Academic… | 2 |
National Assessment of… | 1 |
What Works Clearinghouse Rating
Jessica B. Koslouski; Sandra M. Chafouleas; Amy Briesch; Jacqueline M. Caemmerer; Brittany Melo – School Mental Health, 2024
We are developing the Equitable Screening to Support Youth (ESSY) Whole Child Screener to address concerns prevalent in existing school-based screenings that impede goals to advance educational equity using universal screeners. Traditional assessment development does not include end users in the early development phases, instead relying on a…
Descriptors: Screening Tests, Psychometrics, Validity, Child Development
Jessica B. Koslouski; Sandra M. Chafouleas; Amy Briesch; Jacqueline M. Caemmerer; Brittany Melo – Grantee Submission, 2024
We are developing the Equitable Screening to Support Youth (ESSY) Whole Child Screener to address concerns prevalent in existing school-based screenings that impede goals to advance educational equity using universal screeners. Traditional assessment development does not include end users in the early development phases, instead relying on a…
Descriptors: Screening Tests, Usability, Decision Making, Validity
Torres Irribarra, David – Measurement: Interdisciplinary Research and Perspectives, 2017
Maul's paper, "Rethinking Traditional Methods of Survey Validation," is a clever and pointed indictment of a set of specific but widespread practices in psychological measurement and the social sciences at large. Through it, Maul highlights central issues in the way to approach theory building and theory testing, bringing to mind the…
Descriptors: Surveys, Validity, Methods, Psychological Characteristics
American Educational Research Association (AERA), 2014
Developed jointly by the American Educational Research Association, American Psychological Association, and the National Council on Measurement in Education, "Standards for Educational and Psychological Testing" (Revised 2014) addresses professional and technical issues of test development and use in education, psychology, and…
Descriptors: Standards, Educational Testing, Psychological Testing, Test Construction
Harris, Sandra M.; Larrier, Yvonne I.; Castano-Bishop, Marianne – Online Journal of Distance Learning Administration, 2011
The problem of attrition in online learning has drawn attention from distance education administrators and chief academic officers of higher education institutions. Many studies have addressed factors related to student attrition, persistence and retention in online courses. However, few studies have examined how student expectations influence…
Descriptors: Electronic Learning, Student Attitudes, Distance Education, Academic Persistence
Wang, Huan – ProQuest LLC, 2010
Multiple uses of the same assessment may present challenges for both the design and use of an assessment. Little advice, however, has been given to assessment developers as to how to understand the phenomena of multiple assessment use and meet the challenges these present. Particularly problematic is the case in which an assessment is used for…
Descriptors: Test Use, Testing Programs, Program Effectiveness, Test Construction
Nichols, Paul D.; Meyers, Jason L.; Burling, Kelly S. – Educational Measurement: Issues and Practice, 2009
Assessments labeled as formative have been offered as a means to improve student achievement. But labels can be a powerful way to miscommunicate. For an assessment use to be appropriately labeled "formative," both empirical evidence and reasoned arguments must be offered to support the claim that improvements in student achievement can be linked…
Descriptors: Academic Achievement, Tutoring, Student Evaluation, Evaluation Methods
Herman, Joan L.; Osmundson, Ellen; Dietel, Ronald – Assessment and Accountability Comprehensive Center, 2010
This report describes the purposes of benchmark assessments and provides recommendations for selecting and using benchmark assessments--addressing validity, alignment, reliability, fairness and bias and accessibility, instructional sensitivity, utility, and reporting issues. We also present recommendations on building capacity to support schools'…
Descriptors: Multiple Choice Tests, Test Items, Benchmarking, Educational Assessment

Burney, DeAnna McKinnie; Kromrey, Jeffrey – Educational and Psychological Measurement, 2001
Studied the construct validity of scores on the Adolescent Anger Rating Scale (AARS) developed to measure instrumental and reactive anger. Results for 792 12- to 19-year-olds indicate that AARS scores are internally consistent and stable when anger subtypes are measured. (SLD)
Descriptors: Adolescents, Anger, Scores, Test Construction

Feldt, Leonard S. – Applied Measurement in Education, 1997
It has often been asserted that the reliability of a measure places an upper limit on its validity. This article demonstrates in theory that validity can rise when reliability declines, even when validity evidence is a correlation with an acceptable criterion. Whether empirical examples can actually be found is an open question. (SLD)
Descriptors: Correlation, Criteria, Reliability, Test Construction

Green, Donald Ross – Educational Measurement: Issues and Practice, 1998
Asserts that publishers of achievement tests are, for the most part, not in a position to obtain on their own any decent evidence about the consequences of uses made of their tests. Reasons why this is so are discussed, and what publishers can be expected to do is outlined. (SLD)
Descriptors: Achievement Tests, Elementary Secondary Education, Test Construction, Test Use

Yen, Wendy M. – Educational Measurement: Issues and Practice, 1998
The articles in this issue, written from the perspectives of academics, practitioners, and publishers, show that examining the consequences of assessment is an important, large, and difficult task. Collaborative action by assessment developers, users, and the educational measurement community is needed if progress is to be made. (SLD)
Descriptors: Cooperation, Evaluation Methods, Program Evaluation, Responsibility
Mullis, Ina V. S. – 2003
This paper addresses three key topics related to making state National Assessment of Educational Progress (NAEP) assessments more efficient: (1) reducing the burden for the states; (2) stabilizing the assessment schedule; and (3) facilitating and promoting the use of state NAEP data. The paper recommends promoting the use of state NAEP data for…
Descriptors: Data Analysis, Elementary Secondary Education, National Surveys, Test Construction

Moss, Pamela A. – Educational Measurement: Issues and Practice, 1998
Provides an argument for incorporating consideration of consequences into validity theory that is grounded in the reflexive nature of social knowledge. It also calls for the consideration of evidence of validity based on the actual discourse surrounding the practices and products of testing. (SLD)
Descriptors: Evaluation Methods, Evaluation Utilization, Program Evaluation, Test Construction

Mehrens, William A. – Applied Measurement in Education, 2000
Presents conclusions of an independent measurement expert that the Texas Assessment of Academic Skills (TAAS) was constructed according to acceptable professional standards and tests curricular material considered by the Texas Board of Education important for graduates to have mastered. Also supports the validity and reliability of the TAAS and…
Descriptors: Curriculum, Psychometrics, Reliability, Standards