Publication Date
In 2025 | 0 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 3 |
Since 2016 (last 10 years) | 4 |
Since 2006 (last 20 years) | 7 |
Descriptor
Test Construction | 14 |
Test Validity | 5 |
Validity | 4 |
Achievement Tests | 3 |
State Programs | 3 |
Test Content | 3 |
Testing Programs | 3 |
Advanced Placement Programs | 2 |
Alignment (Education) | 2 |
Curriculum | 2 |
Data Collection | 2 |
More ▼ |
Source
Applied Measurement in… | 14 |
Author
Twing, Jon S. | 2 |
Anne Traynor | 1 |
Ayala, Carlos C. | 1 |
Brandon, Paul R. | 1 |
Brennan, Robert L. | 1 |
Clauser, Brian E. | 1 |
Cruse, Keith L. | 1 |
Denny, Patricia | 1 |
Feldt, Leonard S. | 1 |
Furtak, Erin Marie | 1 |
Huff, Kristen | 1 |
More ▼ |
Publication Type
Journal Articles | 14 |
Reports - Descriptive | 14 |
Historical Materials | 1 |
Opinion Papers | 1 |
Education Level
High Schools | 2 |
Secondary Education | 2 |
Elementary Secondary Education | 1 |
Audience
Location
Laws, Policies, & Programs
Assessments and Surveys
Texas Assessment of Academic… | 4 |
What Works Clearinghouse Rating
Anne Traynor; Sara C. Christopherson – Applied Measurement in Education, 2024
Combining methods from earlier content validity and more contemporary content alignment studies may allow a more complete evaluation of the meaning of test scores than if either set of methods is used on its own. This article distinguishes item relevance indices in the content validity literature from test representativeness indices in the…
Descriptors: Test Validity, Test Items, Achievement Tests, Test Construction
O'Dwyer, Eowyn P.; Sparks, Jesse R.; Nabors Oláh, Leslie – Applied Measurement in Education, 2023
A critical aspect of the development of culturally relevant classroom assessments is the design of tasks that affirm students' racial and ethnic identities and community cultural practices. This paper describes the process we followed to build a shared understanding of what culturally relevant assessments are, to pursue ways of bringing more…
Descriptors: Evaluation Methods, Culturally Relevant Education, Test Construction, Educational Research
Leighton, Jacqueline P. – Applied Measurement in Education, 2021
The objective of this paper is to comment on the think-aloud methods presented in the three papers included in this special issue. The commentary offered stems from the author's own psychological investigations of unobservable information processes and the conditions under which the most defensible claims can be advanced. The structure of this…
Descriptors: Protocol Analysis, Data Collection, Test Construction, Test Validity
Rupp, André A. – Applied Measurement in Education, 2018
This article discusses critical methodological design decisions for collecting, interpreting, and synthesizing empirical evidence during the design, deployment, and operational quality-control phases for automated scoring systems. The discussion is inspired by work on operational large-scale systems for automated essay scoring but many of the…
Descriptors: Design, Automation, Scoring, Test Scoring Machines
Brennan, Robert L. – Applied Measurement in Education, 2010
This paper provides an overview of evidence-centered assessment design (ECD) and some general information about of the Advanced Placement (AP[R]) Program. Then the papers in this special issue are discussed, as they relate to the use of ECD in the revision of various AP tests. This paper concludes with some observations about the need to validate…
Descriptors: Advanced Placement Programs, Equivalency Tests, Evidence, Test Construction
Huff, Kristen; Steinberg, Linda; Matts, Thomas – Applied Measurement in Education, 2010
The cornerstone of evidence-centered assessment design (ECD) is an evidentiary argument that requires that each target of measurement (e.g., learning goal) for an assessment be expressed as a "claim" to be made about an examinee that is relevant to the specific purpose and audience(s) for the assessment. The "observable evidence" required to…
Descriptors: Advanced Placement Programs, Equivalency Tests, Evidence, Test Construction
Ayala, Carlos C.; Shavelson, Richard J.; Ruiz-Primo, Maria Araceli; Brandon, Paul R.; Yin, Yue; Furtak, Erin Marie; Young, Donald B.; Tomita, Miki K. – Applied Measurement in Education, 2008
The idea that formative assessments embedded in a curriculum could help guide teachers toward better instructional practices that lead to greater student learning has taken center stage in science assessment research. In order to embed formative assessments in a curriculum, curriculum developers and assessment specialists should collaborate to…
Descriptors: Student Evaluation, Formative Evaluation, Teaching Methods, Alignment (Education)

Wilson, Mark; Sloane, Kathryn – Applied Measurement in Education, 2000
Describes the principles that guided the creation and implementation of a system of embedded assessments, the Berkeley Evaluation and Assessment Research System (BEAR). The assessment system builds on methodological advances in alternative assessment. Discusses how the application of the principles generates the component parts of the system. (SLD)
Descriptors: Educational Practices, Evaluation Methods, Research, Student Evaluation

Smisko, Ann; Twing, Jon S.; Denny, Patricia – Applied Measurement in Education, 2000
Describes the Texas test development process in detail, showing how each test development step is linked to the "Standards for Educational and Psychological Testing." The routine use of this process provides evidence of the content and curricular validity of the Texas Assessment of Academic Skills. (SLD)
Descriptors: Achievement Tests, Curriculum, Models, Test Construction

Feldt, Leonard S. – Applied Measurement in Education, 2002
Considers the situation in which content or administrative considerations limit the way in which a test can be partitioned to estimate the internal consistency reliability of the total test score. Demonstrates that a single-valued estimate of the total score reliability is possible only if an assumption is made about the comparative size of the…
Descriptors: Error of Measurement, Reliability, Scores, Test Construction

Mehrens, William A. – Applied Measurement in Education, 2000
Presents conclusions of an independent measurement expert that the Texas Assessment of Academic Skills (TAAS) was constructed according to acceptable professional standards and tests curricular material considered by the Texas Board of Education important for graduates to have mastered. Also supports the validity and reliability of the TAAS and…
Descriptors: Curriculum, Psychometrics, Reliability, Standards

Clauser, Brian E.; Kane, Michael T.; Swanson, David B. – Applied Measurement in Education, 2002
Attempts to place the issues associated with computer-automated scoring within the context of current validity theory and presents a taxonomy of automated scoring procedures as a framework for discussing threats to validity that may take on increased importance for specific approaches to automated scoring. (SLD)
Descriptors: Classification, Computer Uses in Education, Performance Based Assessment, Test Construction

Schafer, William D. – Applied Measurement in Education, 2000
Draws seven conclusions for professionals who administer state assessment programs from the "GI Forum V. Texas Education Agency" ruling. These conclusions are grouped into observations about test development and observations about test use. Discusses some implications for test use in other states. (SLD)
Descriptors: Court Litigation, High School Students, High Schools, State Programs

Cruse, Keith L.; Twing, Jon S. – Applied Measurement in Education, 2000
Provides a chronological summary of the evolution of statewide achievement tests in Texas to facilitate understanding of the issues behind the "GI Forum v. Texas Education Agency" litigation (1999), which challenged the curricular links of the Texas Assessment of Academic Skills and other "opportunity to learn" issues. (SLD)
Descriptors: Achievement Tests, Educational History, Elementary Secondary Education, Graduation Requirements