ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	3
Since 2006 (last 20 years)	9

Descriptor

Performance Based Assessment	21
Scores	21
Test Interpretation	21
Educational Assessment	9
Standardized Tests	6
Test Construction	6
Test Validity	6
Construct Validity	5
Student Evaluation	5
Validity	5
Evaluation Methods	4
Scoring	4
Standards	4
Test Results	4
Test Use	4
Elementary Secondary Education	3
Evaluation Utilization	3
Knowledge Level	3
Measurement Techniques	3
Multiple Choice Tests	3
Norm Referenced Tests	3
Portfolio Assessment	3
Test Reliability	3
Academic Achievement	2
Alternative Assessment	2
More ▼

Source

Applied Measurement in…	3
Council for Aid to Education	1
Educational Measurement:…	1
Empirical Research in…	1
Journal of Educational…	1
Journal of Psychoeducational…	1
Journal of Research in…	1
Mackinac Center for Public…	1
ProQuest LLC	1
Rowman & Littlefield…	1

Publication Type

Journal Articles	8
Reports - Evaluative	8
Reports - Research	7
Books	3
Guides - Non-Classroom	2
Speeches/Meeting Papers	2
Book/Product Reviews	1
Dissertations/Theses -…	1
ERIC Digests in Full Text	1
ERIC Publications	1
Numerical/Quantitative Data	1
Opinion Papers	1
Reports - Descriptive	1
Tests/Questionnaires	1
More ▼

Education Level

Higher Education	4
Elementary Education	2
High Schools	2
Postsecondary Education	2
Secondary Education	2
Elementary Secondary Education	1
Grade 5	1
Grade 7	1
Grade 8	1
Grade 9	1
Junior High Schools	1
Middle Schools	1
More ▼

Audience

Practitioners	2
Community	1
Teachers	1

Location

California	1
California (Los Angeles)	1
Canada	1
Italy	1
Massachusetts	1
Michigan	1
United States	1
Vermont	1

Laws, Policies, & Programs

No Child Left Behind Act 2001

Assessments and Surveys

Gates MacGinitie Reading Tests	1
General Educational…	1
National Assessment of…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 21 results Save | Export

Does Timed Testing Affect the Interpretation of Efficiency Scores?--A GLMM Analysis of Reading Components

Peer reviewed

Direct link

Frank Goldhammer; Ulf Kroehne; Carolin Hahnel; Johannes Naumann; Paul De Boeck – Journal of Educational Measurement, 2024

The efficiency of cognitive component skills is typically assessed with speeded performance tests. Interpreting only effective ability or effective speed as efficiency may be challenging because of the within-person dependency between both variables (speed-ability tradeoff, SAT). The present study measures efficiency as effective ability…

Descriptors: Timed Tests, Efficiency, Scores, Test Interpretation

Digital Measurement of Hands-On Performance? Ecological Validation of a Computer-Based Assessment of Automotive Repair Skills

Peer reviewed

Direct link

Hartmann, Stefan; Güzel, Emre; Gschwendtner, Tobias – Empirical Research in Vocational Education and Training, 2023

We investigated the ecological validity of performance measures from a computer-based assessment tool that utilises scripted video vignettes. The intended purpose of this tool is to assess the maintenance and repair skills of automotive technician apprentices, complementing traditional hands-on assessment formats from the German journeymen's…

Descriptors: Performance Based Assessment, Computer Assisted Testing, Auto Mechanics, Job Skills

Evaluating Score and Decision Consistency across Claims in a Validation Argument

Peer reviewed

Direct link

Schmidgall, Jonathan – Applied Measurement in Education, 2017

This study utilizes an argument-based approach to validation to examine the implications of reliability in order to further differentiate the concepts of score and decision consistency. In a methodological example, the framework of generalizability theory was used to estimate appropriate indices of score consistency and evaluations of the…

Descriptors: Scores, Reliability, Validity, Generalizability Theory

Interpreting Secondary Students' Performance on a Timed, Multiple-Choice Reading Comprehension Assessment: The Prevalence and Impact of Non-Attempted Items

Peer reviewed

Direct link

Clemens, Nathan H.; Davis, John L.; Simmons, Leslie E.; Oslund, Eric L.; Simmons, Deborah C. – Journal of Psychoeducational Assessment, 2015

Standardized measures are often used as an index of students' reading comprehension and scores have important implications, particularly for students who perform below expectations. This study examined secondary-level students' patterns of responding and the prevalence and impact of non-attempted items on a timed, group-administered,…

Descriptors: Secondary School Students, Performance Based Assessment, Multiple Choice Tests, Reading Comprehension

A Case Study of an International Performance-Based Assessment of Critical Thinking Skills

Download full text

Wolf, Raffaela; Zahner, Doris; Kostoris, Fiorella; Benjamin, Roger – Council for Aid to Education, 2014

The measurement of higher-order competencies within a tertiary education system across countries presents methodological challenges due to differences in educational systems, socio-economic factors, and perceptions as to which constructs should be assessed (Blömeke, Zlatkin-Troitschanskaia, Kuhn, & Fege, 2013). According to Hart Research…

Descriptors: Case Studies, International Assessment, Performance Based Assessment, Critical Thinking

Statistical Profiling of Academic Oral English Proficiency Based on an ITA Screening Test

Direct link

Choi, Ick Kyu – ProQuest LLC, 2013

At the University of California, Los Angeles, the Test of Oral Proficiency (TOP), an internally developed oral proficiency test, is administered to international teaching assistant (ITA) candidates to ensure an appropriate level of academic oral English proficiency. Test taker performances are rated live by two raters according to four subscales.…

Descriptors: Screening Tests, Profiles, Oral Language, English

The 2014 Michigan Public High School Context and Performance Report Card

Download full text

Spalding, Audrey – Mackinac Center for Public Policy, 2014

The 2014 Michigan Public High School Context and Performance Report Card is the Mackinac Center's second effort to measure high school performance. The first high school assessment was published in 2012, followed by the Center's 2013 elementary and middle school report card, which used a similar methodology to evaluate school performance. The…

Descriptors: Academic Achievement, Achievement Rating, Comparative Analysis, Comparative Testing

"I Never Thought of It as Freezing": How Students Answer Questions on Large-Scale Science Tests and What They Know about Science

Peer reviewed

Direct link

Noble, Tracy; Suarez, Catherine; Rosebery, Ann; O'Connor, Mary Catherine; Warren, Beth; Hudicourt-Barnes, Josiane – Journal of Research in Science Teaching, 2012

Education policy in the U.S. in the last two decades has emphasized large-scale assessment of students, with growing consequences for schools, teachers, and students. Given the high stakes of such tests, it is important to understand the relationships between students' answers to test items and their knowledge and skills in the tested content…

Descriptors: Testing, Science Tests, Second Language Learning, Measures (Individuals)

Classroom Assessment in Action

Direct link

Shermis, Mark D.; DiVesta, Francis J. – Rowman & Littlefield Publishers, Inc., 2011

"Classroom Assessment in Action" clarifies the multi-faceted roles of measurement and assessment and their applications in a classroom setting. Comprehensive in scope, Shermis and Di Vesta explain basic measurement concepts and show students how to interpret the results of standardized tests. From these basic concepts, the authors then…

Descriptors: Student Evaluation, Standardized Tests, Scores, Measurement

Validating Licensing and Certification Test Score Interpretations and Decisions: A Response.

Peer reviewed

Mehrens, William A. – Applied Measurement in Education, 1997

This commentary on articles in this special issue generally agrees with the viewpoints expressed, although it argues that in some cases the authors of these articles should have expanded on certain issues. Many comments relate to the legal defensibility of the positions taken. (SLD)

Descriptors: Certification, Decision Making, Licensing Examinations (Professions), Performance Based Assessment

Standards of Validity and the Validity of Standards in Performance Assessment.

Peer reviewed

Messick, Samuel – Educational Measurement: Issues and Practice, 1995

Six distinguishable aspects of construct validity are discussed as they apply to performance assessment, emphasizing content, substantive, structural, generalizability, external, and consequential aspects. Taken together, these aspects provide a way to address validity questions in score interpretation and use. (SLD)

Descriptors: Construct Validity, Content Validity, Educational Assessment, Generalization

Standards-Based Score Interpretation: Establishing Valid Grounds for Valid Inferences. Research Report.

Download full text

Messick, Samuel – 1994

The construct validity of content standards is addressed in terms of their representative coverage of a construct domain and their alignment with the students' cognitive level of developing expertise in the subject matter. The construct validity of performance standards is addressed in terms of the extent to which they reflect increasing levels of…

Descriptors: Construct Validity, Educational Assessment, Inferences, Knowledge Level

Alternative Modes of Assessment, Uniform Standards of Validity. Research Report.

Download full text

Messick, Samuel – 1994

In contrast to multiple choice, alternative modes of assessment afford varying degrees of openness in the allowable responses. Prominent among the alternatives is the assessment of performance, sometimes in its own right where the issue is the quality of the particular performance per se, but more often as a vehicle for the assessment of…

Descriptors: Alternative Assessment, Construct Validity, Educational Assessment, Inferences

Validity and Washback in Language Testing.

Download full text

Messick, Samuel – 1996

The concept of "washback," especially prominent in the field of applied linguistics, refers to the extent to which a test influences teachers and learners to do things they would not otherwise necessarily do. Some writers invoke the notion of washback validity, holding that a test's validity should be gauged by the degree to which it has…

Descriptors: Applied Linguistics, Construct Validity, Criteria, Language Tests

Test Scores and What They Mean. Sixth Edition.

Lyman, Howard B. – 1998

The first edition of this book was written to give information about testing to people whose work gave them access to test results, but whose training included little or nothing about the use and interpretation of tests. Later editions have been intended for a broader audience as the need for understanding what test scores really mean has…

Descriptors: Educational Testing, Norm Referenced Tests, Performance Based Assessment, Psychometrics

Previous Page | Next Page »

Pages: 1 | 2

Messick, Samuel	4
Archbald, Doug A.	1
Benjamin, Roger	1
Carolin Hahnel	1
Choi, Ick Kyu	1
Clemens, Nathan H.	1
Davis, John L.	1
DiVesta, Francis J.	1
Dunbar, Stephen B.	1
Frank Goldhammer	1
Gschwendtner, Tobias	1
Güzel, Emre	1
Hartmann, Stefan	1
Hudicourt-Barnes, Josiane	1
Johannes Naumann	1
Koretz, Daniel	1
Kostoris, Fiorella	1
Lyman, Howard B.	1
Mehrens, William A.	1
Moran, Joseph J.	1
Noble, Tracy	1
O'Connor, Mary Catherine	1
Oslund, Eric L.	1
Paul De Boeck	1
More ▼