ERIC - Search Results

Publication Date

In 2025	1
Since 2024	1
Since 2021 (last 5 years)	3
Since 2016 (last 10 years)	5
Since 2006 (last 20 years)	17

Descriptor

Evaluation Methods	132
Test Use	132
Test Validity	93
Test Construction	54
Test Reliability	45
Educational Assessment	40
Student Evaluation	37
Validity	33
Elementary Secondary Education	31
Test Interpretation	24
Testing Problems	19
Performance Based Assessment	18
Higher Education	17
Psychometrics	17
Foreign Countries	16
Testing	16
Reliability	15
Measurement Techniques	14
Test Bias	14
Educational Testing	13
Models	12
Comparative Analysis	11
Scores	11
Test Format	11
Test Selection	11

Education Level

Elementary Secondary Education	7
Elementary Education	2
Higher Education	2
Postsecondary Education	2
Adult Basic Education	1
Adult Education	1
Early Childhood Education	1
Preschool Education	1

Audience

Practitioners	11
Teachers	6
Researchers	4
Community	2
Parents	1
Students	1

Location

Canada	4
United Kingdom	3
United Kingdom (England)	3
United States	2
Australia	1
California	1
Iran	1
Netherlands	1
North Carolina	1
South Carolina	1
United Kingdom (Great Britain)	1
United Kingdom (Wales)	1
Virginia	1

Laws, Policies, & Programs

No Child Left Behind Act 2001	2
Every Student Succeeds Act…	1

Showing 1 to 15 of 132 results

Using Multilabel Neural Network to Score High-Dimensional Assessments for Different Use Foci: An Example with College Major Preference Assessment

Peer reviewed

Shun-Fu Hu; Amery D. Wu; Jake Stone – Journal of Educational Measurement, 2025

Scoring high-dimensional assessments (e.g., > 15 traits) can be a challenging task. This paper introduces the multilabel neural network (MNN) as a scoring method for high-dimensional assessments. Additionally, it demonstrates how MNN can score the same test responses to maximize different performance metrics, such as accuracy, recall, or…

Descriptors: Tests, Testing, Scores, Test Construction

Core Considerations for Selecting a Screener. Improving Literacy Brief

National Center on Improving Literacy, 2022

There are many available screeners for reading and other education or social-emotional outcomes. This brief outlines important things to consider when choosing and using a screener.

Descriptors: Screening Tests, Literacy, Social Emotional Learning, Decision Making

Critical Language Assessment Literacy of EFL Teachers: Scale Construction and Validation

Peer reviewed

Tajeddin, Zia; Khatib, Mohammad; Mahdavi, Mohsen – Language Testing, 2022

Critical language assessment (CLA) has been addressed in numerous studies. However, the majority of the studies have overlooked the need for a practical framework to measure the CLA dimension of teachers' language assessment literacy (LAL). This gap prompted us to develop and validate a critical language assessment literacy (CLAL) scale to further…

Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Language Tests

A Brief Guide to Selecting and Using Pre-Post Assessments

Sanders, Sara – National Technical Assistance Center for the Education of Neglected or Delinquent Children and Youth (NDTAC), 2019

This guide is designed to assist States, agencies, and/or facilities who work with youth who are neglected, delinquent, or at-risk (N or D). The information in the guide will benefit those who are (a) interested in implementing pre-posttests, (b) in the process of identifying an appropriate pre-posttest, or (c) ready to evaluate current testing…

Descriptors: At Risk Students, Delinquency, Pretests Posttests, Testing

ITC Guidelines for the Large-Scale Assessment of Linguistically and Culturally Diverse Populations

Peer reviewed

International Journal of Testing, 2019

These guidelines describe considerations relevant to the assessment of test takers in or across countries or regions that are linguistically or culturally diverse. The guidelines were developed by a committee of experts to help inform test developers, psychometricians, test users, and test administrators about fairness issues in support of the…

Descriptors: Test Bias, Student Diversity, Cultural Differences, Language Usage

Standards for Educational and Psychological Testing, 2014 Edition

Peer reviewed

American Educational Research Association (AERA), 2014

Developed jointly by the American Educational Research Association, American Psychological Association, and the National Council on Measurement in Education, "Standards for Educational and Psychological Testing" (Revised 2014) addresses professional and technical issues of test development and use in education, psychology, and…

Descriptors: Standards, Educational Testing, Psychological Testing, Test Construction

Assessment Issues in Languages for Specific Purposes

Peer reviewed

O'Sullivan, Barry – Modern Language Journal, 2012

While Grosse and Voght (1991) set out a well-considered overview of LSP and identified areas in need of development, they limited their observations on the topic of assessment to a short section devoted to what they called the "proficiency movement." While it is true that they really did not have a lot to report on at the time they wrote their…

Descriptors: Theory Practice Relationship, Work Environment, Languages for Special Purposes, Language Tests

What Constitutes Legitimate Causal Linking?

Peer reviewed

Baird, Jo-Anne – Measurement: Interdisciplinary Research and Perspectives, 2010

Newton's article (2010) makes three main contributions to the literature. First, it is transatlantic, bringing together literatures that have been dealing with similar problems, using sometimes different methods and certainly with distinctive educational, cultural perspectives. He points out that neither of these literatures has all of the…

Descriptors: Foreign Countries, Predictive Validity, Standards, Ethics

Assessment of Prior Learning in Higher Education: A Review from a Validity Perspective

Peer reviewed

Stenlund, Tova – Assessment & Evaluation in Higher Education, 2010

The process of giving official acknowledgment to formal, informal and non-formal prior learning is commonly labelled as assessment, accreditation or recognition of prior learning (APL), representing a practice that is expanding in higher education in many countries. This paper focuses specifically on the assessment part of APL, which undoubtedly…

Descriptors: Higher Education, Validity, Prior Learning, Program Effectiveness

Linking through Improved Design, Not Redefinition: Commentary on Newton

Peer reviewed

Walker, Michael E. – Measurement: Interdisciplinary Research and Perspectives, 2010

"Linking" is a term given to a general class of procedures by which one represents scores X on one test or measure in terms of scores Y on another test or measure. A recent taxonomy by Holland and Dorans (2006; Holland, 2007) organizes the various types of links into three broad categories: prediction, scale aligning, and equating. In…

Descriptors: Foreign Countries, Test Construction, Test Validity, Measurement Techniques

What Dictates the Meaning of Test Linking? A Reaction to "Thinking about Linking"

Peer reviewed

von Davier, Alina A. – Measurement: Interdisciplinary Research and Perspectives, 2010

The article "Thinking About Linking" by Newton (2010) presents a novel philosophical perspective on the way that educational assessments should be linked. Newton starts by describing the linking framework as it was characterized in various publications and identifies a cross-cultural dimension in the definitions and uses of test…

Descriptors: Foreign Countries, Educational Assessment, Student Evaluation, Evaluation Criteria

A Framework for Evaluating and Planning Assessments Intended to Improve Student Achievement

Peer reviewed

Nichols, Paul D.; Meyers, Jason L.; Burling, Kelly S. – Educational Measurement: Issues and Practice, 2009

Assessments labeled as formative have been offered as a means to improve student achievement. But labels can be a powerful way to miscommunicate. For an assessment use to be appropriately labeled "formative," both empirical evidence and reasoned arguments must be offered to support the claim that improvements in student achievement can be linked…

Descriptors: Academic Achievement, Tutoring, Student Evaluation, Evaluation Methods

Benchmark Assessment for Improved Learning. AACC Report

Herman, Joan L.; Osmundson, Ellen; Dietel, Ronald – Assessment and Accountability Comprehensive Center, 2010

This report describes the purposes of benchmark assessments and provides recommendations for selecting and using benchmark assessments--addressing validity, alignment, reliability, fairness and bias and accessibility, instructional sensitivity, utility, and reporting issues. We also present recommendations on building capacity to support schools'…

Descriptors: Multiple Choice Tests, Test Items, Benchmarking, Educational Assessment

A Comparison of the Kansas Marital Satisfaction Scale and the Locke-Wallace Marital Adjustment Test.

White, Mark B.; And Others – 1990

Past research has suggested that the Kansas Marital Satisfaction Scale (KMS) is a brief, reliable, and valid measure of marital satisfaction. This study was conducted to: (1) examine responses on the KMS from a national sample of couples; (2) assess the construct validity of the KMS through a comparison with the Locke-Wallace Marital Adjustment…

Descriptors: Adjustment (to Environment), Construct Validity, Evaluation Methods, Marital Satisfaction

Methods for Evaluating the Validity of Test Scores for English Language Learners

Peer reviewed

Sireci, Stephen G.; Han, Kyung T.; Wells, Craig S. – Educational Assessment, 2008

In the United States, when English language learners (ELLs) are tested, they are usually tested in English and their limited English proficiency is a potential cause of construct-irrelevant variance. When such irrelevancies affect test scores, inaccurate interpretations of ELLs' knowledge, skills, and abilities may occur. In this article, we…

Descriptors: Test Use, Educational Assessment, Psychological Testing, Validity

Source

Educational Measurement:…	6
Psychological Assessment	4
Measurement:…	3
Alberta Journal of…	2
Applied Measurement in…	2
Counseling Psychologist	2
Educational Assessment	2
Evaluation Review	2
Journal of Outcome Measurement	2
Modern Language Journal	2
Academic Medicine	1
American Educational Research…	1
American Journal of Education	1
Assessing Writing	1
Assessment	1
Assessment & Evaluation in…	1
Assessment and Accountability…	1
British Educational Research…	1
Canadian Journal of English…	1
Center for Assessment and…	1
Child Welfare	1
Children & Schools	1
Community College Journal of…	1
Early Child Development and…	1
Education and Urban Society	1

Author

Linn, Robert L.	4
Herman, Joan L.	3
Baker, Eva L.	2
Clark, John L. D.	2
Johnson, Bil	2
Moss, Pamela A.	2
Mott, Michael S.	2
Shepard, Lorrie A.	2
Ackerman, Terry A.	1
Aiken, Lewis R.	1
Amery D. Wu	1
Archer, Robert P.	1
Arter, Judith A.	1
Bailey, Earletta	1
Baird, Jo-Anne	1
Baxter, Gail P.	1
Bishop, Laurence A.	1
Blake, Jennifer M.	1
Blakemore, Thomas	1
Bouwens, M. R. J.	1
Boyle, J. David	1
Bracey, Gerald W.	1
Bricker, Diane	1
Brown, Elissa J.	1

Publication Type

Journal Articles	63
Reports - Evaluative	44
Reports - Research	34
Speeches/Meeting Papers	29
Opinion Papers	22
Guides - Non-Classroom	12
Information Analyses	12
Reports - Descriptive	11
Books	9
Tests/Questionnaires	6
Guides - Classroom - Teacher	5
Guides - General	3
Book/Product Reviews	2
ERIC Digests in Full Text	2
ERIC Publications	2
Legal/Legislative/Regulatory…	2
Numerical/Quantitative Data	2
Reference Materials -…	2
Collected Works - Serials	1
Guides - Classroom - Learner	1
Reports - General	1

Assessments and Surveys

National Assessment of…	5
Minnesota Multiphasic…	2
SAT (College Admission Test)	2
Advanced Placement…	1
Child Abuse Potential…	1
Group Embedded Figures Test	1
Language Development Survey	1
Learning Style Inventory	1
Locke Wallace Marital…	1
MacArthur Communicative…	1
Maslach Burnout Inventory	1
Measures of Academic Progress	1
Motivated Strategies for…	1
Myers Briggs Type Indicator	1
Pennsylvania Educational…	1
Productivity Environmental…	1
Raven Progressive Matrices	1
School Level Environment…	1
Self Directed Learning…	1
Test of Adult Basic Education	1
Wide Range Achievement Test	1
Woodcock Johnson Tests of…	1