ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	3
Since 2016 (last 10 years)	12
Since 2006 (last 20 years)	34

Descriptor

Evaluation Methods	61
Scoring	61
Test Validity	61
Test Reliability	22
Student Evaluation	16
Elementary Secondary Education	11
Interrater Reliability	11
Test Construction	11
Measurement Techniques	10
Computer Assisted Testing	9
Foreign Countries	9
Scores	9
Testing	9
Writing Evaluation	8
Psychometrics	7
Comparative Analysis	6
Higher Education	6
Rating Scales	6
Test Items	6
Writing Skills	6
Construct Validity	5
English (Second Language)	5
Models	5
Second Language Learning	5
Academic Achievement	4

Publication Type

Journal Articles	37
Reports - Research	30
Reports - Evaluative	12
Reports - Descriptive	7
Speeches/Meeting Papers	7
Information Analyses	5
Tests/Questionnaires	4
Reports - General	3
Dissertations/Theses -…	2
Numerical/Quantitative Data	2
Opinion Papers	2
Guides - Classroom - Teacher	1
Guides - General	1
Guides - Non-Classroom	1
Historical Materials	1
Reference Materials -…	1

Education Level

Higher Education	6
Elementary Secondary Education	4
Kindergarten	3
Postsecondary Education	3
Early Childhood Education	2
Secondary Education	2
Elementary Education	1
Grade 1	1
Grade 11	1
Grade 2	1
High Schools	1
Preschool Education	1
Primary Education	1

Audience

Practitioners	6
Policymakers	2
Researchers	1
Teachers	1

Location

Australia	3
Canada	1
Finland	1
Nebraska (Lincoln)	1
Poland	1
Singapore	1
Tennessee	1
Utah	1
Vermont	1

Laws, Policies, & Programs

Comprehensive Education…	1
Elementary and Secondary…	1
No Child Left Behind Act 2001	1

Assessments and Surveys

ACT Assessment	1
Bem Sex Role Inventory	1
Child Behavior Checklist	1
Childrens Depression Inventory	1
Early Childhood Environment…	1
National Teacher Examinations	1
Test of English as a Foreign…	1

Showing 1 to 15 of 61 results

Interpreting Testing and Assessment: A State-of-the-Art Review

Peer reviewed

Han, Chao – Language Testing, 2022

Over the past decade, testing and assessing spoken-language interpreting has garnered an increasing amount of attention from stakeholders in interpreter education, professional certification, and interpreting research. This is because in these fields assessment results provide a critical evidential basis for high-stakes decisions, such as the…

Descriptors: Translation, Language Tests, Testing, Evaluation Methods

Digital Games for Creativity Assessment: Strengths, Weaknesses and Opportunities

Peer reviewed

Rafner, Janet; Biskjaer, Michael Mose; Zana, Blanka; Langsford, Steven; Bergenholtz, Carsten; Rahimi, Seyedahmad; Carugati, Andrea; Noy, Lior; Sherson, Jacob – Creativity Research Journal, 2022

Creativity assessments should be valid, reliable, and scalable to support various stakeholders (e.g., policy-makers, educators, corporations, and the general public) in their decision-making processes. Established initiatives toward scalable creativity assessments have relied on well-studied standardized tests. Although robust in many ways, most…

Descriptors: Creativity, Evaluation Methods, Video Games, Computer Assisted Testing

Impact of Superscoring on Subgroup Differences. Issue Brief

Mattern, Krista; Radunzel, Justine – ACT, Inc., 2019

When applicants take the ACT® more than once, how do colleges and universities reconcile and make sense of the multiple scores? In terms of validity, fairness, and impact on subgroup differences, are certain score-use policies better than others? The focus of this issue brief is to summarize evidence on the validity and fairness of various…

Descriptors: Scoring, College Entrance Examinations, Test Validity, Evaluation Methods

Adapting Paper-Based Tests for Computer Administration: Lessons Learned from 30 Years of Mode Effects Studies in Education

Peer reviewed

Lynch, Sarah – Practical Assessment, Research & Evaluation, 2022

In today's digital age, tests are increasingly being delivered on computers. Many of these computer-based tests (CBTs) have been adapted from paper-based tests (PBTs). However, this change in mode of test administration has the potential to introduce construct-irrelevant variance, affecting the validity of score interpretations. Because of this,…

Descriptors: Computer Assisted Testing, Tests, Scores, Scoring

Modeling Statistics ITAs' Speaking Performances in a Certification Test

Ziwei Zhou – ProQuest LLC, 2020

In light of the ever-increasing capability of computer technology and advancement in speech and natural language processing techniques, automated speech scoring of constructed responses is gaining popularity in many high-stakes assessment and low-stakes educational settings. Automated scoring is a highly interdisciplinary and complex subject, and…

Descriptors: Certification, Speech Skills, Automation, Scoring

Applying Kane's Validity Framework to a Simulation Based Assessment of Clinical Competence

Peer reviewed

Tavares, Walter; Brydges, Ryan; Myre, Paul; Prpic, Jason; Turner, Linda; Yelle, Richard; Huiskamp, Maud – Advances in Health Sciences Education, 2018

Assessment of clinical competence is complex and inference based. Trustworthy and defensible assessment processes must have favourable evidence of validity, particularly where decisions are considered high stakes. We aimed to organize, collect and interpret validity evidence for a high stakes simulation based assessment strategy for certifying…

Descriptors: Competence, Simulation, Allied Health Personnel, Certification

Assessing Quality of Kindergarten Classrooms in Singapore: Psychometric Properties of the Early Childhood Environment Rating Scale-Revised

Peer reviewed

Bull, Rebecca; Yao, Shih-Ying; Ng, Ee Lynn – International Journal of Early Childhood, 2017

The early childhood sector in Singapore has witnessed vast changes in the past two decades. One of the key policy aims is to improve classroom quality. To ensure a rigorous evaluation of the quality of early childhood environments in Singapore, it is important to determine whether commonly used assessments of quality are valid indicators across…

Descriptors: Foreign Countries, Rating Scales, Educational Environment, Educational Quality

The Validity of Inferences from Locally Developed Assessments Administered Globally. Research Report. ETS RR-18-35

Peer reviewed

Oliveri, María Elena; Lawless, René – ETS Research Report Series, 2018

In this paper, we first examine the challenges of score comparability associated with the use of assessments that are exported. By exported assessments, we mean assessments that are developed for domestic use and are then administered in other countries in either the same or a different language. Second, we provide suggestions to better support…

Descriptors: Scores, Scoring, Higher Education, College Students

Development of a Situational Judgment Task for Assessing Teacher Leadership in Mathematics

Peer reviewed

Feranchak, Bret; Deiger, Megan – AERA Online Paper Repository, 2017

Increasingly, content area projects and programs at the K-12 level, such as in mathematics, involve a programmatic component or project emphasis on developing "teacher leadership". However, there is no consistent definition or framework for this construct and even fewer validated tools for measuring it. This paper describes our efforts in…

Descriptors: Teacher Leadership, Mathematics Instruction, Guidelines, Elementary Secondary Education

Polish Listening SPAN: A New Tool for Measuring Verbal Working Memory

Peer reviewed

Zychowicz, Katarzyna; Biedron, Adriana; Pawlak, Miroslaw – Studies in Second Language Learning and Teaching, 2017

Individual differences in second language acquisition (SLA) encompass differences in working memory capacity, which is believed to be one of the most crucial factors influencing language learning. However, in Poland research on the role of working memory in SLA is scarce due to a lack of proper Polish instruments for measuring this construct. The…

Descriptors: Verbal Ability, Short Term Memory, Individual Differences, Second Language Learning

Measurement: Facilitating the Goal of Literacy

Peer reviewed

Gorin, Joanna S.; O'Reilly, Tenaha; Sabatini, John; Song, Yi; Deane, Paul – Grantee Submission, 2014

Recent advances in cognitive science and psychometrics have expanded the possibilities for the next generation of literacy assessment as an integrated domain (Bennett, 2011a; Deane, Sabatini, & O'Reilly, 2011; Leighton & Gierl, 2011; Sabatini, Albro, & O'Reilly, 2012). In this paper, we discuss four key areas supporting innovations in…

Descriptors: Literacy Education, Evaluation Methods, Measurement Techniques, Student Evaluation

Interpreting Linked Psychomotor Performance Scores

Peer reviewed

Looney, Marilyn A. – Research Quarterly for Exercise and Sport, 2013

Given that equating/linking applications are now appearing in kinesiology literature, this article provides an overview of the different types of linked test scores: equated, concordant, and predicted. It also addresses the different types of evidence required to determine whether the scores from two different field tests (measuring the same…

Descriptors: Scores, Psychomotor Skills, Scoring, Measurement Techniques

ITC Guidelines for the Large-Scale Assessment of Linguistically and Culturally Diverse Populations

Peer reviewed

International Journal of Testing, 2019

These guidelines describe considerations relevant to the assessment of test takers in or across countries or regions that are linguistically or culturally diverse. The guidelines were developed by a committee of experts to help inform test developers, psychometricians, test users, and test administrators about fairness issues in support of the…

Descriptors: Test Bias, Student Diversity, Cultural Differences, Language Usage

How IRT Can Solve Problems of Ipsative Data in Forced-Choice Questionnaires

Peer reviewed

Brown, Anna; Maydeu-Olivares, Alberto – Psychological Methods, 2013

In multidimensional forced-choice (MFC) questionnaires, items measuring different attributes are presented in blocks, and participants have to rank order the items within each block (fully or partially). Such comparative formats can reduce the impact of numerous response biases often affecting single-stimulus items (aka rating or Likert scales).…

Descriptors: Test Validity, Item Response Theory, Scoring, Questionnaires

In Search of Validity Evidence in Support of the Interpretation and Use of Assessments of Complex Constructs: Discussion of Research on Assessing 21st Century Skills

Peer reviewed

Ercikan, Kadriye; Oliveri, María Elena – Applied Measurement in Education, 2016

Assessing complex constructs such as those discussed under the umbrella of 21st century constructs highlights the need for a principled assessment design and validation approach. In our discussion, we made a case for three considerations: (a) taking construct complexity into account across various stages of assessment development such as the…

Descriptors: Evaluation Methods, Test Construction, Design, Scaling

Source

ETS Research Report Series	3
Applied Measurement in…	2
Bill & Melinda Gates…	2
Journal of Psychoeducational…	2
ProQuest LLC	2
ACT, Inc.	1
AERA Online Paper Repository	1
Advances in Health Sciences…	1
Assessment	1
Contemporary Education	1
Creativity Research Journal	1
Early Child Development and…	1
Early Education and…	1
Education and Training in…	1
Educational Research	1
Educational Researcher	1
English Teaching Forum	1
Grantee Submission	1
International Journal of…	1
International Journal of…	1
Journal of Consulting and…	1
Journal of Educational…	1
Journal of Educational…	1
Journal of Special Education…	1
Language Testing	1

Author

Baker, Eva L.	2
Bejar, Isaac I.	2
Johnson, Robert L.	2
Kane, Thomas J.	2
Oliveri, María Elena	2
Staiger, Douglas O.	2
Zechner, Klaus	2
Andrews, Jac	1
Apache, R. R.	1
Bae, Yunhee	1
Bayton, James A.	1
Bergenholtz, Carsten	1
Bergman, Teresa	1
Bewley, William L.	1
Biedron, Adriana	1
Biskjaer, Michael Mose	1
Boccaccini, Marcus T.	1
Borders, L. DiAnne	1
Bowman, Harry L.	1
Boyd, Joseph L., Jr.	1
Brill, David G.	1
Brown, Anna	1
Brydges, Ryan	1
Bull, Rebecca	1