Han, Chao – Language Testing, 2022
Over the past decade, testing and assessing spoken-language interpreting has garnered an increasing amount of attention from stakeholders in interpreter education, professional certification, and interpreting research. This is because in these fields assessment results provide a critical evidential basis for high-stakes decisions, such as the…
Descriptors: Translation, Language Tests, Testing, Evaluation Methods
Rafner, Janet; Biskjaer, Michael Mose; Zana, Blanka; Langsford, Steven; Bergenholtz, Carsten; Rahimi, Seyedahmad; Carugati, Andrea; Noy, Lior; Sherson, Jacob – Creativity Research Journal, 2022
Creativity assessments should be valid, reliable, and scalable to support various stakeholders (e.g., policy-makers, educators, corporations, and the general public) in their decision-making processes. Established initiatives toward scalable creativity assessments have relied on well-studied standardized tests. Although robust in many ways, most…
Descriptors: Creativity, Evaluation Methods, Video Games, Computer Assisted Testing
Mattern, Krista; Radunzel, Justine – ACT, Inc., 2019
When applicants take the ACT® more than once, how do colleges and universities reconcile and make sense of the multiple scores? In terms of validity, fairness, and impact on subgroup differences, are certain score-use policies better than others? The focus of this issue brief is to summarize evidence on the validity and fairness of various…
Descriptors: Scoring, College Entrance Examinations, Test Validity, Evaluation Methods
Lynch, Sarah – Practical Assessment, Research & Evaluation, 2022
In today's digital age, tests are increasingly being delivered on computers. Many of these computer-based tests (CBTs) have been adapted from paper-based tests (PBTs). However, this change in mode of test administration has the potential to introduce construct-irrelevant variance, affecting the validity of score interpretations. Because of this,…
Descriptors: Computer Assisted Testing, Tests, Scores, Scoring
Zhou, Ziwei – ProQuest LLC, 2020
In light of the ever-increasing capability of computer technology and advancement in speech and natural language processing techniques, automated speech scoring of constructed responses is gaining popularity in many high-stakes assessment and low-stakes educational settings. Automated scoring is a highly interdisciplinary and complex subject, and…
Descriptors: Certification, Speech Skills, Automation, Scoring
Tavares, Walter; Brydges, Ryan; Myre, Paul; Prpic, Jason; Turner, Linda; Yelle, Richard; Huiskamp, Maud – Advances in Health Sciences Education, 2018
Assessment of clinical competence is complex and inference based. Trustworthy and defensible assessment processes must have favourable evidence of validity, particularly where decisions are considered high stakes. We aimed to organize, collect and interpret validity evidence for a high stakes simulation based assessment strategy for certifying…
Descriptors: Competence, Simulation, Allied Health Personnel, Certification
Bull, Rebecca; Yao, Shih-Ying; Ng, Ee Lynn – International Journal of Early Childhood, 2017
The early childhood sector in Singapore has witnessed vast changes in the past two decades. One of the key policy aims is to improve classroom quality. To ensure a rigorous evaluation of the quality of early childhood environments in Singapore, it is important to determine whether commonly used assessments of quality are valid indicators across…
Descriptors: Foreign Countries, Rating Scales, Educational Environment, Educational Quality
Oliveri, María Elena; Lawless, René – ETS Research Report Series, 2018
In this paper, we first examine the challenges of score comparability associated with the use of assessments that are exported. By exported assessments, we mean assessments that are developed for domestic use and are then administered in other countries in either the same or a different language. Second, we provide suggestions to better support…
Descriptors: Scores, Scoring, Higher Education, College Students
Feranchak, Bret; Deiger, Megan – AERA Online Paper Repository, 2017
Increasingly, content area projects and programs at the K-12 level, such as in mathematics, involve a programmatic component or project emphasis on developing "teacher leadership". However, there is no consistent definition or framework for this construct and even fewer validated tools for measuring it. This paper describes our efforts in…
Descriptors: Teacher Leadership, Mathematics Instruction, Guidelines, Elementary Secondary Education
Zychowicz, Katarzyna; Biedron, Adriana; Pawlak, Miroslaw – Studies in Second Language Learning and Teaching, 2017
Individual differences in second language acquisition (SLA) encompass differences in working memory capacity, which is believed to be one of the most crucial factors influencing language learning. However, in Poland research on the role of working memory in SLA is scarce due to a lack of proper Polish instruments for measuring this construct. The…
Descriptors: Verbal Ability, Short Term Memory, Individual Differences, Second Language Learning
Gorin, Joanna S.; O'Reilly, Tenaha; Sabatini, John; Song, Yi; Deane, Paul – Grantee Submission, 2014
Recent advances in cognitive science and psychometrics have expanded the possibilities for the next generation of literacy assessment as an integrated domain (Bennett, 2011a; Deane, Sabatini, & O'Reilly, 2011; Leighton & Gierl, 2011; Sabatini, Albro, & O'Reilly, 2012). In this paper, we discuss four key areas supporting innovations in…
Descriptors: Literacy Education, Evaluation Methods, Measurement Techniques, Student Evaluation
Looney, Marilyn A. – Research Quarterly for Exercise and Sport, 2013
Given that equating/linking applications are now appearing in kinesiology literature, this article provides an overview of the different types of linked test scores: equated, concordant, and predicted. It also addresses the different types of evidence required to determine whether the scores from two different field tests (measuring the same…
Descriptors: Scores, Psychomotor Skills, Scoring, Measurement Techniques
International Journal of Testing, 2019
These guidelines describe considerations relevant to the assessment of test takers in or across countries or regions that are linguistically or culturally diverse. The guidelines were developed by a committee of experts to help inform test developers, psychometricians, test users, and test administrators about fairness issues in support of the…
Descriptors: Test Bias, Student Diversity, Cultural Differences, Language Usage
Brown, Anna; Maydeu-Olivares, Alberto – Psychological Methods, 2013
In multidimensional forced-choice (MFC) questionnaires, items measuring different attributes are presented in blocks, and participants have to rank order the items within each block (fully or partially). Such comparative formats can reduce the impact of numerous response biases often affecting single-stimulus items (aka rating or Likert scales).…
Descriptors: Test Validity, Item Response Theory, Scoring, Questionnaires
Ercikan, Kadriye; Oliveri, María Elena – Applied Measurement in Education, 2016
Assessing complex constructs such as those discussed under the umbrella of 21st century constructs highlights the need for a principled assessment design and validation approach. In our discussion, we made a case for three considerations: (a) taking construct complexity into account across various stages of assessment development such as the…
Descriptors: Evaluation Methods, Test Construction, Design, Scaling