Beheshti, Shima; Safa, Mohammad Ahmadi – Iranian Journal of Language Teaching Research, 2023
The indefinite nature of test fairness and different interpretations and definitions of the concept have stirred a lot of controversy over the years, necessitating the reconceptualization of the concept. On this basis, this study aimed to explore the empirical validity of Kunnan's (2008) Test Fairness Framework (TFF) and revisit the established…
Descriptors: Test Bias, Equal Education, Grounded Theory, Test Construction
Raczynski, Kevin; Cohen, Allan – Applied Measurement in Education, 2018
The literature on Automated Essay Scoring (AES) systems has provided useful validation frameworks for any assessment that includes AES scoring. Furthermore, evidence for the scoring fidelity of AES systems is accumulating. Yet questions remain when appraising the scoring performance of AES systems. These questions include: (a) which essays are…
Descriptors: Essay Tests, Test Scoring Machines, Test Validity, Evaluators
Ziwei Zhou – ProQuest LLC, 2020
In light of the ever-increasing capability of computer technology and advancement in speech and natural language processing techniques, automated speech scoring of constructed responses is gaining popularity in many high-stakes assessment and low-stakes educational settings. Automated scoring is a highly interdisciplinary and complex subject, and…
Descriptors: Certification, Speech Skills, Automation, Scoring
von Davier, Matthias; Tyack, Lillian; Khorramdel, Lale – Educational and Psychological Measurement, 2023
Automated scoring of free drawings or images as responses has yet to be used in large-scale assessments of student achievement. In this study, we propose artificial neural networks to classify these types of graphical responses from a TIMSS 2019 item. We are comparing classification accuracy of convolutional and feed-forward approaches. Our…
Descriptors: Scoring, Networks, Artificial Intelligence, Elementary Secondary Education
Warsono; Nursuhud, Puji Iman; Darma, Rio Sandhika; Supahar – International Journal of Instruction, 2020
The study was conducted to analyze items measuring high school students' diagram representation ability and to obtain the Item Characteristic Curve. Test grids were compiled based on competencies and indicators of diagram representation, which were then used to write the items. The test instrument consisted of five items and was validated by…
Descriptors: High School Students, Problem Solving, Visual Aids, Scoring
Lavi, Rea; Dori, Yehudit Judy; Wengrowicz, Niva; Dori, Dov – IEEE Transactions on Education, 2020
Contribution: A rubric for assessing the systems thinking expressed in conceptual models of technological systems has been constructed and assessed using a formal methodology. The rubric, a synthesis of prior findings in science and engineering education, forms a framework for improving communication between science and engineering educators.…
Descriptors: Models, Engineering Education, Teamwork, Scoring Rubrics
Carlson, Tiffany; Crepeau-Hobson, Franci – Communique, 2021
When the coronavirus pandemic was declared a public health crisis in March 2020, school psychologists were forced into situations where face-to-face interaction with their students was discouraged and in some cases, prohibited. Consequently, the traditional practice of school psychology abruptly ended. Individualized Education Plans (IEP) and…
Descriptors: Cognitive Tests, Ethics, Decision Making, Models
Ackerman, Debra J. – ETS Research Report Series, 2020
Over the past 8 years, U.S. kindergarten classrooms have been impacted by policies mandating or recommending the administration of a specific kindergarten entry assessment (KEA) in the initial months of school as well as the increasing reliance on digital technology in the form of mobile apps, touchscreen devices, and online data platforms. Using…
Descriptors: Kindergarten, School Readiness, Computer Assisted Testing, Preschool Teachers
Dickison, Philip; Luo, Xiao; Kim, Doyoung; Woo, Ada; Muntean, William; Bergstrom, Betty – Journal of Applied Testing Technology, 2016
Designing a theory-based assessment with sound psychometric qualities to measure a higher-order cognitive construct is a highly desired yet challenging task for many practitioners. This paper proposes a framework for designing a theory-based assessment to measure a higher-order cognitive construct. This framework results in a modularized yet…
Descriptors: Thinking Skills, Cognitive Tests, Test Construction, Nursing
Gorin, Joanna S.; O'Reilly, Tenaha; Sabatini, John; Song, Yi; Deane, Paul – Grantee Submission, 2014
Recent advances in cognitive science and psychometrics have expanded the possibilities for the next generation of literacy assessment as an integrated domain (Bennett, 2011a; Deane, Sabatini, & O'Reilly, 2011; Leighton & Gierl, 2011; Sabatini, Albro, & O'Reilly, 2012). In this paper, we discuss four key areas supporting innovations in…
Descriptors: Literacy Education, Evaluation Methods, Measurement Techniques, Student Evaluation
Amrein-Beardsley, Audrey; Holloway-Libell, Jessica; Cirell, Anna Montana; Hays, Alice; Chapman, Kathryn – Practical Assessment, Research & Evaluation, 2015
There is something incalculable about teacher expertise and whether it can be observed, detected, quantified, and as per current educational policies, used as an accountability tool to hold America's public school teachers accountable for that which they do (or do not do well). In this commentary, authors (all of whom are former public school…
Descriptors: Accountability, Educational Change, Educational Policy, Expertise
Castellano, Katherine E.; Duckor, Brent; Wihardini, Diah; Telléz, Kip; Wilson, Mark – Teacher Education Quarterly, 2016
With the adoption by most states of the Common Core State Standards (CCSS) for English language arts and literacy and for mathematics (CCSS Initiative, 2010a, 2010b) come major changes in public education that will affect instructional practice, curriculum, and assessment across the nation. Heritage, Walqui, and Linquanti (2015) argued that the…
Descriptors: Elementary School Mathematics, Mathematics Teachers, Teacher Certification, Language Usage
Ercikan, Kadriye; Oliveri, María Elena – Applied Measurement in Education, 2016
Assessing complex constructs such as those discussed under the umbrella of 21st century constructs highlights the need for a principled assessment design and validation approach. In our discussion, we made a case for three considerations: (a) taking construct complexity into account across various stages of assessment development such as the…
Descriptors: Evaluation Methods, Test Construction, Design, Scaling
Bauer, Christopher F.; Cole, Renee – Journal of Chemical Education, 2012
A rubric that embodies the key features of the process-oriented, guided-inquiry learning (POGIL) model was subjected to systematic study of validity and reliability. Nearly 60 college instructors used the rubric to evaluate four intentionally modified versions of an established POGIL activity. The modifications strengthened or weakened key…
Descriptors: Scoring Rubrics, Test Reliability, Science Instruction, College Science
Cavanagh, Robert F.; Koehler, Matthew J. – Journal of Research on Technology in Education, 2013
The impetus for this paper stems from a concern about directions and progress in the measurement of the Technological Pedagogical Content Knowledge (TPACK) framework for effective technology integration. In this paper, we develop the rationale for using a seven-criterion lens, based upon contemporary validity theory, for critiquing empirical…
Descriptors: Technological Literacy, Pedagogical Content Knowledge, Measurement Techniques, Technology Integration