Publication Date
In 2025 | 1 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 5 |
Since 2016 (last 10 years) | 9 |
Since 2006 (last 20 years) | 35 |
Descriptor
Psychometrics | 162 |
Testing Problems | 162 |
Test Construction | 53 |
Test Validity | 50 |
Educational Assessment | 38 |
Elementary Secondary Education | 36 |
Evaluation Methods | 33 |
Measurement Techniques | 29 |
Test Use | 27 |
Test Items | 26 |
Educational Testing | 25 |
More ▼ |
Source
Author
Thurlow, Martha | 4 |
Hambleton, Ronald K. | 3 |
Bielinski, John | 2 |
Davis, W. Alan | 2 |
Dings, Jonathan | 2 |
Drasgow, Fritz | 2 |
Figueroa, Richard A. | 2 |
Hurley, Christine | 2 |
Lord, Frederic M. | 2 |
Minnema, Jane | 2 |
Murphy, Edward | 2 |
More ▼ |
Publication Type
Education Level
Elementary Secondary Education | 16 |
Elementary Education | 3 |
Higher Education | 3 |
Postsecondary Education | 2 |
Secondary Education | 2 |
Early Childhood Education | 1 |
Junior High Schools | 1 |
Preschool Education | 1 |
Audience
Researchers | 11 |
Practitioners | 5 |
Counselors | 1 |
Policymakers | 1 |
Students | 1 |
Location
United States | 5 |
United Kingdom | 4 |
Kentucky | 3 |
United Kingdom (England) | 3 |
Canada | 2 |
Japan | 2 |
United Kingdom (Wales) | 2 |
Australia | 1 |
China | 1 |
Colombia | 1 |
Germany | 1 |
More ▼ |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 2 |
Debra P v Turlington | 1 |
Education of the Handicapped… | 1 |
Individuals with Disabilities… | 1 |
Individuals with Disabilities… | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Jiayi Wang; Michael T. Kalkbrenner; Riley Schaner – Psychology in the Schools, 2025
Teaching is a stressful profession with a high turnover rate. Schools and related institutions need to take more action to support teachers and keep teacher stress at a manageable level. The continued research and practical effort require measures to examine teachers' stress in a briefer and accurate manner. The Teacher Stress Scale is a recently…
Descriptors: Elementary School Teachers, Secondary School Teachers, Preschool Teachers, Stress Variables
Chen, Yunxiao; Lee, Yi-Hsuan; Li, Xiaoou – Journal of Educational and Behavioral Statistics, 2022
In standardized educational testing, test items are reused in multiple test administrations. To ensure the validity of test scores, the psychometric properties of items should remain unchanged over time. In this article, we consider the sequential monitoring of test items, in particular, the detection of abrupt changes to their psychometric…
Descriptors: Standardized Tests, Test Items, Test Validity, Scores
Kim, Sooyeon; Walker, Michael – ETS Research Report Series, 2021
In this investigation, we used real data to assess potential differential effects associated with taking a test in a test center (TC) versus testing at home using remote proctoring (RP). We used a pseudo-equivalent groups (PEG) approach to examine group equivalence at the item level and the total score level. If our assumption holds that the PEG…
Descriptors: Testing, Distance Education, Comparative Analysis, Test Items
Janssen, Gerriet – Language Testing, 2022
This article provides a single, common-case study of a test retrofit project at one Colombian university. It reports on how the test retrofit project was carried out and describes the different areas of language assessment literacy the project afforded local teacher stakeholders. This project was successful in that it modified the test constructs…
Descriptors: Language Tests, Placement Tests, Language Teachers, College Faculty
Leventhal, Brian C.; Grabovsky, Irina – Educational Measurement: Issues and Practice, 2020
Standard setting is arguably one of the most subjective techniques in test development and psychometrics. The decisions when scores are compared to standards, however, are arguably the most consequential outcomes of testing. Providing licensure to practice in a profession has high stake consequences for the public. Denying graduation or forcing…
Descriptors: Standard Setting (Scoring), Weighted Scores, Test Construction, Psychometrics
LaFlair, Geoffrey T.; Langenfeld, Thomas; Baig, Basim; Horie, André Kenji; Attali, Yigal; von Davier, Alina A. – Journal of Computer Assisted Learning, 2022
Background: Digital-first assessments leverage the affordances of technology in all elements of the assessment process--from design and development to score reporting and evaluation to create test taker-centric assessments. Objectives: The goal of this paper is to describe the engineering, machine learning, and psychometric processes and…
Descriptors: Computer Assisted Testing, Affordances, Scoring, Engineering
Kato, Pamela M.; de Klerk, Sebastiaan – Journal of Applied Testing Technology, 2017
Serious games are increasingly being explored for use as assessment tools in broad domains. Drawing from research in these domains, we present important advantages and challenges that arise when using games for assessment. In light of this context and as an introduction to this special issue on Serious Games and Assessments, we introduce the…
Descriptors: Evaluation Methods, Formative Evaluation, Design, Educational Games
Hipkins, Rosemary – set: Research Information for Teachers, 2019
PISA [Programme for International Student Assessment] will be in the news again this year. The 2018 results are due to be released at the end of 2019 and they usually generate media interest. This Rangahau Whakarapopoto is a research brief which outlines things to watch out for as you think about what the results might mean.
Descriptors: Achievement Tests, Foreign Countries, Secondary School Students, International Assessment
Norfolk, Philip A.; Farmer, Ryan L.; Floyd, Randy G.; Woods, Isaac L.; Hawkins, Haley K.; Irby, Sarah M. – Journal of Psychoeducational Assessment, 2015
The representativeness, recency, and size of norm samples strongly influence the accuracy of inferences drawn from their scores. Inadequate norm samples may lead to inflated or deflated scores for individuals and poorer prediction of developmental and academic outcomes. The purpose of this study was to apply Kranzler and Floyd's method for…
Descriptors: Intelligence Tests, Psychometrics, Sample Size, Norm Referenced Tests
McGill, Ryan J.; Styck, Kara M.; Palomares, Ronald S.; Hass, Michael R. – Learning Disability Quarterly, 2016
As a result of the upcoming Federal reauthorization of the Individuals With Disabilities Education Improvement Act (IDEA), practitioners and researchers have begun vigorously debating what constitutes evidence-based assessment for the identification of specific learning disability (SLD). This debate has resulted in strong support for a method that…
Descriptors: Learning Disabilities, Disability Identification, Disabilities, Federal Legislation
Knell, Janie L.; Wilhoite, Andrea P.; Fugate, Joshua Z.; González-Espada, Wilson J. – Electronic Journal of Science Education, 2015
Current science education reform efforts emphasize teaching K-12 science using hands-on, inquiry activities. For maximum learning and probability of implementation among inservice teachers, these strategies must be modeled in college science courses for preservice teachers. About a decade ago, Morehead State University revised their science…
Descriptors: Item Response Theory, Multiple Choice Tests, Test Construction, Psychometrics
Orrill, Chandra Hawley; Kim, Ok-Kyeong; Peters, Susan A.; Lischka, Alyson E.; Jong, Cindy; Sanchez, Wendy B.; Eli, Jennifer A. – Mathematics Teacher Education and Development, 2015
Developing and writing assessment items that measure teachers' knowledge is an intricate and complex undertaking. In this paper, we begin with an overview of what is known about measuring teacher knowledge. We then highlight the challenges inherent in creating assessment items that focus specifically on measuring teachers' specialised knowledge…
Descriptors: Specialization, Knowledge Base for Teaching, Educational Strategies, Testing Problems
Taskinen, Päivi H.; Steimel, Jochen; Gräfe, Linda; Engell, Sebastian; Frey, Andreas – Peabody Journal of Education, 2015
This study examined students' competencies in engineering education at the university level. First, we developed a competency model in one specific field of engineering: process dynamics and control. Then, the theoretical model was used as a frame to construct test items to measure students' competencies comprehensively. In the empirical…
Descriptors: Models, Engineering Education, Test Items, Outcome Measures
Mori, Kazuo; Uchida, Akitoshi – Research in Education, 2012
Longitudinal change in the average Z scores for four groups of pupils sorted by quartiles was examined for its stability over three years. The data, collected from 1998 to 2009, was obtained from nine cohorts of Japanese junior high school pupils totaling 1,962 subjects. It showed illusionary declines among the mid-range pupils but improvements…
Descriptors: Foreign Countries, Junior High School Students, Cohort Analysis, Evaluation Problems
Feuer, Michael J. – Mid-Western Educational Researcher, 2011
In this keynote address, the author shares his reflections on politics, economics, and testing. He focuses on assessment and accountability and begins with some data from large scale written educational testing, "circa 1840". The author argues that people's penchant for accountability and their appetite for standardized testing are, in…
Descriptors: Testing Problems, Educational Testing, Standardized Tests, Risk