Publication Date
In 2025 | 3 |
Since 2024 | 6 |
Since 2021 (last 5 years) | 14 |
Since 2016 (last 10 years) | 26 |
Since 2006 (last 20 years) | 44 |
Descriptor
Test Length | 108 |
Test Validity | 108 |
Test Reliability | 60 |
Test Construction | 46 |
Test Items | 29 |
Test Format | 22 |
Foreign Countries | 18 |
Computer Assisted Testing | 17 |
Testing Problems | 17 |
Psychometrics | 14 |
Comparative Analysis | 12 |
More ▼ |
Source
Author
Hambleton, Ronald K. | 6 |
Wainer, Howard | 3 |
Michael, William B. | 2 |
Abrams, Matthew | 1 |
Acar, Selcuk | 1 |
Almeida, Leandro S. | 1 |
Alonso, Jordi | 1 |
Anthony, Christopher J. | 1 |
Arbet, Scott E. | 1 |
Arens, A. Katrin | 1 |
Aydin, Selami | 1 |
More ▼ |
Publication Type
Education Level
Audience
Researchers | 5 |
Practitioners | 2 |
Community | 1 |
Support Staff | 1 |
Location
Turkey | 5 |
China | 3 |
United Kingdom | 3 |
Japan | 2 |
California | 1 |
Canada | 1 |
Germany | 1 |
Italy | 1 |
Kenya | 1 |
Michigan | 1 |
New Jersey | 1 |
More ▼ |
Laws, Policies, & Programs
Job Training Partnership Act… | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Félix González-Carrasco; Felipe Espinosa Parra; Izaskun Álvarez-Aguado; Sebastián Ponce Olguín; Vanessa Vega Córdova; Miguel Roselló-Peñaloza – British Journal of Learning Disabilities, 2025
Background: The study focuses on the need to optimise assessment scales for support needs in individuals with intellectual and developmental disabilities. Current scales are often lengthy and redundant, leading to exhaustion and response burden. The goal is to use machine learning techniques, specifically item-reduction methods and selection…
Descriptors: Artificial Intelligence, Intellectual Disability, Developmental Disabilities, Individual Needs
He, Yinhong – Journal of Educational Measurement, 2023
Back random responding (BRR) behavior is one of the commonly observed careless response behaviors. Accurately detecting BRR behavior can improve test validities. Yu and Cheng (2019) showed that the change point analysis (CPA) procedure based on weighted residual (CPA-WR) performed well in detecting BRR. Compared with the CPA procedure, the…
Descriptors: Test Validity, Item Response Theory, Measurement, Monte Carlo Methods
Handan Narin Kiziltan; Hatice Cigdem Bulut – International Journal of Assessment Tools in Education, 2024
Mental imagery is a vital cognitive skill that significantly influences how reality is perceived while creating art. Its multifaceted nature reveals various dimensions of creative expression, amplifying the inherent complexities of measuring it. This study aimed to shorten the Mental Imagery Scale in Artistic Creativity (MISAC) via the Ant Colony…
Descriptors: Foreign Countries, Undergraduate Students, Art Education, Imagery
Yi-Jui I. Chen; Yi-Jhen Wu; Yi-Hsin Chen; Robin Irey – Journal of Psychoeducational Assessment, 2025
A short form of the 60-item computer-based orthographic processing assessment (long-form COPA or COPA-LF) was developed. The COPA-LF consists of five skills, including rapid perception, access, differentiation, correction, and arrangement. Thirty items from the COPA-LF were selected for the short-form COPA (COPA-SF) based on cognitive diagnostic…
Descriptors: Computer Assisted Testing, Test Length, Test Validity, Orthographic Symbols
Hakyung Sung; Sooyeon Cho; Kristopher Kyle – Language Assessment Quarterly, 2024
Lexical diversity (LD) is an important indicator of second language lexical development. Much research has investigated LD indices, with a focus on learners of English. However, further research is needed in languages that are typologically distinct from English, such as Korean. In this study, we evaluated the reliability and validity of LD…
Descriptors: Second Language Learning, Korean, Persuasive Discourse, Language Tests
Kotera, Yasuhiro; Conway, Elaine; Green, Pauline – British Journal of Guidance & Counselling, 2023
Academic motivation is important to students' mental health and performance. One established measure is the Academic Motivation Scale (AMS), comprising 28 items. AMS assesses intrinsic motivation, extrinsic motivation, and amotivation, which are further categorised into seven subscales. One weakness of AMS is its length. In this study, we…
Descriptors: Test Construction, Test Validity, Factor Analysis, Learning Motivation
Basman, Munevver – International Journal of Assessment Tools in Education, 2023
To ensure the validity of the tests is to check that all items have similar results across different groups of individuals. However, differential item functioning (DIF) occurs when the results of individuals with equal ability levels from different groups differ from each other on the same test item. Based on Item Response Theory and Classic Test…
Descriptors: Test Bias, Test Items, Test Validity, Item Response Theory
Jones, Brett D.; Wilkins, Jesse L. M. – Journal of Psychoeducational Assessment, 2023
The purpose of this study was to investigate the validity evidence for the use of the 19-item and 20-item short forms of the MUSIC Model of Academic Motivation Inventory (College Student version) with undergraduate students. These shorter forms of the MUSIC Inventory could be beneficial to teachers and researchers. Our analysis included inventory…
Descriptors: Test Validity, Learning Motivation, Test Length, Undergraduate Students
Ying Xu; Xiaodong Li; Jin Chen – Language Testing, 2025
This article provides a detailed review of the Computer-based English Listening Speaking Test (CELST) used in Guangdong, China, as part of the National Matriculation English Test (NMET) to assess students' English proficiency. The CELST measures listening and speaking skills as outlined in the "English Curriculum for Senior Middle…
Descriptors: Computer Assisted Testing, English (Second Language), Language Tests, Listening Comprehension Tests
Jingwen Wang; Ying Zheng; Yi Zou – Language Testing in Asia, 2024
Pearson Test of English Academic (PTE Academic), a high-stakes English language proficiency test, underwent substantial revisions in 2021. The test duration was reduced from 3 h to 2 h by reducing specific task numbers and sections. This study investigates the impact of these changes on teachers' perceptions and teaching practices, areas…
Descriptors: Foreign Countries, High Stakes Tests, Language Proficiency, Language Tests
Dong, Yixiao; Clements, Douglas H.; Day-Hess, Crystal A.; Sarama, Julie; Dumas, Denis – Journal of Psychoeducational Assessment, 2021
Psychometric work with young children faces the particular challenge that children's attention spans are relatively short, and therefore, shorter assessments are required while retaining comprehensive coverage. This article reports on three empirical studies that encompass the development and validation of the research-based early mathematics…
Descriptors: Young Children, Numeracy, Test Construction, Test Validity
Yasuda, Jun-ichiro; Hull, Michael M.; Mae, Naohiro – Physical Review Physics Education Research, 2022
This paper presents improvements made to a computerized adaptive testing (CAT)-based version of the FCI (FCI-CAT) in regards to test security and test efficiency. First, we will discuss measures to enhance test security by controlling for item overexposure, decreasing the risk that respondents may (i) memorize the content of a pretest for use on…
Descriptors: Adaptive Testing, Computer Assisted Testing, Test Items, Risk Management
Hill, Andrew P.; Donachie, Tracy – Journal of Psychoeducational Assessment, 2020
The measurement of perfectionistic cognitions has recently caused disagreement among researchers. Flett, Hewitt, Blankstein, and Gray proposed that perfectionistic cognitions are unidimensional. However, after re-examining the factor structure of the instrument used to measure perfectionistic automatic thoughts (Perfectionism Cognitions Inventory…
Descriptors: Factor Structure, Test Length, Cognitive Processes, Personality Traits
Casanova, Joana R.; Almeida, Leandro S.; Peixoto, Francisco; Ribeiro, Rui-Bártolo; Marôco, João – SAGE Open, 2019
Academic expectations play a significant role in the quality of student adaptation and academic success. Previous research suggests that expectations are a multidimensional construct, making it crucial to test the measures used for this important characteristic. Because assessment of student adaptation to higher education comprises a multitude of…
Descriptors: Foreign Countries, College Freshmen, Questionnaires, Expectation
Rueger, Sandra Y.; Cipra, Alli; Choe, Hyungjoon; Steggerda, Jake C.; Kirby, Andrea E.; Stone, Lauren B. – Journal of Psychoeducational Assessment, 2021
Measurement limitations have hindered research on learned helplessness (LH) and mastery orientation (MO) in the classroom. We reduced the 24-item Student Behavior Checklist to a 6-item scale and tested the abbreviated measure for evidence of reliability and validity in a sample of 5th and 6th graders (N = 299). We then replicated findings in an…
Descriptors: Student Behavior, Check Lists, Helplessness, Orientation