Publication Date
In 2025 | 3 |
Since 2024 | 6 |
Since 2021 (last 5 years) | 12 |
Since 2016 (last 10 years) | 32 |
Descriptor
Test Length | 32 |
Test Reliability | 32 |
Test Validity | 12 |
Test Items | 11 |
Psychometrics | 9 |
Scores | 9 |
Computer Assisted Testing | 8 |
Foreign Countries | 8 |
Item Response Theory | 7 |
Language Tests | 7 |
Elementary School Students | 6 |
More ▼ |
Source
Author
Publication Type
Journal Articles | 29 |
Reports - Research | 27 |
Reports - Evaluative | 4 |
Speeches/Meeting Papers | 2 |
Numerical/Quantitative Data | 1 |
Reports - Descriptive | 1 |
Tests/Questionnaires | 1 |
Education Level
Higher Education | 7 |
Postsecondary Education | 7 |
Elementary Education | 6 |
Early Childhood Education | 3 |
Grade 7 | 3 |
Junior High Schools | 3 |
Middle Schools | 3 |
Primary Education | 3 |
Secondary Education | 3 |
Grade 2 | 2 |
Grade 3 | 2 |
More ▼ |
Audience
Location
China | 3 |
Canada | 2 |
Alabama | 1 |
Australia | 1 |
California | 1 |
Illinois (Chicago) | 1 |
Indiana | 1 |
Ireland | 1 |
Japan | 1 |
Maryland | 1 |
New Zealand | 1 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
ACTFL Oral Proficiency… | 1 |
MacArthur Communicative… | 1 |
Measures of Academic Progress | 1 |
Multidimensional… | 1 |
Peabody Picture Vocabulary… | 1 |
Positive and Negative Affect… | 1 |
Woodcock Johnson Tests of… | 1 |
What Works Clearinghouse Rating
Chia-Ying Chu; Pei-Hua Chen; Yi-Shin Tsai; Chieh-An Chen; Yi-Chih Chan; Yan-Jhe Ciou – Journal of Deaf Studies and Deaf Education, 2024
This study investigated the impact of language sample length on mean length of utterance (MLU) and aimed to determine the minimum number of utterances required for a reliable MLU. Conversations were collected from Mandarin-speaking, hard-of-hearing and typical-hearing children aged 16-81 months. The MLUs were calculated using sample sizes ranging…
Descriptors: Foreign Countries, Mandarin Chinese, Young Children, Language Acquisition
María Vicent; Andrea Fuster; María Pérez-Marco; María del Pilar Aparicio-Flores – Journal of Psychoeducational Assessment, 2025
Although the original long version of the Hewitt Multidimensional Perfectionism Scale (HMPS) has been translated and validated in a Spanish population, no study to date has examined the psychometric properties of a short version of the HMPS with a Spanish-speaking sample. For this reason, the aim of this study is to analyze the psychometric…
Descriptors: Personality Measures, Personality Traits, Spanish, Psychometrics
Tom Benton – Research Matters, 2024
Educational assessment is used throughout the world for a range of different formative and summative purposes. Wherever an assessment is developed, whether by a teacher creating a quiz for their class, or by a testing company creating a high stakes assessment, it is necessary to decide how long the test should be. Specifically, how many questions…
Descriptors: Foreign Countries, High Stakes Tests, Test Length, Test Construction
Yi-Jui I. Chen; Yi-Jhen Wu; Yi-Hsin Chen; Robin Irey – Journal of Psychoeducational Assessment, 2025
A short form of the 60-item computer-based orthographic processing assessment (long-form COPA or COPA-LF) was developed. The COPA-LF consists of five skills, including rapid perception, access, differentiation, correction, and arrangement. Thirty items from the COPA-LF were selected for the short-form COPA (COPA-SF) based on cognitive diagnostic…
Descriptors: Computer Assisted Testing, Test Length, Test Validity, Orthographic Symbols
Hakyung Sung; Sooyeon Cho; Kristopher Kyle – Language Assessment Quarterly, 2024
Lexical diversity (LD) is an important indicator of second language lexical development. Much research has investigated LD indices, with a focus on learners of English. However, further research is needed in languages that are typologically distinct from English, such as Korean. In this study, we evaluated the reliability and validity of LD…
Descriptors: Second Language Learning, Korean, Persuasive Discourse, Language Tests
Xiao, Leifeng; Hau, Kit-Tai – Applied Measurement in Education, 2023
We compared coefficient alpha with five alternatives (omega total, omega RT, omega h, GLB, and coefficient H) in two simulation studies. Results showed for unidimensional scales, (a) all indices except omega h performed similarly well for most conditions; (b) alpha is still good; (c) GLB and coefficient H overestimated reliability with small…
Descriptors: Test Theory, Test Reliability, Factor Analysis, Test Length
Ying Xu; Xiaodong Li; Jin Chen – Language Testing, 2025
This article provides a detailed review of the Computer-based English Listening Speaking Test (CELST) used in Guangdong, China, as part of the National Matriculation English Test (NMET) to assess students' English proficiency. The CELST measures listening and speaking skills as outlined in the "English Curriculum for Senior Middle…
Descriptors: Computer Assisted Testing, English (Second Language), Language Tests, Listening Comprehension Tests
Ellis, Jules L. – Educational and Psychological Measurement, 2021
This study develops a theoretical model for the costs of an exam as a function of its duration. Two kind of costs are distinguished: (1) the costs of measurement errors and (2) the costs of the measurement. Both costs are expressed in time of the student. Based on a classical test theory model, enriched with assumptions on the context, the costs…
Descriptors: Test Length, Models, Error of Measurement, Measurement
Raborn, Anthony W.; Leite, Walter L.; Marcoulides, Katerina M. – Educational and Psychological Measurement, 2020
This study compares automated methods to develop short forms of psychometric scales. Obtaining a short form that has both adequate internal structure and strong validity with respect to relationships with other variables is difficult with traditional methods of short-form development. Metaheuristic algorithms can select items for short forms while…
Descriptors: Test Construction, Automation, Heuristics, Mathematics
Benton, Tom – Research Matters, 2021
Computer adaptive testing is intended to make assessment more reliable by tailoring the difficulty of the questions a student has to answer to their level of ability. Most commonly, this benefit is used to justify the length of tests being shortened whilst retaining the reliability of a longer, non-adaptive test. Improvements due to adaptive…
Descriptors: Risk, Item Response Theory, Computer Assisted Testing, Difficulty Level
Ozdemir, Burhanettin; Gelbal, Selahattin – Education and Information Technologies, 2022
The computerized adaptive tests (CAT) apply an adaptive process in which the items are tailored to individuals' ability scores. The multidimensional CAT (MCAT) designs differ in terms of different item selection, ability estimation, and termination methods being used. This study aims at investigating the performance of the MCAT designs used to…
Descriptors: Scores, Computer Assisted Testing, Test Items, Language Proficiency
Casanova, Joana R.; Almeida, Leandro S.; Peixoto, Francisco; Ribeiro, Rui-Bártolo; Marôco, João – SAGE Open, 2019
Academic expectations play a significant role in the quality of student adaptation and academic success. Previous research suggests that expectations are a multidimensional construct, making it crucial to test the measures used for this important characteristic. Because assessment of student adaptation to higher education comprises a multitude of…
Descriptors: Foreign Countries, College Freshmen, Questionnaires, Expectation
Rueger, Sandra Y.; Cipra, Alli; Choe, Hyungjoon; Steggerda, Jake C.; Kirby, Andrea E.; Stone, Lauren B. – Journal of Psychoeducational Assessment, 2021
Measurement limitations have hindered research on learned helplessness (LH) and mastery orientation (MO) in the classroom. We reduced the 24-item Student Behavior Checklist to a 6-item scale and tested the abbreviated measure for evidence of reliability and validity in a sample of 5th and 6th graders (N = 299). We then replicated findings in an…
Descriptors: Student Behavior, Check Lists, Helplessness, Orientation
Precision of Single-Skill Math CBM Time-Series Data: The Effect of Probe Stratification and Set Size
Solomon, Benjamin G.; Payne, Lexy L.; Campana, Kayla V.; Marr, Erin A.; Battista, Carmela; Silva, Alex; Dawes, Jillian M. – Journal of Psychoeducational Assessment, 2020
Comparatively little research exists on single-skill math (SSM) curriculum-based measurements (CBMs) for the purpose of monitoring growth, as may be done in practice or when monitoring intervention effectiveness within group or single-case research. Therefore, we examined a common variant of SSM-CBM: 1 digit × 1 digit multiplication. Reflecting…
Descriptors: Curriculum Based Assessment, Mathematics Tests, Mathematics Skills, Multiplication
Rice, Kenneth G.; Srisarajivakul, Emily N.; Meyers, Joel; Varjas, Kris – School Psychology, 2019
One evaluation measure available through the Positive Behavioral Interventions and Supports framework is the Effective Behavior Support Self-Assessment Survey (SAS). Evaluations of the SAS have supported its factor structure. However, the SAS is designed to be completed by school personnel who are nested within other levels of analysis (e.g.,…
Descriptors: Factor Analysis, Factor Structure, Self Evaluation (Individuals), Teacher Surveys