Alatli, Betül – International Journal of Curriculum and Instruction, 2022
This study reviewed the use of tests. It examined 45 articles, published between 2000 and 2020, that employed the Turkish form of the "Test Anxiety Inventory (TAI)," one of the tests frequently used in the field of education, in terms of factors that should be considered in…
Descriptors: Anxiety, Likert Scales, Test Anxiety, Test Reliability
Shun-Fu Hu; Amery D. Wu; Jake Stone – Journal of Educational Measurement, 2025
Scoring high-dimensional assessments (e.g., > 15 traits) can be a challenging task. This paper introduces the multilabel neural network (MNN) as a scoring method for high-dimensional assessments. Additionally, it demonstrates how MNN can score the same test responses to maximize different performance metrics, such as accuracy, recall, or…
Descriptors: Tests, Testing, Scores, Test Construction
Kotowicz, Justyna; Woll, Bencie; Herman, Rosalind – Language Testing, 2021
The evaluation of sign language proficiency needs to be based on measures with well-established psychometric properties. To date, no valid and reliable test is available to assess Polish Sign Language ("Polski Język Migowy," PJM) skills in deaf children. Hence, our aim with this study was to adapt the British Sign Language Receptive…
Descriptors: Language Tests, Receptive Language, Sign Language, Language Proficiency
Al-Owidha, Amjed A. – Language Testing in Asia, 2018
Background: This study investigated the psychometric properties of the recently developed Qiyas for L1 Arabic language test using a Rasch measurement framework. Methods: Responses from 271 examinees were analyzed in this study. The test is hypothesized to involve one dominant factor that assesses four skills: reading comprehension, rhetorical…
Descriptors: Semitic Languages, Language Tests, Psychometrics, Reading Comprehension
Galeoto, Giovanni; D'Elpidio, Giuliana; Alvaro, Rosaria; Zicari, Anna Maria; Valente, Donatella; Riccio, Marianna – International Association for Development of the Information Society, 2021
The Italian Disciplinary section of Test of Competences (TECO-D) project is an important longitudinal study used to analyze learning outcomes of ungraded students and to measure quality of the educational process. The aim of the present study was to evaluate the psychometric properties of the TECO-D in students enrolled in the Bachelor's Degree in…
Descriptors: Case Studies, Nursing Education, Psychometrics, Longitudinal Studies
Ketterlin-Geller, Leanne R.; Perry, Lindsey; Platas, Linda M.; Sitbakhan, Yasmin – Global Education Review, 2018
Test scoring procedures should align with the intended uses and interpretations of test results. In this paper, we examine three test scoring procedures for an operational assessment of early numeracy, the Early Grade Mathematics Assessment (EGMA). The EGMA is an assessment that tests young children's foundational mathematics knowledge and has…
Descriptors: Alignment (Education), Scoring, Test Use, Mathematics Tests
International Journal of Testing, 2019
These guidelines describe considerations relevant to the assessment of test takers in or across countries or regions that are linguistically or culturally diverse. The guidelines were developed by a committee of experts to help inform test developers, psychometricians, test users, and test administrators about fairness issues in support of the…
Descriptors: Test Bias, Student Diversity, Cultural Differences, Language Usage
Proctor, Thomas P.; Kim, YoungKoung Rachel – College Board, 2009
Presented at the national conference for the American Educational Research Association (AERA) in April 2009. This study examined the utility of scores on the SAT writing test, specifically examining the reliability of scores using generalizability and item response theories. The study also provides an overview of current predictive validity…
Descriptors: College Entrance Examinations, Writing Tests, Psychometrics, Predictive Validity
Caselman, Tonia D.; Self, Patricia A. – Children & Schools, 2008
Early identification of social-emotional behavioral problems in infants and preschoolers is critical. Nine parent-report and caregiver/teacher-report instruments measuring preschool social-emotional behavioral problems and strengths are reviewed. Advantages to the use of parent-report and caregiver/teacher-report instruments are that they are easy…
Descriptors: Identification, Psychometrics, Evaluation Methods, Child Caregivers
Ruble, Thomas L.; Stout, David E. – 1994
This paper reviews and critically evaluates the psychometric properties of Kolb's Learning Style Inventory (LSI). The LSI was developed originally in the 1970s (Kolb, 1976a) and was revised in the 1980s (Kolb, 1985). Although the LSI has been very popular, extensive evidence available in the published literature indicates that both the original…
Descriptors: Cognitive Style, Construct Validity, Learning, Psychometrics

Burrell, Brenda; And Others – Educational and Psychological Measurement, 1995
The measurement characteristics of the Perceived Adequacy of Resources Scale, a measure of family functioning, were investigated. The reliability and validity of total and subtest scores were studied with 113 mothers. Results were generally favorable regarding the integrity of scores from the measure. (SLD)
Descriptors: Family Characteristics, Mothers, Psychometrics, Scores

Johnson, Mark E.; Fisher, Dennis G.; Rhodes, Fen; Booth, Robert – Assessment, 1996
The Wide Range Achievement Test-Revised and the Woodcock Reading Mastery Tests-Revised were administered twice to 269 current drug abusers over an average time interval of 204.2 days. Overall, the study demonstrates that the two instruments have strong psychometric properties and that results from current drug abusers are reliable. (SLD)
Descriptors: Adults, Concurrent Validity, Drug Abuse, Psychometrics

Schutz, Richard E. – Educational Evaluation and Policy Analysis, 1985
This paper updates the concept of test validity. The new conception comprises 10 categories grouped in pairs: curriculum and instructional validity, statutory and forensic validity, media and journalistic validity, political and legislative validity, and partisan and activist validity. (Author/DWH)
Descriptors: Educational Testing, Politics of Education, Predictive Validity, Psychometrics

Birchler, Gary R.; Fals-Stewart, William – Assessment, 1994
The Response to Conflict Scale, a 24-item measure of maladaptive responses to marital conflict, was evaluated psychometrically with 420 couples. The inventory showed high internal consistency, test-retest reliability, construct and discriminant validity, and classification efficiency. Clinical utility is discussed. (SLD)
Descriptors: Classification, Conflict, Construct Validity, Marital Instability
Bailey, E. J.; Bricker, Diane – Journal of the Division for Early Childhood, 1986
Twenty-two handicapped and 10 nonhandicapped young children were administered the Evaluation and Programing System Level 1 (EPS-1), a measure designed to provide information to develop educational programs and assess program effectiveness. The article notes reliability, validity, and utility data. (Author/CL)
Descriptors: Criterion Referenced Tests, Disabilities, Early Childhood Education, Program Evaluation