Publication Date
In 2025 | 2 |
Since 2024 | 5 |
Since 2021 (last 5 years) | 9 |
Since 2016 (last 10 years) | 27 |
Since 2006 (last 20 years) | 49 |
Descriptor
Test Reliability | 87 |
Test Validity | 87 |
Scaling | 57 |
Test Construction | 43 |
Multidimensional Scaling | 30 |
Psychometrics | 27 |
Item Response Theory | 22 |
Scoring | 22 |
Item Analysis | 20 |
Factor Analysis | 19 |
Scores | 18 |
More ▼ |
Source
Author
Christensen, Rhonda | 3 |
Knezek, Gerald | 3 |
Abel, Jakob | 1 |
Abell, Neil | 1 |
Akaeze, Hope O. | 1 |
Alcorn, Charles L. | 1 |
Amery D. Wu | 1 |
Andersen, Martin S. | 1 |
Anderson, Ronald E. | 1 |
Anil Paswan | 1 |
Asikin, Yonathan | 1 |
More ▼ |
Publication Type
Education Level
Audience
Researchers | 3 |
Location
New York | 4 |
Canada | 2 |
Germany | 2 |
India | 2 |
Michigan | 2 |
Texas | 2 |
United States | 2 |
Australia | 1 |
Belgium | 1 |
California | 1 |
Illinois | 1 |
More ▼ |
Laws, Policies, & Programs
Individuals with Disabilities… | 2 |
No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Tülin Otbiçer Acar – Measurement: Interdisciplinary Research and Perspectives, 2024
The aim of this study is to compare the results of correlation coefficient estimation of reliability with those obtained through the Bland-Altman plot technique. The scale was first divided into two halves using three different approaches. A linear and high-level relationship was found between the scale scores obtained from the halved forms.…
Descriptors: High School Students, Measurement Techniques, Psychometrics, Comparative Testing
Shivam Kumar; Shridhar Patil; Anil Paswan; Swaraj Kumar Dutta; R. K. Sohane – Journal of Agricultural Education and Extension, 2024
Purpose: The study was aimed at measuring farmers' helpline services quality in India using a standardized multi-factor scale (HELPQUAL) developed as part of this study. Design/methodology/approach: The present study is based on 360 farmers' and 45 experts' responses gathered using telephonic interviews and mailed questionnaires during the year…
Descriptors: Agricultural Occupations, Help Seeking, Counseling Services, Rural Extension
Viola Merhof; Caroline M. Böhm; Thorsten Meiser – Educational and Psychological Measurement, 2024
Item response tree (IRTree) models are a flexible framework to control self-reported trait measurements for response styles. To this end, IRTree models decompose the responses to rating items into sub-decisions, which are assumed to be made on the basis of either the trait being measured or a response style, whereby the effects of such person…
Descriptors: Item Response Theory, Test Interpretation, Test Reliability, Test Validity
Shun-Fu Hu; Amery D. Wu; Jake Stone – Journal of Educational Measurement, 2025
Scoring high-dimensional assessments (e.g., > 15 traits) can be a challenging task. This paper introduces the multilabel neural network (MNN) as a scoring method for high-dimensional assessments. Additionally, it demonstrates how MNN can score the same test responses to maximize different performance metrics, such as accuracy, recall, or…
Descriptors: Tests, Testing, Scores, Test Construction
Sanford R. Student; Derek C. Briggs; Laurie Davis – Educational Measurement: Issues and Practice, 2025
Vertical scales are frequently developed using common item nonequivalent group linking. In this design, one can use upper-grade, lower-grade, or mixed-grade common items to estimate the linking constants that underlie the absolute measurement of growth. Using the Rasch model and a dataset from Curriculum Associates' i-Ready Diagnostic in math in…
Descriptors: Elementary School Mathematics, Elementary School Students, Middle School Mathematics, Middle School Students
Lukaschyk, Julia; Abel, Jakob; Brockmann-Bauser, Meike; Keilmann, Annerose; Braun, Angelika; Rohlfs, Anna-Katharina – Journal of Speech, Language, and Hearing Research, 2021
Purpose: The Vocal Tract Discomfort Scale (VTD Scale) is a self-rating questionnaire investigating physical symptoms in the larynx associated with vocal pathology. The aim of this work was to investigate the reliability, validity, sensitivity, and specificity of the first German version and to provide normative data with thresholds for pathology…
Descriptors: Questionnaires, Symptoms (Individual Disorders), Voice Disorders, Test Validity
Andersen, Martin S.; Makransky, Guido – Journal of Computer Assisted Learning, 2021
Measuring cognitive load is important in virtual learning environments (VLE). Thus, valid and reliable measures of cognitive load are important to support instructional design in VLE. Through three studies, we investigated the validity and reliability of Leppink's Cognitive Load Scale (CLS) and developed the extraneous cognitive load (EL)…
Descriptors: Test Construction, Test Validity, Test Reliability, Cognitive Processes
Akaeze, Hope O.; Wu, Jamie Heng-Chieh; Lawrence, Frank R.; Weber, Everett P. – Journal of Psychoeducational Assessment, 2023
This paper reports an investigation into the psychometric properties of the COR-Advantage1.5 (COR-Adv1.5) assessment tool, a criterion-referenced observation-based instrument designed to assess the developmental abilities of children from birth through kindergarten. Using data from 8534 children participating in a state-funded preschool program…
Descriptors: Criterion Referenced Tests, Evaluation Methods, Measures (Individuals), Measurement Techniques
Bilgen, Özge Bikmaz – World Journal of Education, 2020
The purpose of this study is to examine the validity of the scale for identifying gifted children, whose validity was proven by exploratory, confirmatory factor analysis, and whose reliability was proven the Cronbach alpha coefficient for identifying children in the 3-6 age group, using Mokken scaling based on nonparametric item response theory.…
Descriptors: Academically Gifted, Talent Identification, Measures (Individuals), Test Validity
Christensen, Rhonda; Knezek, Gerald – Journal of Technology Education, 2022
This article describes the development and validation of an Innovation Attitude Survey (IAS) composed of 16 Likert-type items selected to measure middle school students' attitudes toward innovation and leadership in the advancement of new ideas. The goal of developing the IAS was to identify desirable dispositions that may be related to future…
Descriptors: Attitude Measures, Likert Scales, Test Construction, Test Validity
Lee, Minji K.; Sweeney, Kevin; Melican, Gerald J. – Educational Assessment, 2017
This study investigates the relationships among factor correlations, inter-item correlations, and the reliability estimates of subscores, providing a guideline with respect to psychometric properties of useful subscores. In addition, it compares subscore estimation methods with respect to reliability and distinctness. The subscore estimation…
Descriptors: Scores, Test Construction, Test Reliability, Test Validity
Lowe, Patricia A. – Journal of Psychoeducational Assessment, 2018
The psychometric properties of a new, multidimensional measure of test anxiety, the Test Anxiety Measure for College Students (TAM-C), were examined in a sample of 720 undergraduate students. Results of confirmatory factor analyses provided support for a six-factor (Cognitive Interference, Physiological Hyperarousal, Social Concerns,…
Descriptors: Psychometrics, Test Anxiety, College Students, Measures (Individuals)
Wong, Venus; Ruble, Lisa A.; McGrew, John H.; Yu, Yue – School Psychology Quarterly, 2018
Consultation is essential to the daily practice of school psychologists (National Association of School Psychologist, 2010). Successful consultation requires fidelity at both the consultant (implementation) and consultee (intervention) levels. We applied a multidimensional, multilevel conception of fidelity (Dunst, Trivette, & Raab, 2013) to a…
Descriptors: Fidelity, Consultation Programs, School Psychology, Intervention
Costa, Sebastiano; Ingoglia, Sonia; Inguglia, Cristiano; Liga, Francesca; Lo Coco, Alida; Larcan, Rosalba – Measurement and Evaluation in Counseling and Development, 2018
The purpose of this multistudy report was to adapt the Basic Psychological Need Satisfaction and Frustration Scale (BPNSFS) to the Italian context. Two studies were conducted. In Study 1, we investigated the dimensionality, reliability, and convergent and discriminant validity of the instrument in a sample of 544 participants (males = 41%) from 16…
Descriptors: Psychological Needs, Psychometrics, Need Gratification, Gender Differences
Romine, William L.; Schaffer, Dane L.; Barrow, Lloyd – International Journal of Science Education, 2015
We describe the development and validation of a three-tiered diagnostic test of the water cycle (DTWC) and use it to evaluate the impact of prior learning experiences on undergraduates' misconceptions. While most approaches to instrument validation take a positivist perspective using singular criteria such as reliability and fit with a measurement…
Descriptors: Undergraduate Students, Diagnostic Tests, Water, Item Response Theory