Stefanie A. Wind; Benjamin Lugu; Yurou Wang – International Journal of Testing, 2025
Mokken Scale Analysis (MSA) is a nonparametric approach that offers exploratory tools for understanding the nature of item responses while emphasizing invariance requirements. MSA is often discussed as it relates to Rasch measurement theory, which also emphasizes invariance, but uses parametric models. Researchers who have compared and combined…
Descriptors: Item Response Theory, Scaling, Surveys, Evaluation Methods
Sideridis, Georgios; Tsaousis, Ioannis; Ghamdi, Hanan – Educational and Psychological Measurement, 2023
The purpose of the present study was to provide the means to evaluate the "interval-scaling" assumption that governs the use of parametric statistics and continuous data estimators in self-report instruments that utilize Likert-type scaling. Using simulated and real data, the methodology to test for this important assumption is evaluated…
Descriptors: Intervals, Scaling, Computer Software, Likert Scales
Tülin Otbiçer Acar – Measurement: Interdisciplinary Research and Perspectives, 2024
The aim of this study is to compare the results of correlation coefficient estimation of reliability with those obtained through the Bland-Altman plot technique. The scale was first divided into two halves using three different approaches. A linear and high-level relationship was found between the scale scores obtained from the halved forms.…
Descriptors: High School Students, Measurement Techniques, Psychometrics, Comparative Testing
Williams, Ross; de Rassenfosse, Gaétan – Studies in Higher Education, 2016
National and international rankings of universities are now an accepted part of the higher education landscape. Rankings aggregate different performance measures into a single scale and therefore depend on the methods and weights used to aggregate. The most common method is to scale each variable relative to the highest performing entity prior to…
Descriptors: Higher Education, Achievement Rating, Outcome Measures, Scaling
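The aggregation step described in the abstract above (scaling each variable relative to the highest performer, then combining with weights) can be sketched as follows. This is only an illustrative sketch, not the authors' procedure: the function names `scale_to_top` and `aggregate`, the 0-100 range, and the weighted-average combination are assumptions made for the example.

```python
def scale_to_top(values):
    """Rescale raw values so the highest-performing entity scores 100."""
    top = max(values)
    if top <= 0:
        raise ValueError("the highest value must be positive")
    return [100.0 * v / top for v in values]


def aggregate(measures, weights):
    """Combine several measures into one score per entity.

    measures: dict mapping a measure name to a list of raw values,
              one value per entity, in the same entity order (assumed layout).
    weights:  dict mapping the same measure names to non-negative weights.
    """
    scaled = {name: scale_to_top(vals) for name, vals in measures.items()}
    total_weight = sum(weights[name] for name in measures)
    n_entities = len(next(iter(measures.values())))
    return [
        sum(weights[name] * scaled[name][i] for name in measures) / total_weight
        for i in range(n_entities)
    ]


# Two hypothetical entities measured on two hypothetical variables.
scores = aggregate({"output": [10, 20], "impact": [4, 2]},
                   {"output": 1, "impact": 1})
```

Changing the weights or the scaling rule changes the resulting order, which is the dependence on method that the abstract points to.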
He, Jia; Barrera-Pedemonte, Fabián; Buchholz, Janine – Assessment in Education: Principles, Policy & Practice, 2019
Noncognitive assessments in the Programme for International Student Assessment (PISA) and the Trends in International Mathematics and Science Study (TIMSS) share certain similarities and provide complementary information, yet their comparability is seldom checked and convergence not sought. We made use of student self-report data of Instrumental Motivation,…
Descriptors: Foreign Countries, Secondary School Students, International Assessment, Elementary Secondary Education
Jacob, Brian A. – Center on Children and Families at Brookings, 2016
Contrary to popular belief, modern cognitive assessments--including the new Common Core tests--produce test scores based on sophisticated statistical models rather than the simple percent of items a student answers correctly. While there are good reasons for this, it means that reported test scores depend on many decisions made by test designers,…
Descriptors: Scores, Common Core State Standards, Test Length, Test Content
Köhler, Carmen; Pohl, Steffi; Carstensen, Claus H. – Educational and Psychological Measurement, 2015
When competence tests are administered, subjects frequently omit items. These missing responses pose a threat to correctly estimating the proficiency level. Newer model-based approaches aim to take nonignorable missing data processes into account by incorporating a latent missing propensity into the measurement model. Two assumptions are typically…
Descriptors: Competence, Tests, Evaluation Methods, Adults
Geigle, Chase – ProQuest LLC, 2018
There are two primary challenges for instructors in offering a high-quality course at large scale. The first is scaling educational experiences to such a large audience. The second major challenge encountered is that of enabling adaptivity of the educational experience. This thesis addresses both major challenges in the way of high-quality…
Descriptors: Barriers, Educational Quality, Computer Assisted Testing, Educational Experience
Charles, Karen; Canales, J. D.; Smith, Angela; Zimmerman, Natalie – Science Scope, 2012
Scale measurement and ratio and proportion are topics that fall clearly in the middle-grades mathematics curriculum in Texas. So does the solar system. In their experience, the authors have found that students have trouble manipulating, much less comprehending, very large numbers and very small numbers. These concepts can be brought into students'…
Descriptors: Mathematics Curriculum, Scaling, Astronomy, Measures (Individuals)
Simon, Samuel H. – Music Educators Journal, 2014
In music education, current assessment trends emphasize student reflection, tracking progress over time, and formative as well as summative measures. This view of assessment requires instrumental music educators to modernize their approaches without interfering with methods that have proven to be successful. To this end, the Longitudinal Scales…
Descriptors: Longitudinal Studies, Music Education, Measurement Techniques, Evaluation Methods
Kwon, YoungOk – ProQuest LLC, 2011
Recommender systems are becoming an increasingly important research area due to the growing demand for personalized recommendations. The volume of information available to each user and the number of products carried in e-commerce marketplaces have grown tremendously. Thus, recommender systems are needed to help individual users find the most…
Descriptors: Criteria, Accuracy, Cultural Differences, Scaling
Almehrizi, Rashid S. – Applied Psychological Measurement, 2013
The majority of large-scale assessments develop various score scales that are either linear or nonlinear transformations of raw scores for better interpretations and uses of assessment results. The current formula for coefficient alpha (α; the commonly used reliability coefficient) only provides internal consistency reliability estimates of raw…
Descriptors: Raw Scores, Scaling, Reliability, Computation
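For orientation, the standard raw-score formula for coefficient alpha that the abstract above refers to is α = k/(k − 1) · (1 − Σ item variances / variance of total scores), where k is the number of items. The sketch below implements only that raw-score formula, not any scale-score extension the article may propose. The function name `coefficient_alpha` and the list-of-rows data layout are assumptions for illustration.

```python
from statistics import variance


def coefficient_alpha(scores):
    """Raw-score coefficient alpha.

    scores: list of rows, one row per respondent, each row holding that
            respondent's score on every item (assumed layout).
    """
    k = len(scores[0])
    if k < 2:
        raise ValueError("alpha needs at least two items")
    # Sample variance of each item across respondents.
    item_vars = [variance([row[j] for row in scores]) for j in range(k)]
    # Sample variance of the total (summed) raw score.
    total_var = variance([sum(row) for row in scores])
    if total_var == 0:
        raise ValueError("total scores have zero variance")
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


# Three hypothetical respondents answering two items.
alpha = coefficient_alpha([[1, 2], [2, 1], [3, 3]])
```

Because the formula is computed from raw item and total scores, it says nothing directly about the reliability of transformed scale scores, which is the gap the abstract describes.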
Slocum-Gori, Suzanne L.; Zumbo, Bruno D. – Social Indicators Research, 2011
Whenever one uses a composite scale score from item responses, one is tacitly assuming that the scale is dominantly unidimensional. Investigating the unidimensionality of item response data is an essential component of construct validity. Yet, there is no universally accepted technique or set of rules to determine the number of factors to retain…
Descriptors: Sample Size, Construct Validity, Measures (Individuals), Hypothesis Testing
Reeve, Suzanne; Kitchen, Elizabeth; Sudweeks, Richard R.; Bell, John D.; Bradshaw, William S. – Journal of Applied Measurement, 2011
This article describes the development of a ten-item scale to assess biology majors' self-efficacy towards the critical thinking and data analysis skills taught in an upper-division cell biology course. The original seven-item scale was expanded to include three additional items based on the results of item analysis. Evidence of reliability and…
Descriptors: Majors (Students), Self Efficacy, Measures (Individuals), Biology
Milner-Bolotin, Marina – Science Education Review, 2009
This paper discusses the concept of scaling and its biological and engineering applications. Scaling, in a scientific context, means proportional adjustment of the dimensions of an object so that the adjusted and original objects have similar shapes yet different dimensions. The paper provides an example of a hands-on, minds-on activity on scaling…
Descriptors: Scaling, Science Education, Science Curriculum, Experiments