Feuerstahler, Leah; Wilson, Mark – Journal of Educational Measurement, 2019
Scores estimated from multidimensional item response theory (IRT) models are not necessarily comparable across dimensions. In this article, the concept of aligned dimensions is formalized in the context of Rasch models, and two methods are described--delta dimensional alignment (DDA) and logistic regression alignment (LRA)--to transform estimated…
Descriptors: Item Response Theory, Models, Scores, Comparative Analysis
Finch, Holmes – Practical Assessment, Research & Evaluation, 2022
Researchers in many disciplines work with ranking data. This data type is unique in that it is often deterministic in nature (the ranks of items "k"-1 determine the rank of item "k"), and the difference in a pair of rank scores separated by "k" units is equivalent regardless of the actual values of the two ranks in…
Descriptors: Data Analysis, Statistical Inference, Models, College Faculty
Lee, Minji K.; Sweeney, Kevin; Melican, Gerald J. – Educational Assessment, 2017
This study investigates the relationships among factor correlations, inter-item correlations, and the reliability estimates of subscores, providing a guideline with respect to psychometric properties of useful subscores. In addition, it compares subscore estimation methods with respect to reliability and distinctness. The subscore estimation…
Descriptors: Scores, Test Construction, Test Reliability, Test Validity
Capuano, Nicola; Loia, Vincenzo; Orciuoli, Francesco – IEEE Transactions on Learning Technologies, 2017
Massive Open Online Courses (MOOCs) are becoming an increasingly popular choice for education but, to reach their full extent, they require the resolution of new issues like assessing students at scale. A feasible approach to tackle this problem is peer assessment, in which students also play the role of assessor for assignments submitted by…
Descriptors: Participative Decision Making, Models, Peer Evaluation, Online Courses
Fu, Jianbin; Zapata, Diego; Mavronikolas, Elia – ETS Research Report Series, 2014
Simulation or game-based assessments produce outcome data and process data. In this article, some statistical models that can potentially be used to analyze data from simulation or game-based assessments are introduced. Specifically, cognitive diagnostic models that can be used to estimate latent skills from outcome data so as to scale these…
Descriptors: Simulation, Evaluation Methods, Games, Data Collection
Köhler, Carmen; Pohl, Steffi; Carstensen, Claus H. – Educational and Psychological Measurement, 2015
When competence tests are administered, subjects frequently omit items. These missing responses pose a threat to correctly estimating the proficiency level. Newer model-based approaches aim to take nonignorable missing data processes into account by incorporating a latent missing propensity into the measurement model. Two assumptions are typically…
Descriptors: Competence, Tests, Evaluation Methods, Adults
Ercikan, Kadriye; Oliveri, María Elena – Applied Measurement in Education, 2016
Assessing complex constructs such as those discussed under the umbrella of 21st century constructs highlights the need for a principled assessment design and validation approach. In our discussion, we made a case for three considerations: (a) taking construct complexity into account across various stages of assessment development such as the…
Descriptors: Evaluation Methods, Test Construction, Design, Scaling
Méndez, Gonzalo; Ochoa, Xavier; Chiluiza, Katherine; de Wever, Bram – Journal of Learning Analytics, 2014
Learning analytics has been used as a tool to improve the learning process mainly at the micro-level (courses and activities). However, another of the key promises of learning analytics research is to create tools that could help educational institutions at the meso- and macro-level to gain better insight into the inner workings of their programs…
Descriptors: Data Analysis, Data Collection, Educational Research, Curriculum Design
Stephen, Damian G.; Arzamarski, Ryan; Michaels, Claire F. – Journal of Experimental Psychology: Human Perception and Performance, 2010
Perceptual systems must learn to explore and to use the resulting information to hone performance. Optimal performance depends on using information available at many time scales, from the near instantaneous values of variables underlying perception (i.e., detection), to longer term information about appropriate scaling (i.e., calibration), to yet…
Descriptors: Scaling, Systems Approach, Geometric Concepts, Experimental Psychology
Turner, Carol J.; Smith, Jeffrey K. – Measurement and Evaluation in Guidance, 1982
Used aggregate ratings of teacher behavior as data for a multitrait-multimethod validity analysis. Scaled ratings using Rasch latent trait scaling model and traditional scaling techniques. Compared Rasch-scaled multitrait-multimethod matrix to the traditionally scaled multitrait-multimethod matrix. Results showed Rasch scaling resulted in higher…
Descriptors: Children, Comparative Testing, Data Analysis, Elementary Education
Zatkin, Judith; And Others – 1983
A scaling procedure has been developed for ordering binary parallelogram preference data. The procedure uses minimum variance of the item ranks averaged across persons as the optimization criterion. Two seriation strategies are employed. One is pairwise interchange. The second joins together the vector end points and breaks this circle between…
Descriptors: Data Analysis, Evaluation Methods, Item Analysis, Measurement Techniques

Sireci, Stephen G. – Educational Assessment, 1998
Describes content-validity theory and illustrates new and traditional approaches for conducting content-validity studies. Newer approaches are based on multidimensional scaling analysis of item-similarity ratings, while traditional approaches are based on ratings of item-objective congruence and relevance. (Author/SLD)
Descriptors: Content Validity, Data Analysis, Evaluation Methods, Multidimensional Scaling
Habing, Brian; Finch, Holmes; Roberts, James S. – Applied Psychological Measurement, 2005
Although there are many methods available for dimensionality assessment for items with monotone item response functions, there are few methods available for unfolding item response theory models. In this study, a modification of Yen's Q3 statistic is proposed for the case of these nonmonotone item response models. Through a simulation study, the…
Descriptors: Data Analysis, Simulation, Multidimensional Scaling, Item Response Theory
Micceri, Theodore; And Others – 1987
Several issues relating to agreement estimates for different types of data from performance evaluations are considered. New indices of agreement are presented for ordinal level items and for summative scores produced by nominal or ordinal level items. Two sets of empirical data illustrate the performance of the two formulas derived to estimate…
Descriptors: Correlation, Data Analysis, Educational Research, Estimation (Mathematics)
Linn, Robert L.; Baker, Eva L. – 1996
During the past 6 years, under a contract from the National Center for Education Statistics, a Technical Review Panel has overseen and conducted a series of research studies addressing a range of validity questions relevant to the various uses and interpretations of the National Assessment of Educational Progress (NAEP). Study topics included: (1)…
Descriptors: Achievement Tests, Comparative Analysis, Data Analysis, Educational Policy