ERIC - Search Results

Showing 1 to 15 of 20 results

Scale Alignment in Between-Item Multidimensional Rasch Models

Peer reviewed

Feuerstahler, Leah; Wilson, Mark – Journal of Educational Measurement, 2019

Scores estimated from multidimensional item response theory (IRT) models are not necessarily comparable across dimensions. In this article, the concept of aligned dimensions is formalized in the context of Rasch models, and two methods are described--delta dimensional alignment (DDA) and logistic regression alignment (LRA)--to transform estimated…

Descriptors: Item Response Theory, Models, Scores, Comparative Analysis

An Introduction to the Analysis of Ranked Response Data

Peer reviewed

Finch, Holmes – Practical Assessment, Research & Evaluation, 2022

Researchers in many disciplines work with ranking data. This data type is unique in that it is often deterministic in nature (the ranks of items "k"-1 determine the rank of item "k"), and the difference in a pair of rank scores separated by "k" units is equivalent regardless of the actual values of the two ranks in…

Descriptors: Data Analysis, Statistical Inference, Models, College Faculty

Test Assembly Implications for Providing Reliable and Valid Subscores

Peer reviewed

Lee, Minji K.; Sweeney, Kevin; Melican, Gerald J. – Educational Assessment, 2017

This study investigates the relationships among factor correlations, inter-item correlations, and the reliability estimates of subscores, providing a guideline with respect to psychometric properties of useful subscores. In addition, it compares subscore estimation methods with respect to reliability and distinctness. The subscore estimation…

Descriptors: Scores, Test Construction, Test Reliability, Test Validity

A Fuzzy Group Decision Making Model for Ordinal Peer Assessment

Peer reviewed

Capuano, Nicola; Loia, Vincenzo; Orciuoli, Francesco – IEEE Transactions on Learning Technologies, 2017

Massive Open Online Courses (MOOCs) are becoming an increasingly popular choice for education but, to reach their full extent, they require the resolution of new issues like assessing students at scale. A feasible approach to tackle this problem is peer assessment, in which students also play the role of assessor for assignments submitted by…

Descriptors: Participative Decision Making, Models, Peer Evaluation, Online Courses

Statistical Methods for Assessments in Simulations and Serious Games. Research Report. ETS RR-14-12

Peer reviewed

Fu, Jianbin; Zapata, Diego; Mavronikolas, Elia – ETS Research Report Series, 2014

Simulation or game-based assessments produce outcome data and process data. In this article, some statistical models that can potentially be used to analyze data from simulation or game-based assessments are introduced. Specifically, cognitive diagnostic models that can be used to estimate latent skills from outcome data so as to scale these…

Descriptors: Simulation, Evaluation Methods, Games, Data Collection

Taking the Missing Propensity into Account When Estimating Competence Scores: Evaluation of Item Response Theory Models for Nonignorable Omissions

Peer reviewed

Köhler, Carmen; Pohl, Steffi; Carstensen, Claus H. – Educational and Psychological Measurement, 2015

When competence tests are administered, subjects frequently omit items. These missing responses pose a threat to correctly estimating the proficiency level. Newer model-based approaches aim to take nonignorable missing data processes into account by incorporating a latent missing propensity into the measurement model. Two assumptions are typically…

Descriptors: Competence, Tests, Evaluation Methods, Adults

In Search of Validity Evidence in Support of the Interpretation and Use of Assessments of Complex Constructs: Discussion of Research on Assessing 21st Century Skills

Peer reviewed

Ercikan, Kadriye; Oliveri, María Elena – Applied Measurement in Education, 2016

Assessing complex constructs such as those discussed under the umbrella of 21st century constructs highlights the need for a principled assessment design and validation approach. In our discussion, we made a case for three considerations: (a) taking construct complexity into account across various stages of assessment development such as the…

Descriptors: Evaluation Methods, Test Construction, Design, Scaling

Curricular Design Analysis: A Data-Driven Perspective

Peer reviewed

Méndez, Gonzalo; Ochoa, Xavier; Chiluiza, Katherine; de Wever, Bram – Journal of Learning Analytics, 2014

Learning analytics has been used as a tool to improve the learning process mainly at the micro-level (courses and activities). However, another of the key promises of learning analytics research is to create tools that could help educational institutions at the meso- and macro-level to gain better insight into the inner workings of their programs…

Descriptors: Data Analysis, Data Collection, Educational Research, Curriculum Design

The Role of Fractality in Perceptual Learning: Exploration in Dynamic Touch

Peer reviewed

Stephen, Damian G.; Arzamarski, Ryan; Michaels, Claire F. – Journal of Experimental Psychology: Human Perception and Performance, 2010

Perceptual systems must learn to explore and to use the resulting information to hone performance. Optimal performance depends on using information available at many time scales, from the near instantaneous values of variables underlying perception (i.e., detection), to longer term information about appropriate scaling (i.e., calibration), to yet…

Descriptors: Scaling, Systems Approach, Geometric Concepts, Experimental Psychology

Traditional versus Rasch Scaling of Aggregate Data in the Multitrait-Multimethod Matrix.

Turner, Carol J.; Smith, Jeffrey K. – Measurement and Evaluation in Guidance, 1982

Used aggregate ratings of teacher behavior as data for a multitrait-multimethod validity analysis. Scaled ratings using Rasch latent trait scaling model and traditional scaling techniques. Compared Rasch-scaled multitrait-multimethod matrix to the traditionally scaled multitrait-multimethod matrix. Results showed Rasch scaling resulted in higher…

Descriptors: Children, Comparative Testing, Data Analysis, Elementary Education

Parallelogram Scaling of Binary Items.

Zatkin, Judith; And Others – 1983

A scaling procedure has been developed for ordering binary parallelogram preference data. The procedure uses minimum variance of the item ranks averaged across persons as the optimization criterion. Two seriation strategies are employed. One is pairwise interchange. The second joins together the vector end points and breaks this circle between…

Descriptors: Data Analysis, Evaluation Methods, Item Analysis, Measurement Techniques

Gathering and Analyzing Content Validity Data.

Peer reviewed

Sireci, Stephen G. – Educational Assessment, 1998

Describes content-validity theory and illustrates new and traditional approaches for conducting content-validity studies. Newer approaches are based on multidimensional scaling analysis of item-similarity ratings, while traditional approaches are based on ratings of item-objective congruence and relevance. (Author/SLD)

Descriptors: Content Validity, Data Analysis, Evaluation Methods, Multidimensional Scaling

A Q3 Statistic for Unfolding Item Response Theory Models: Assessment of Unidimensionality with Two Factors and Simple Structure

Peer reviewed

Habing, Brian; Finch, Holmes; Roberts, James S. – Applied Psychological Measurement, 2005

Although there are many methods available for dimensionality assessment for items with monotone item response functions, there are few methods available for unfolding item response theory models. In this study, a modification of Yen's Q3 statistic is proposed for the case of these nonmonotone item response models. Through a simulation study, the…

Descriptors: Data Analysis, Simulation, Multidimensional Scaling, Item Response Theory

Interrater Agreement: Same Data, Different Definitions, Different Outcomes.

Micceri, Theodore; And Others – 1987

Several issues relating to agreement estimates for different types of data from performance evaluations are considered. New indices of agreement are presented for ordinal level items and for summative scores produced by nominal or ordinal level items. Two sets of empirical data illustrate the performance of the two formulas derived to estimate…

Descriptors: Correlation, Data Analysis, Educational Research, Estimation (Mathematics)

Assessing the Validity of the National Assessment of Educational Progress: NAEP Technical Review Panel White Paper.

Linn, Robert L.; Baker, Eva L. – 1996

During the past 6 years, under a contract from the National Center for Education Statistics, a Technical Review Panel has overseen and conducted a series of research studies addressing a range of validity questions relevant to the various uses and interpretations of the National Assessment of Educational Progress (NAEP). Study topics included: (1)…

Descriptors: Achievement Tests, Comparative Analysis, Data Analysis, Educational Policy
