ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	3
Since 2006 (last 20 years)	13

Descriptor

Item Analysis	21
Statistical Analysis	21
Test Items	8
Test Construction	6
Foreign Countries	5
Factor Analysis	4
Computation	3
Computer Software	3
Correlation	3
Difficulty Level	3
Educational Assessment	3
Equated Scores	3
Higher Education	3
Secondary Education	3
Test Validity	3
Distance Education	2
Elementary Secondary Education	2
Evaluation Methods	2
Guidelines	2
Latent Trait Theory	2
Measures (Individuals)	2
Methods Research	2
Multidimensional Scaling	2
Qualitative Research	2
Reliability	2
More ▼

Publication Type

Reports - Descriptive	21
Journal Articles	18
Speeches/Meeting Papers	2
Guides - Non-Classroom	1
Numerical/Quantitative Data	1
Opinion Papers	1
Reports - Research	1
Tests/Questionnaires	1

Education Level

Elementary Education	2
Grade 3	2
Grade 5	2
Grade 8	2
Higher Education	2
Middle Schools	2
Elementary Secondary Education	1
Grade 4	1
Grade 6	1
Grade 7	1
Junior High Schools	1
Postsecondary Education	1
Secondary Education	1
More ▼

Audience

Researchers	3
Practitioners	2

Location

Hong Kong	1
Israel	1
Malaysia	1
Maryland	1
Netherlands	1
Turkey	1

Laws, Policies, & Programs

Assessments and Surveys

National Assessment of…

What Works Clearinghouse Rating

Showing 1 to 15 of 21 results Save | Export

Using SAS PROC IRT for Multidimensional Item Response Theory Analysis

Peer reviewed

Direct link

Cole, Ki; Paek, Insu – Measurement: Interdisciplinary Research and Perspectives, 2022

Statistical Analysis Software (SAS) is a widely used tool for data management analysis across a variety of fields. The procedure for item response theory (PROC IRT) is one to perform unidimensional and multidimensional item response theory (IRT) analysis for dichotomous and polytomous data. This review provides a summary of the features of PROC…

Descriptors: Item Response Theory, Computer Software, Item Analysis, Statistical Analysis

Easier Said than Done: Rejoinder on Sijtsma and on Green and Yang

Peer reviewed

Direct link

Davenport, Ernest C.; Davison, Mark L.; Liou, Pey-Yan; Love, Quintin U. – Educational Measurement: Issues and Practice, 2016

The main points of Sijtsma and Green and Yang in Educational Measurement: Issues and Practice (34, 4) are that reliability, internal consistency, and unidimensionality are distinct and that Cronbach's alpha may be problematic. Neither of these assertions are at odds with Davenport, Davison, Liou, and Love in the same issue. However, many authors…

Descriptors: Educational Assessment, Reliability, Validity, Test Construction

Analysis of the Difficulty and Discrimination Indices of Multiple-Choice Questions According to Cognitive Levels in an Open and Distance Learning Context

Peer reviewed
PDF on ERIC

Download full text

Koçdar, Serpil; Karadag, Nejdet; Sahin, Murat Dogan – Turkish Online Journal of Educational Technology - TOJET, 2016

This is a descriptive study which intends to determine whether the difficulty and discrimination indices of the multiple-choice questions show differences according to cognitive levels of the Bloom's Taxonomy, which are used in the exams of the courses in a business administration bachelor's degree program offered through open and distance…

Descriptors: Multiple Choice Tests, Difficulty Level, Distance Education, Open Education

Classical Item Analysis Using Latent Variable Modeling: A Note on a Direct Evaluation Procedure

Peer reviewed

Direct link

Raykov, Tenko; Marcoulides, George A. – Structural Equation Modeling: A Multidisciplinary Journal, 2011

A directly applicable latent variable modeling procedure for classical item analysis is outlined. The method allows one to point and interval estimate item difficulty, item correlations, and item-total correlations for composites consisting of categorical items. The approach is readily employed in empirical research and as a by-product permits…

Descriptors: Item Analysis, Evaluation, Correlation, Test Items

Quality Control Charts in Large-Scale Assessment Programs

Peer reviewed

Direct link

Schafer, William D.; Coverdale, Bradley J.; Luxenberg, Harlan; Jin, Ying – Practical Assessment, Research & Evaluation, 2011

There are relatively few examples of quantitative approaches to quality control in educational assessment and accountability contexts. Among the several techniques that are used in other fields, Shewart charts have been found in a few instances to be applicable in educational settings. This paper describes Shewart charts and gives examples of how…

Descriptors: Charts, Quality Control, Educational Assessment, Statistical Analysis

SIMREL: Software for Coefficient Alpha and Its Confidence Intervals with Monte Carlo Studies

Peer reviewed

Direct link

Yurdugul, Halil – Applied Psychological Measurement, 2009

This article describes SIMREL, a software program designed for the simulation of alpha coefficients and the estimation of its confidence intervals. SIMREL runs on two alternatives. In the first one, if SIMREL is run for a single data file, it performs descriptive statistics, principal components analysis, and variance analysis of the item scores…

Descriptors: Intervals, Monte Carlo Methods, Computer Software, Factor Analysis

Cross-Cultural Equivalence and Psychometric Properties of the Traditional Chinese Version of the Inviting School Survey-Revised

Peer reviewed
PDF on ERIC

Download full text

Direct link

Smith, Kenneth H. – Journal of Invitational Theory and Practice, 2011

The Inviting School Survey-Revised (ISS-R) was adapted and translated into Traditional Chinese (ISS-RC), using a five-step process, based on international test administration guidelines, involving judgmental, logical, and empirical methods. Both versions were administered to a convenience sample of Chinese-English fluent Hong Kong school community…

Descriptors: School Surveys, Measures (Individuals), Foreign Countries, Psychometrics

Educational Measurement Issues and Implications of High Stakes Decision Making in Final Examinations in Secondary Education in the Netherlands

Peer reviewed

Direct link

van Rijn, P. W.; Beguin, A. A.; Verstralen, H. H. F. M. – Assessment in Education: Principles, Policy & Practice, 2012

While measurement precision is relatively easy to establish for single tests and assessments, it is much more difficult to determine for decision making with multiple tests on different subjects. This latter is the situation in the system of final examinations for secondary education in the Netherlands and is used as an example in this paper. This…

Descriptors: Secondary Education, Tests, Foreign Countries, Decision Making

Examining the Quality of Statistical Mathematics Education Research

Peer reviewed

Direct link

Hill, Heather C.; Shih, Jeffrey – Journal for Research in Mathematics Education, 2009

This "Research Commentary" addresses the quality of statistical research in mathematics education. To do so, 10 years of Journal for Research in Mathematics Education (JRME) articles were analyzed on the basis of criteria suggested by the American Educational Research Association, American Psychological Association, and National Council for…

Descriptors: Mathematics Education, Educational Research, Statistical Surveys, Statistical Studies

A Bayesian Method for Studying DIF: A Cautionary Tale Filled with Surprises and Delights

Peer reviewed

Direct link

Wang, Xiaohui; Bradlow, Eric T.; Wainer, Howard; Muller, Eric S. – Journal of Educational and Behavioral Statistics, 2008

In the course of screening a form of a medical licensing exam for items that function differentially (DIF) between men and women, the authors used the traditional Mantel-Haenszel (MH) statistic for initial screening and a Bayesian method for deeper analysis. For very easy items, the MH statistic unexpectedly often found DIF where there was none.…

Descriptors: Bayesian Statistics, Licensing Examinations (Professions), Medicine, Test Items

Conflicting Findings in Mixed Methods Research: An Illustration from an Israeli Study on Immigration

Peer reviewed

Direct link

Slonim-Nevo, Vered; Nevo, Isaac – Journal of Mixed Methods Research, 2009

Combining diverse methods in a single study raises a problem: What should be done when the findings of one method of investigation conflict with those of another? The authors illustrate this problem using an example in which three study phases--quantitative, qualitative, and intervention--are applied. The findings from the quantitative phase did…

Descriptors: Methods Research, Immigration, Statistical Analysis, Qualitative Research

Instrument Development Procedures for Mathematics Measures. Technical Report Number 08-02

Download full text

Jung, Eunju; Liu, Kimy; Ketterlin-Geller, Leanne R.; Tindal, Gerald – Behavioral Research and Teaching, 2008

The purpose of this study was to develop general outcome measures (GOM) in mathematics so that teachers could focus their instruction on needed prerequisite skills. We describe in detail, the manner in which content-related evidence was established and then present a number of statistical analyses conducted to evaluate the technical adequacy of…

Descriptors: Item Analysis, Test Construction, Test Theory, Mathematics Tests

Test Reliability: A Practical Approach for the Teacher.

Peer reviewed

Kibblewhite, D. – Educational Studies, 1981

Describes a practical approach that teachers can use to check for test-item validity in test construction. The Kuder-Richardson Reliability Formula is used. Detailed instructions describe the procedure for evaluating items for difficulty and using statistical methods to determine test validity. (AM)

Descriptors: Elementary Secondary Education, Higher Education, Item Analysis, Statistical Analysis

Weigan: a FORTRAN IV Program for Weighted G Analysis.

Peer reviewed

Vegelius, Jan – Educational and Psychological Measurement, 1979

The computer program WEIGAN makes the weighted G analysis available for computer users. The input and output of the program are described. (Author/JKS)

Descriptors: Computer Programs, Correlation, Factor Analysis, Item Analysis

Examining Temporal Stability of Scale Validity in Longitudinal Studies

Peer reviewed

Direct link

Raykov, Tenko – Multivariate Behavioral Research, 2006

A method for examining invariance in validity of multiple-component instruments in repeated measure designs is outlined. The approach is developed within the framework of covariance structure modeling and is applicable for purposes of ascertaining temporal stability in scale validity. In addition, the procedure provides a range of plausible values…

Descriptors: Longitudinal Studies, Evaluation Methods, Test Validity, Item Analysis

Previous Page | Next Page »

Pages: 1 | 2

Applied Psychological…	1
Assessment in Education:…	1
Behavioral Research and…	1
Educational Measurement:…	1
Educational Studies	1
Educational and Psychological…	1
Journal for Research in…	1
Journal of Chemical Education	1
Journal of Educational and…	1
Journal of Invitational…	1
Journal of Mixed Methods…	1
Journal of Optometric…	1
Journal of Research in…	1
Measurement:…	1
Multivariate Behavioral…	1
Online Submission	1
Practical Assessment,…	1
Structural Equation Modeling:…	1
Turkish Online Journal of…	1
More ▼

Raykov, Tenko	2
Beaton, Albert E.	1
Beguin, A. A.	1
Bradlow, Eric T.	1
Burkett, Allan R.	1
Chase, Walter William	1
Cole, Ki	1
Coverdale, Bradley J.	1
Crovo, Mary L.	1
Davenport, Ernest C.	1
Davison, Mark L.	1
Gardner, P. L.	1
Hill, Heather C.	1
Jin, Ying	1
Jung, Eunju	1
Karadag, Nejdet	1
Ketterlin-Geller, Leanne R.	1
Kibblewhite, D.	1
Koçdar, Serpil	1
Liou, Pey-Yan	1
Liu, Kimy	1
Love, Quintin U.	1
Luxenberg, Harlan	1
Marcoulides, George A.	1
Muller, Eric S.	1
More ▼