ERIC - Search Results

Showing all 9 results

A Comparison of Latent Semantic Analysis and Latent Dirichlet Allocation in Educational Measurement

Peer reviewed

Jordan M. Wheeler; Allan S. Cohen; Shiyu Wang – Journal of Educational and Behavioral Statistics, 2024

Topic models are mathematical and statistical models used to analyze textual data. The objective of topic models is to gain information about the latent semantic space of a set of related textual data. The semantic space of a set of textual data contains the relationship between documents and words and how they are used. Topic models are becoming…

Descriptors: Semantics, Educational Assessment, Evaluators, Reliability

Technology Engineering Online Learner Metrics to Analyze Instructional Efficacy

Peer reviewed

Osler, James Edward; Mansaray, Mahmud A. – Journal of Educational Technology, 2013

The online deployment of Technology Engineered online Student Ratings of Instruction (SRIs) by colleges and universities in the United States has dynamically changed the deployment of course evaluation. This research investigation is the fourth part of a post hoc study that analytically and psychometrically examines the design, reliability, and…

Descriptors: Course Evaluation, Educational Technology, Black Colleges, Higher Education

Correction of Item-Test Correlations and Attempts at Improving Reproducibility in Item-Analysis: An Experimental Approach.

Peer reviewed

Melzer, Charles W.; And Others – Educational and Psychological Measurement, 1981

The magnitude of statistical bias for the phi-coefficient was investigated, using computer simulated examinations in which all the students had equal knowledge. Several modifications of phi were tested, but when applied to real examinations, none succeeded in improving its reproducibility when items are re-used on equivalent student groups.…

Descriptors: Correlation, Item Analysis, Mathematical Models, Multiple Choice Tests
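For readers unfamiliar with the phi-coefficient named in this abstract, the following is a minimal sketch of its standard definition for a 2x2 table. The function name and the cell layout (item correct/incorrect crossed with high/low total score) are illustrative assumptions, not details taken from the paper.

```python
import math


def phi_coefficient(a: int, b: int, c: int, d: int) -> float:
    """Phi for a 2x2 table of counts (hypothetical layout):

                      high total score   low total score
    item correct             a                  b
    item incorrect           c                  d
    """
    denom = math.sqrt((a + b) * (c + d) * (a + c) * (b + d))
    if denom == 0:
        # A margin is empty, so phi is undefined; returning 0.0 is a
        # convention chosen for this sketch only.
        return 0.0
    return (a * d - b * c) / denom
```

Under this layout, a positive value means the item tends to be answered correctly by examinees with high total scores.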

Accuracy of Estimating Two Parameter Logistic Latent Trait Parameters and Implications for Classroom Tests.

Kolen, Michael J.; Whitney, Douglas R. – 1978

The application of latent trait theory to classroom tests necessitates the use of small sample sizes for parameter estimation. Computer generated data were used to assess the accuracy of estimation of the slope and location parameters in the two parameter logistic model with fixed abilities and varying small sample sizes. The maximum likelihood…

Descriptors: Difficulty Level, Item Analysis, Latent Trait Theory, Mathematical Models
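The kind of computer-generated data this abstract describes can be illustrated with a small sketch of the two-parameter logistic model, using slope a and location b. The sample size, parameter values, and function names below are assumptions made for illustration; the study's actual design is not given in the truncated abstract.

```python
import math
import random


def p_correct_2pl(theta: float, a: float, b: float) -> float:
    """Probability of a correct response under the 2PL model."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))


def simulate_item(thetas, a, b, rng):
    """Draw one 0/1 response per examinee for a single item."""
    return [1 if rng.random() < p_correct_2pl(t, a, b) else 0 for t in thetas]


# Hypothetical small classroom sample: 30 fixed, evenly spaced abilities.
rng = random.Random(0)
thetas = [-2.0 + 4.0 * i / 29 for i in range(30)]
responses = simulate_item(thetas, a=1.2, b=0.0, rng=rng)
```

Repeating the simulation over many replications and re-estimating a and b each time is one way to gauge estimation accuracy at a given sample size.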

A Comparison of the One- and Three-Parameter Logistic Models for Item Calibration.

Reckase, Mark D. – 1978

Five comparisons were made relative to the quality of estimates of ability parameters and item calibrations obtained from the one-parameter and three-parameter logistic models. The results indicate: (1) The three-parameter model fit the test data better in all cases than did the one-parameter model. For simulation data sets, multi-factor data were…

Descriptors: Comparative Analysis, Goodness of Fit, Item Analysis, Mathematical Models

Item Reliabilities for a Family of Answer-Until-Correct (AUC) Scoring Rules.

Kane, Michael T.; Moloney, James M. – 1976

The Answer-Until-Correct (AUC) procedure has been proposed in order to increase the reliability of multiple-choice items. A model for examinees' behavior when they must respond to each item until they answer it correctly is presented. An expression for the reliability of AUC items, as a function of the characteristics of the item and the scoring…

Descriptors: Guessing (Tests), Item Analysis, Mathematical Models, Multiple Choice Tests

A Process for Testing a Mathematical Model for the Solution of a Practical Problem: Applications to Test Equating.

Douglass, James B. – 1979

A general process for testing the feasibility of applying alternative mathematical or statistical models to the solution of a practical problem is presented and flowcharted. The system is used to compare five models for test equating: (1) anchor test equating using classical test theory; (2) anchor test equating using the one-parameter logistic…

Descriptors: Comparative Analysis, Equated Scores, Flow Charts, Goodness of Fit

The Rasch Model for Dichotomous Items: Theory, Applications and a Computer Program. No. 63.

Gustafsson, Jan-Eric – 1977

The Rasch model for test analysis is described and compared with two-parameter and three-parameter latent-trait models. Conditional maximum likelihood equations for estimating item parameters are derived, and estimates of person parameters are described together with their confidence intervals. Goodness of fit tests are discussed, including a…

Descriptors: Adaptive Testing, Computer Programs, Equated Scores, Error of Measurement
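As background for the comparison this abstract mentions, the three models are usually written as below. The notation is the common textbook form and is an assumption here, not taken from the report: θ_j is the ability of person j, b_i the difficulty of item i, a_i its discrimination, and c_i its lower asymptote (pseudo-guessing parameter).

```latex
% Rasch (one-parameter) model
P(X_{ij} = 1 \mid \theta_j) = \frac{\exp(\theta_j - b_i)}{1 + \exp(\theta_j - b_i)}

% Two-parameter logistic model
P(X_{ij} = 1 \mid \theta_j) = \frac{\exp\bigl(a_i(\theta_j - b_i)\bigr)}{1 + \exp\bigl(a_i(\theta_j - b_i)\bigr)}

% Three-parameter logistic model
P(X_{ij} = 1 \mid \theta_j) = c_i + (1 - c_i)\,\frac{\exp\bigl(a_i(\theta_j - b_i)\bigr)}{1 + \exp\bigl(a_i(\theta_j - b_i)\bigr)}
```

Setting c_i = 0 in the three-parameter model gives the two-parameter model, and additionally fixing a_i = 1 gives the Rasch model.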

Investigating Item Stability: An Empirical Investigation into the Variability of Item Statistics Under Conditions of Varying Sample Design and Sample Size. Occasional Paper No. 18.

Farish, Stephen J. – 1984

The stability of Rasch test item difficulty parameters was investigated under varying conditions. Data were taken from a mathematics achievement test administered to over 2,000 Australian students. The experiments included: (1) relative stability of the Rasch, traditional, and z-item difficulty parameters using different sample sizes and designs;…

Descriptors: Achievement Tests, Difficulty Level, Estimation (Mathematics), Foreign Countries