Jordan M. Wheeler; Allan S. Cohen; Shiyu Wang – Journal of Educational and Behavioral Statistics, 2024
Topic models are mathematical and statistical models used to analyze textual data. The objective of topic models is to gain information about the latent semantic space of a set of related textual data. The semantic space of a set of textual data contains the relationship between documents and words and how they are used. Topic models are becoming…
Descriptors: Semantics, Educational Assessment, Evaluators, Reliability
Osler, James Edward; Mansaray, Mahmud A. – Journal of Educational Technology, 2013
The online deployment of Technology Engineered online Student Ratings of Instruction (SRIs) by colleges and universities in the United States has dynamically changed the deployment of course evaluation. This research investigation is the fourth part of a post hoc study that analytically and psychometrically examines the design, reliability, and…
Descriptors: Course Evaluation, Educational Technology, Black Colleges, Higher Education

Melzer, Charles W.; And Others – Educational and Psychological Measurement, 1981
The magnitude of statistical bias for the phi-coefficient was investigated, using computer simulated examinations in which all the students had equal knowledge. Several modifications of phi were tested, but when applied to real examinations, none succeeded in improving its reproducibility when items are re-used on equivalent student groups.…
Descriptors: Correlation, Item Analysis, Mathematical Models, Multiple Choice Tests
Kolen, Michael J.; Whitney, Douglas R. – 1978
The application of latent trait theory to classroom tests necessitates the use of small sample sizes for parameter estimation. Computer generated data were used to assess the accuracy of estimation of the slope and location parameters in the two parameter logistic model with fixed abilities and varying small sample sizes. The maximum likelihood…
Descriptors: Difficulty Level, Item Analysis, Latent Trait Theory, Mathematical Models
Reckase, Mark D. – 1978
Five comparisons were made relative to the quality of estimates of ability parameters and item calibrations obtained from the one-parameter and three-parameter logistic models. The results indicate: (1) The three-parameter model fit the test data better in all cases than did the one-parameter model. For simulation data sets, multi-factor data were…
Descriptors: Comparative Analysis, Goodness of Fit, Item Analysis, Mathematical Models

Kane, Michael T.; Moloney, James M. – 1976
The Answer-Until-Correct (AUC) procedure has been proposed in order to increase the reliability of multiple-choice items. A model for examinees' behavior when they must respond to each item until they answer it correctly is presented. An expression for the reliability of AUC items, as a function of the characteristics of the item and the scoring…
Descriptors: Guessing (Tests), Item Analysis, Mathematical Models, Multiple Choice Tests
Douglass, James B. – 1979
A general process for testing the feasibility of applying alternative mathematical or statistical models to the solution of a practical problem is presented and flowcharted. The system is used to compare five models for test equating: (1) anchor test equating using classical test theory; (2) anchor test equating using the one-parameter logistic…
Descriptors: Comparative Analysis, Equated Scores, Flow Charts, Goodness of Fit
Gustafsson, Jan-Eric – 1977
The Rasch model for test analysis is described and compared with two-parameter and three-parameter latent-trait models. Conditional maximum likelihood equations for estimating item parameters are derived, and estimates of person parameters are described together with their confidence intervals. Goodness of fit tests are discussed, including a…
Descriptors: Adaptive Testing, Computer Programs, Equated Scores, Error of Measurement
Farish, Stephen J. – 1984
The stability of Rasch test item difficulty parameters was investigated under varying conditions. Data were taken from a mathematics achievement test administered to over 2,000 Australian students. The experiments included: (1) relative stability of the Rasch, traditional, and z-item difficulty parameters using different sample sizes and designs;…
Descriptors: Achievement Tests, Difficulty Level, Estimation (Mathematics), Foreign Countries