ERIC - Search Results

Descriptor

Mathematical Models	8
Statistical Distributions	8
Estimation (Mathematics)	4
Equations (Mathematics)	3
Item Response Theory	3
Comparative Analysis	2
Equated Scores	2
Higher Education	2
Item Bias	2
Multiple Choice Tests	2
Scores	2
Test Items	2
Achievement Tests	1
Advanced Placement Programs	1
Answer Keys	1
Black Students	1
Cheating	1
College Students	1
Computer Assisted Testing	1
Computer Simulation	1
Critical Incidents Method	1
Goodness of Fit	1
Graphs	1
High School Students	1
High Schools	1
More ▼

Source

Journal of Educational…

Author

Cohen, Allan S.	1
Kane, Michael T.	1
Kolen, Michael J.	1
Livingston, Samuel A.	1
Roberts, Dennis M.	1
Rogers, H. Jane	1
Stone, Clement A.	1
Swaminathan, Hariharan	1
Yen, Wendy M.	1

Publication Type

Journal Articles	8
Reports - Research	4
Reports - Evaluative	3
Reports - Descriptive	1

Education Level

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

Advanced Placement…

What Works Clearinghouse Rating

Showing all 8 results Save | Export

The Choice of Scale for Educational Measurement: An IRT Perspective.

Peer reviewed

Yen, Wendy M. – Journal of Educational Measurement, 1986

Two methods of constucting equal-interval scales for educational achievement are discussed: Thurstone's absolute scaling method and Item Response Theory. Alternative criteria for choosing a scale are contrasted. It is argued that clearer criteria are needed for judging the appropriateness and usefulness of alternative scaling procedures.…

Descriptors: Achievement Tests, Latent Trait Theory, Mathematical Models, Scaling

Monte Carlo Based Null Distribution for an Alternative Goodness-of-Fit Test Statistic in IRT Models.

Peer reviewed

Stone, Clement A. – Journal of Educational Measurement, 2000

Describes a goodness-of-fit statistic that considers the imprecision with which ability is estimated and involves constructing item fit tables based on each examinee's posterior distribution of ability, given the likelihood of the response pattern and an assumed marginal ability distribution. Also describes a Monte Carlo resampling procedure to…

Descriptors: Goodness of Fit, Item Response Theory, Mathematical Models, Monte Carlo Methods

Smoothing Methods for Estimating Test Score Distributions.

Peer reviewed

Kolen, Michael J. – Journal of Educational Measurement, 1991

Estimation/smoothing methods that are flexible enough to fit a wide variety of test score distributions are reviewed: kernel method, strong true-score model-based method, and method that uses polynomial log-linear models. Applications of these methods include describing/comparing test score distributions, estimating norms, and estimating…

Descriptors: Comparative Analysis, Equated Scores, Equations (Mathematics), Estimation (Mathematics)

Limitations of the Score-Difference Method in Detecting Cheating in Recognition Test Situations.

Peer reviewed

Roberts, Dennis M. – Journal of Educational Measurement, 1987

This study examines a score-difference model for the detection of cheating based on the difference between two scores for an examinee: one based on the appropriate scoring key and another based on an alternative, inappropriate key. It argues that the score-difference method could falsely accuse students as cheaters. (Author/JAZ)

Descriptors: Answer Keys, Cheating, Mathematical Models, Multiple Choice Tests

Combining Data on Criticality and Frequency in Developing Test Plans for Licensure and Certification Examinations.

Peer reviewed

Kane, Michael T.; And Others – Journal of Educational Measurement, 1989

This paper develops a multiplicative model as a means of combining ratings of criticality and frequency of various activities involved in job analyses. The model incorporates adjustments to ensure that effective weights of criticality and frequency are appropriate. An example of the model's use is presented. (TJH)

Descriptors: Critical Incidents Method, Higher Education, Job Analysis, Licensing Examinations (Professions)

Small-Sample Equating with Log-Linear Smoothing.

Peer reviewed

Livingston, Samuel A. – Journal of Educational Measurement, 1993

The extent to which log-linear smoothing could improve the accuracy of common-item equating by the chained equipercentile method in small samples of examinees was investigated with responses from a 100-item test and 93,283 examinees. Smoothing reduced the sample size required for a given degree of accuracy. (SLD)

Descriptors: Advanced Placement Programs, Equated Scores, Estimation (Mathematics), High School Students

Detecting Differential Item Functioning Using Logistic Regression Procedures.

Peer reviewed

Swaminathan, Hariharan; Rogers, H. Jane – Journal of Educational Measurement, 1990

A logistic regression model for characterizing differential item functioning (DIF) between two groups is presented. A distinction is drawn between uniform and nonuniform DIF in terms of model parameters. A statistic for testing the hypotheses of no DIF is developed, and simulation studies compare it with the Mantel-Haenszel procedure. (Author/TJH)

Descriptors: Comparative Analysis, Computer Simulation, Equations (Mathematics), Estimation (Mathematics)

Influence of Prior Distributions on Detection of DIF.

Peer reviewed

Cohen, Allan S.; And Others – Journal of Educational Measurement, 1991

Detecting differential item functioning (DIF) on test items constructed to favor 1 group over another was investigated on parameter estimates from 2 item response theory-based computer programs--BILOG and LOGIST--using data for 1,000 White and 1,000 Black college students. Use of prior distributions and marginal-maximum a posteriori estimation is…

Descriptors: Black Students, College Students, Computer Assisted Testing, Equations (Mathematics)