Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 3 |
Since 2006 (last 20 years) | 3 |
Descriptor
Test Length | 28 |
Testing Problems | 28 |
Test Construction | 10 |
Test Items | 9 |
Test Reliability | 7 |
Test Validity | 7 |
Higher Education | 6 |
Multiple Choice Tests | 6 |
Test Format | 6 |
Timed Tests | 6 |
Item Analysis | 5 |
More ▼ |
Source
Educational and Psychological… | 4 |
Journal of Educational… | 2 |
Applied Measurement in… | 1 |
Applied Psychological… | 1 |
Behavioral Research and… | 1 |
Participatory Educational… | 1 |
Psychological Assessment | 1 |
Science Education… | 1 |
Author
Wilcox, Rand R. | 2 |
Alonzo, Julie | 1 |
Axelrod, Bradley N. | 1 |
Bay, Luz | 1 |
Boyd, Thomas A. | 1 |
Budescu, David | 1 |
Coats, Pamela K. | 1 |
Conger, Anthony J. | 1 |
Gallucci, Nicholas T. | 1 |
Gilmer, Jerry S. | 1 |
Hambleton, Ronald K. | 1 |
More ▼ |
Publication Type
Reports - Research | 28 |
Journal Articles | 11 |
Speeches/Meeting Papers | 8 |
Information Analyses | 2 |
Numerical/Quantitative Data | 2 |
Tests/Questionnaires | 1 |
Education Level
Early Childhood Education | 1 |
Elementary Education | 1 |
Grade 2 | 1 |
Grade 3 | 1 |
Grade 4 | 1 |
Intermediate Grades | 1 |
Primary Education | 1 |
Audience
Researchers | 4 |
Location
New Jersey | 1 |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Uysal, Ibrahim; Sahin-Kürsad, Merve; Kiliç, Abdullah Faruk – Participatory Educational Research, 2022
The aim of the study was to examine the common items in the mixed format (e.g., multiple-choices and essay items) contain parameter drifts in the test equating processes performed with the common item nonequivalent groups design. In this study, which was carried out using Monte Carlo simulation with a fully crossed design, the factors of test…
Descriptors: Test Items, Test Format, Item Response Theory, Equated Scores
Sinharay, Sandip – Applied Measurement in Education, 2017
Karabatsos compared the power of 36 person-fit statistics using receiver operating characteristics curves and found the "H[superscript T]" statistic to be the most powerful in identifying aberrant examinees. He found three statistics, "C", "MCI", and "U3", to be the next most powerful. These four statistics,…
Descriptors: Nonparametric Statistics, Goodness of Fit, Simulation, Comparative Analysis
Kahn, Josh; Nese, Joseph T.; Alonzo, Julie – Behavioral Research and Teaching, 2016
There is strong theoretical support for oral reading fluency (ORF) as an essential building block of reading proficiency. The current and standard ORF assessment procedure requires that students read aloud a grade-level passage (˜ 250 words) in a one-to-one administration, with the number of words read correctly in 60 seconds constituting their…
Descriptors: Teacher Surveys, Oral Reading, Reading Tests, Computer Assisted Testing
Bay, Luz – 1995
An index is proposed to detect cheating on multiple-choice examinations, and its use is evaluated through simulations. The proposed index is based on the compound binomial distribution. In total, 360 simulated data sets reflecting 12 different cheating (copying) situations were obtained and used for the study of the sensitivity of the index in…
Descriptors: Cheating, Class Size, Identification, Multiple Choice Tests

Conger, Anthony J. – Educational and Psychological Measurement, 1983
A paradoxical phenomenon of decreases in reliability as the number of elements averaged over increases is shown to be possible in multifacet reliability procedures (intraclass correlations or generalizability coefficients). Conditions governing this phenomenon are presented along with implications and cautions. (Author)
Descriptors: Generalizability Theory, Test Construction, Test Items, Test Length

Gallucci, Nicholas T. – Educational and Psychological Measurement, 1986
This study evaluated the degree to which 102 undergraduate participants objected to questions on the Minnesota Multiphasic Personality Inventory (MMPI) which referred to sex, religion, bladder and bowel functions, family relationships, and unusual thinking in comparision to degree of objection to length of the MMPI and repetition of questions.…
Descriptors: College Students, Higher Education, Personality Measures, Psychological Evaluation

Sindhu, R. S.; Sharma, Reeta – Science Education International, 1999
Finds that the time required to attempt all the test items of each question paper in a four-paper sample was inversely proportional to the percentage of students who attempted all the test items of that paper. Extrapolates results to give guidelines for determining the feasibility of newly-developed exam papers. (WRM)
Descriptors: Science Tests, Secondary Education, Test Construction, Test Length

Woodard, John L.; Axelrod, Bradley N. – Psychological Assessment, 1995
Using 308 patients referred for neuropsychological evaluation, 2 regression equations were developed to predict weighted raw score sums for General Memory and Delayed Recall using the Wechsler Memory Scale-Revised (WMS-R) analogs of 5 subtests from the original WMS. The equations may help reduce WMS-R administration time. (SLD)
Descriptors: Equations (Mathematics), Memory, Neuropsychology, Patients

Budescu, David – Journal of Educational Measurement, 1985
An important determinant of equating process efficiency is the correlation between the anchor test and components of each form. Use of some monotonic function of this correlation as a measure of equating efficiency is suggested. A model relating anchor test length and test reliability to this measure of efficiency is presented. (Author/DWH)
Descriptors: Correlation, Equated Scores, Mathematical Models, Standardized Tests

Wilcox, Rand R. – Educational and Psychological Measurement, 1979
A problem of considerable importance in certain educational settings is determining how many items to include on a mastery test. Applying ranking and selection procedures, a solution is given which includes as a special case all existing single-stage, non-Bayesian solutions based on a strong true-score model. (Author/JKS)
Descriptors: Criterion Referenced Tests, Mastery Tests, Nonparametric Statistics, Probability
Kennedy, Rob – 1994
The purpose of this study was to investigate the relationship between the scores students earned on multiple choice tests and the number of minutes students required to complete the tests. The 5 tests were made up of 20 randomly drawn questions from a large pool of questions about research methods. Students were allowed an unlimited amount of time…
Descriptors: Graduate Students, Graduate Study, Higher Education, Multiple Choice Tests

Roberts, Dennis M. – Journal of Educational Measurement, 1987
This study examines a score-difference model for the detection of cheating based on the difference between two scores for an examinee: one based on the appropriate scoring key and another based on an alternative, inappropriate key. It argues that the score-difference method could falsely accuse students as cheaters. (Author/JAZ)
Descriptors: Answer Keys, Cheating, Mathematical Models, Multiple Choice Tests

Hattie, John – Educational and Psychological Measurement, 1983
It is argued that the tendency to omit items is a deviant response characteristic. Three studies using a self-actualization measure are outlined. Persons who omitted items did so because of fatigue, confusion with some items, unpreparedness to disclose information, and/or because they may not trust the researcher with certain information.…
Descriptors: Adults, Fatigue (Biology), Individual Differences, Personality Measures

Reckase, Mark D. – 1979
Because latent trait models require that large numbers of items be calibrated or that testing of the same large group be repeated, item parameter estimates are often obtained by administering separate tests to different groups and "linking" the results to construct an adequate item pool. Four issues were studied, based upon the analysis…
Descriptors: Achievement Tests, High Schools, Item Banks, Mathematical Models

Kafry, Ditsa; And Others – Applied Psychological Measurement, 1979
A series of behavioral expectation scale applications were analyzed in an attempt to point out an appropriate number of dimensions to be included in such studies. Results reflected the problems of dimension interdependence when the number of dimensions exceeds nine. (Author/JKS)
Descriptors: Behavior Rating Scales, Expectation, Factor Analysis, Higher Education
Previous Page | Next Page »
Pages: 1 | 2