NotesFAQContact Us
Collection
Advanced
Search Tips
Publication Date
In 20260
Since 20250
Since 2022 (last 5 years)0
Since 2017 (last 10 years)1
Since 2007 (last 20 years)5
Audience
Location
Australia1
Laws, Policies, & Programs
What Works Clearinghouse Rating
Showing 1 to 15 of 21 results Save | Export
Wagemaker, Hans, Ed. – International Association for the Evaluation of Educational Achievement, 2020
Although International Association for the Evaluation of Educational Achievement-pioneered international large-scale assessment (ILSA) of education is now a well-established science, non-practitioners and many users often substantially misunderstand how large-scale assessments are conducted, what questions and challenges they are designed to…
Descriptors: International Assessment, Achievement Tests, Educational Assessment, Comparative Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
Chen, Pei-Hua; Chang, Hua-Hua; Wu, Haiyan – Educational and Psychological Measurement, 2012
Two sampling-and-classification-based procedures were developed for automated test assembly: the Cell Only and the Cell and Cube methods. A simulation study based on a 540-item bank was conducted to compare the performance of the procedures with the performance of a mixed-integer programming (MIP) method for assembling multiple parallel test…
Descriptors: Test Items, Selection, Test Construction, Item Response Theory
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Kim, Sooyeon; Livingston, Samuel A. – ETS Research Report Series, 2009
A series of resampling studies was conducted to compare the accuracy of equating in a common item design using four different methods: chained equipercentile equating of smoothed distributions, chained linear equating, chained mean equating, and the circle-arc method. Four operational test forms, each containing more than 100 items, were used for…
Descriptors: Sampling, Sample Size, Accuracy, Test Items
Shoemaker, David M. – 1972
Investigated empirically through post mortem item-examinee sampling were the relative merits of two alternative procedures for allocating items to subtests in multiple matrix sampling and the feasibility of using the jackknife in approximating standard errors of estimate. The results indicate clearly that a partially balanced incomplete block…
Descriptors: Comparative Analysis, Error Patterns, Item Sampling, Matrices
Peer reviewed Peer reviewed
Johansson, Charles B. – Journal of Counseling Psychology, 1975
Six in-general samples have been generated to fit different developments of the SVIB, Twenty experimental homogeneous scales were used to measure the similarities and differences among the six in-general samples. Generally, all samples were strikingly similar with the greatest differences appearing between male and female in-general samples.…
Descriptors: Comparative Analysis, Interest Inventories, Item Sampling, Research Projects
Peer reviewed Peer reviewed
Hsu, Louis M. – Educational and Psychological Measurement, 1980
A method based on the Poisson approximation to the binomial distribution and on the relation between the Chi-Squared distribution and the Poisson distribution is suggested for selected use in determining the number of items and passing scores in mastery Lests. (Author/RL)
Descriptors: Comparative Analysis, Cutting Scores, Item Sampling, Mastery Tests
Peer reviewed Peer reviewed
Taylor, Annette Kujawski – College Student Journal, 2005
This research examined 2 elements of multiple-choice test construction, balancing the key and optimal number of options. In Experiment 1 the 3 conditions included a balanced key, overrepresentation of a and b responses, and overrepresentation of c and d responses. The results showed that error-patterns were independent of the key, reflecting…
Descriptors: Comparative Analysis, Test Items, Multiple Choice Tests, Test Construction
van den Brink, Wulfert – Evaluation in Education: International Progress, 1982
Binomial models for domain-referenced testing are compared, emphasizing the assumptions underlying the beta-binomial model. Advantages and disadvantages are discussed. A proposed item sampling model is presented which takes the effect of guessing into account. (Author/CM)
Descriptors: Comparative Analysis, Criterion Referenced Tests, Item Sampling, Measurement Techniques
Peer reviewed Peer reviewed
Levin, Joel R. – Journal of Educational Measurement, 1975
A set procedure developed in this study is useful in determining sample size, based on specification of linear contrasts involving certain formula treatments. (Author/DEP)
Descriptors: Analysis of Variance, Comparative Analysis, Mathematical Models, Measurement Techniques
Peer reviewed Peer reviewed
Jenkins, Joseph R.; And Others – Journal of Educational Measurement, 1972
Study investigated (1) whether there is consensus among test writers in identification of important segments of a prose passage, and (2) the characteristics of the prose segments chosen as important. (Authors/MB)
Descriptors: Comparative Analysis, Elementary School Teachers, Item Analysis, Sampling
Sullins, Walter L. – 1971
Five-hundred dichotomously scored response patterns were generated with sequentially independent (SI) items and 500 with dependent (SD) items for each of thirty-six combinations of sampling parameters (i.e., three test lengths, three sample sizes, and four item difficulty distributions). KR-20, KR-21, and Split-Half (S-H) reliabilities were…
Descriptors: Comparative Analysis, Correlation, Error of Measurement, Item Analysis
Douglass, James B. – 1979
A general process for testing the feasibility of applying alternative mathematical or statistical models to the solution of a practical problem is presented and flowcharted. The system is used to compare five models for test equating: (1) anchor test equating using classical test theory; (2) anchor test equating using the one-parameter logistic…
Descriptors: Comparative Analysis, Equated Scores, Flow Charts, Goodness of Fit
Brown, James Dean – 1983
This study attempted to determine the effectiveness of cloze procedures as norm-referenced instruments by comparing the differential responses of four groups of college students of English as a second language on two identical cloze passages. The responses were scored using both exact-answer and acceptable-word methods. The results indicate that…
Descriptors: Cloze Procedure, College Students, Comparative Analysis, English (Second Language)
Swezey, Robert W.; Pearlstein, Richard B. – 1975
This manual outlines the rationale for using the Criterion Referenced Test (CRT) approach and suggests specific guidelines for test developers to use in constructing test items. Methods for assessing the adequacy of a CRT are also provided. (Author/RC)
Descriptors: Behavioral Objectives, Check Lists, Comparative Analysis, Criterion Referenced Tests
Haladyna, Tom – 1976
The existence of criterion-referenced (CR) measurement is questioned in this paper. Despite beliefs that differences exist between two alternative forms of measurement, CR and Norm Referenced (NR), an analysis of philosophical and psychological descriptions of measurement, as well as a growing number of empirical studies, reveal that the common…
Descriptors: Academic Standards, Achievement Tests, Career Development, Comparative Analysis
Previous Page | Next Page ยป
Pages: 1  |  2