NotesFAQContact Us
Collection
Advanced
Search Tips
Showing 5,791 to 5,805 of 9,547 results Save | Export
Peer reviewed Peer reviewed
Rost, Jurgen; von Davier, Matthias – Applied Psychological Measurement, 1994
A new item-fit index is proposed that is both a descriptive measure of deviance of single items and an index for statistical inference. This index is based on assumptions of the dichotomous and polytomous Rasch models for items with ordered categories. A simulation study is described. (SLD)
Descriptors: Equations (Mathematics), Goodness of Fit, Item Response Theory, Simulation
Peer reviewed Peer reviewed
Swatton, P. – Educational Review, 1995
Analysis of a sample of 100 pupil scripts from a British Assessment of Performance Unit survey suggests that ability to read and interpret graphical data should not be seen as an independent process skill. Instead, hidden cognitive demands of assessment items require integrated understanding based on a sophisticated procedural model. (SK)
Descriptors: Cognitive Processes, Data Interpretation, Foreign Countries, Graphs
Peer reviewed Peer reviewed
Wainer, Howard – Applied Measurement in Education, 1995
Analysis of the 1991 Law School Admission Test (LSAT) shows that the testlet structure of the reading comprehension and analytic reasoning sections has a significant effect on the statistical characteristics of the test. The testlet-based reliability of these two sections is lower than had been previously calculated. (SLD)
Descriptors: Admission (School), Item Bias, Law Schools, Psychometrics
Peer reviewed Peer reviewed
Reuterberg, Sven-Eric; Gustafsson, Jan-Eric – Educational and Psychological Measurement, 1992
The use of confirmatory factor analysis by the LISREL program is demonstrated as an assumption-testing method when computing reliability coefficients under different model assumptions. Results indicate that reliability estimates are robust against departure from the assumption of parallelism of test items. (SLD)
Descriptors: Equations (Mathematics), Estimation (Mathematics), Mathematical Models, Robustness (Statistics)
Peer reviewed Peer reviewed
Carlson, J. Lon; Ostrosky, Anthony L. – Journal of Economic Education, 1992
Discusses effects of test question order on student performance. Addresses (1) differences in the distribution of scores on each form of the examination; (2) effects on the validity of individual examination items; and (3) effects on the reliability of the examination instrument. Concludes that distribution of examination scores may be influenced…
Descriptors: Economics Education, Evaluation Research, Higher Education, Multiple Choice Tests
Mandula, Barbara – AWIS Newsletter, 1990
Reviewed is the research of Dr. Phyllis Rosser and the comments made by several speakers at a hearing on gender bias in testing held in October 1989. Discussed are historical perspectives, the gender gap in testing, and possible explanations for this gap. Sample test items are provided. (CW)
Descriptors: Females, Higher Education, Sex Differences, Sex Fairness
Hughes, Charles A.; And Others – Diagnostique, 1991
One hundred seventh and tenth grade tests across several content areas were examined for the presence of six types of test-wiseness cues. Approximately 75 percent of teacher-made and publisher-provided tests contained one or more cued items. The most frequent type of cue was length of option, followed by specific determiners. (Author/JDD)
Descriptors: Cues, Incidence, Secondary Education, Teacher Made Tests
Peer reviewed Peer reviewed
Ratcliff, Roger; And Others – Psychological Review, 1992
Four global memory models were evaluated in 3 recognition memory experiments with 30 college students. Experiments provide receiver operating characteristic (ROC) curves. Data give a clear idea of the behavior of signal and noise distributions in recognition memory. Ways in which results support revision of current models are discussed. (SLD)
Descriptors: College Students, Estimation (Mathematics), Higher Education, Mathematical Models
Peer reviewed Peer reviewed
Smith, Richard M. – Educational and Psychological Measurement, 1994
Rasch model total-fit statistics and between-item fit statistics were compared for their ability to detect measurement disturbances through the use of simulated data. Results indicate that the between-fit statistic appears more sensitive to systematic measurement disturbances and the total-fit statistic is more sensitive to random measurement…
Descriptors: Comparative Analysis, Goodness of Fit, Item Response Theory, Measurement Techniques
Peer reviewed Peer reviewed
Hoijtink, Herbert – Applied Psychological Measurement, 1991
A probabilistic parallelogram model (the PARELLA model) is presented for the measurement of latent traits by proximity items. This unidimensional model assumes that the responses of persons to items result from proximity relations. The model is illustrated in an analysis of three empirical datasets from previous studies. (SLD)
Descriptors: Computer Simulation, Equations (Mathematics), Estimation (Mathematics), Item Response Theory
Peer reviewed Peer reviewed
Haladyna, Thomas M. – Educational Technology, Research and Development, 1991
Discusses the link between testing and the content of instruction and proposes a method for generating large numbers of objective-relevant test items for instructional programs in high schools, higher education, and training situations. Higher level thinking outcomes are discussed, item sets are described, and generic scenarios and questioning are…
Descriptors: Higher Education, Instructional Design, Questioning Techniques, Secondary Education
Rigol, Gretchen W. – College Board Review, 1991
The College Entrance Examination Board has not permitted calculator use on the Scholastic Aptitude Test because of unresolved concerns about equity, implications for test content, and logistical and security issues. Those issues no longer seem insurmountable, and significant changes are being introduced on many tests. (MSE)
Descriptors: Calculators, Cheating, College Entrance Examinations, Higher Education
Peer reviewed Peer reviewed
Clauser, Brian; And Others – Journal of Educational Measurement, 1994
The effect of reducing the number of score groups in the matching criterion of the Mantel-Haenszel procedure when screening for differential item functioning was investigated with a simulated data set. Results suggest that more than modest reductions cannot be recommended when ability distributions of reference and focal groups differ. (SLD)
Descriptors: Ability, Experimental Groups, Item Bias, Reference Groups
Peer reviewed Peer reviewed
Huynh, Huynh – Psychometrika, 1994
Given a Masters partial credit item with n known step difficulties, conditions are stated for the existence of a set of (locally) independent Rasch binary items such that their raw score and the partial credit raw score have identical probability density functions. (Author/SLD)
Descriptors: Equations (Mathematics), Item Response Theory, Performance Based Assessment, Probability
Peer reviewed Peer reviewed
Stocking, Martha L.; And Others – Applied Psychological Measurement, 1993
A method of automatically selecting items for inclusion in a test with constraints on item content and statistical properties was applied to real data. Tests constructed manually from the same data and constraints were compared to tests constructed automatically. Results show areas in which automated assembly can improve test construction. (SLD)
Descriptors: Algorithms, Automation, Comparative Testing, Computer Assisted Testing
Pages: 1  |  ...  |  383  |  384  |  385  |  386  |  387  |  388  |  389  |  390  |  391  |  ...  |  637