NotesFAQContact Us
Collection
Advanced
Search Tips
Showing 6,991 to 7,005 of 9,530 results Save | Export
Peer reviewed Peer reviewed
Wolf, Lisa F.; And Others – Applied Measurement in Education, 1995
The relationship between characteristics of test takers and characteristics of test items was examined in a quasiexperimental study involving 301 high school juniors and sophomores taking a mathematics examination that was consequential to sophomores but not juniors. Results are interpreted in relation to the expectancy value model of motivation.…
Descriptors: Difficulty Level, Grade 10, Grade 11, High School Students
Peer reviewed Peer reviewed
Snow, Catherine E.; And Others – Journal of Research in Childhood Education, 1995
Reports on a battery of oral language and early literacy tests, called the SHELL. Describes the tests, presents tasks and scoring system, and provides information about performance by participants in the Home-School Study of Language and Literacy Development. Descriptive, correlational, and predictive analyses based on SHELL-K (kindergarten) and…
Descriptors: Early Childhood Education, Emergent Literacy, Grade 1, Kindergarten
Peer reviewed Peer reviewed
Perkins, Kyle; And Others – Language Testing, 1995
This article reports the results of using a three-layer back propagation artificial neural network to predict item difficulty in a reading comprehension test. Three classes of variables were examined: text structure, propositional analysis, and cognitive demand. Results demonstrate that the networks can consistently predict item difficulty. (JL)
Descriptors: Artificial Intelligence, Difficulty Level, English (Second Language), Language Tests
Peer reviewed Peer reviewed
Powers, Donald E.; Leung, Susan Wilson – Journal of Educational Measurement, 1995
Test-taking strategies that examinees may use without reading the passages on which reading comprehension questions are based, similar to those of the new Scholastic Assessment Test, were studied with 350 high school juniors. Strategies most often used involved choosing answers based on consistency and reconstructing main themes from other…
Descriptors: College Entrance Examinations, Decision Making, Grade 11, High School Students
Peer reviewed Peer reviewed
Ryan, Katherine E.; Bachman, Lyle F. – Language Testing, 1992
The extent to which items from the Test of English as a Foreign Language and the First Certificate in English function differently for test-takers of equal ability from different native language and curricular backgrounds was investigated. Results suggest a need for methods like logistic regression to examine nonuniform differential item…
Descriptors: Comparative Analysis, English (Second Language), Language Acquisition, Language Tests
Peer reviewed Peer reviewed
DeMauro, G. – Language Testing, 1992
Several analyses are presented on the relationships among the Test of Spoken English, Test of Written English, and Test of English as a Foreign Language. The multivariate prediction of each test from the scores on the others is very accurate; variances with two prominent factors may relate to specific cognitive test-taking skills. (eight…
Descriptors: Comparative Analysis, Language Research, Language Skills, Language Tests
Peer reviewed Peer reviewed
Harasym, P. H.; And Others – Evaluation and the Health Professions, 1992
Findings from a study with approximately 200 first-year University of Calgary (Canada) nursing students provide evidence that the use of negation (e.g., not, except) should be limited in stems of multiple-choice test items and that a single-response negatively worded item should be converted to a multiple-response positively worded item. (SLD)
Descriptors: College Students, Foreign Countries, Higher Education, Multiple Choice Tests
Peer reviewed Peer reviewed
McMurray, Mary Anne; And Others – Journal of Research in Science Teaching, 1991
Reports a study investigating the utility of 52 items, selected from a readily available item pool developed for instructional purposes, when the items are used to measure critical thinking abilities of biology students. The items had reasonably good internal consistency reliability and good concurrent validity. (PR)
Descriptors: Biology, College Science, Critical Thinking, Educational Research
Peer reviewed Peer reviewed
Hoijtink, Herbert; Molenaar, Ivo W. – Psychometrika, 1992
The PARallELogram Analysis (PARELLA) model is a probabilistic parallelogram model that can be used for the measurement of latent attitudes or latent preferences. A method is presented for testing for differential item functioning (DIF) for the PARELLA model using the approach of D. Thissen and others (1988). (SLD)
Descriptors: Attitude Measures, Computer Simulation, Equations (Mathematics), Estimation (Mathematics)
Peer reviewed Peer reviewed
Haladyna, Thomas M. – Educational Measurement: Issues and Practice, 1992
Context-dependent item sets, containing a subset of test items related to a passage or stimulus, are discussed. A brief review of methods for developing item sets reveals their potential for measuring high-level thinking. Theories and technologies for scoring item sets remain largely experimental. Research needs are discussed. (SLD)
Descriptors: Cognitive Tests, Educational Technology, Licensing Examinations (Professions), Problem Solving
Peer reviewed Peer reviewed
Beaton, Albert E.; Allen, Nancy L. – Journal of Educational Statistics, 1992
The National Assessment of Educational Progress (NAEP) makes possible comparison of groups of students and provides information about what these groups know and can do. The scale anchoring techniques described in this chapter address the latter purpose. The direct method and the smoothing method of scale anchoring are discussed. (SLD)
Descriptors: Comparative Testing, Educational Assessment, Elementary Secondary Education, Knowledge Level
Peer reviewed Peer reviewed
Liou, Michelle; Chang, Chih-Hsin – Psychometrika, 1992
An extension is proposed for the network algorithm introduced by C.R. Mehta and N.R. Patel to construct exact tail probabilities for testing the general hypothesis that item responses are distributed according to the Rasch model. A simulation study indicates the efficiency of the algorithm. (SLD)
Descriptors: Algorithms, Computer Simulation, Difficulty Level, Equations (Mathematics)
Peer reviewed Peer reviewed
Cliff, Norman; Donoghue, John R. – Psychometrika, 1992
A test theory using only ordinal assumptions is presented, based on the idea that the test items are a sample from a universe of items. The sum across items of the ordinal relations for a pair of persons on the universe items is analogous to a true score. (SLD)
Descriptors: Equations (Mathematics), Estimation (Mathematics), Item Response Theory, Item Sampling
Peer reviewed Peer reviewed
Luecht, Richard M.; Hirsch, Thomas M. – Applied Psychological Measurement, 1992
Derivations of several item selection algorithms for use in fitting test items to target information functions (IFs) are described. These algorithms, which use an average growth approximation of target IFs, were tested by generating six test forms and were found to provide reliable fit. (SLD)
Descriptors: Algorithms, Computer Assisted Testing, Equations (Mathematics), Goodness of Fit
Peer reviewed Peer reviewed
Kelderman, Henk; Rijkes, Carl P. M. – Psychometrika, 1994
A loglinear item response theory (IRT) model is proposed that relates polytomously scored item responses to a multidimensional latent space. The analyst may specify a response function for each response, and each item may have a different number of response categories. Conditional maximum likelihood estimates are derived. (SLD)
Descriptors: Equations (Mathematics), Estimation (Mathematics), Goodness of Fit, Item Response Theory
Pages: 1  |  ...  |  463  |  464  |  465  |  466  |  467  |  468  |  469  |  470  |  471  |  ...  |  636