NotesFAQContact Us
Collection
Advanced
Search Tips
Showing 7,816 to 7,830 of 9,533 results Save | Export
Peer reviewed Peer reviewed
Williams, Valerie S. L. – Applied Measurement in Education, 1997
Using item response theory to investigate differential item functioning (DIF), students' expected course grades were examined and found to function similarly across sex and race. These grades were incorporated into the matching criterion, enhancing the validity of subgroup comparisons for the third-grade mathematics test taken by 1,050 students.…
Descriptors: Comparative Analysis, Criteria, Elementary School Students, Grade 3
Peer reviewed Peer reviewed
And Others; Birenbaum, Menucha – Educational and Psychological Measurement, 1997
The agreement of diagnostic classifications from two parallel subtests assessing a mathematics skill with three levels of scoring was studied with 431 Arab Israeli 10th graders. Results indicate that, even when parallel form reliability is high, less agreement is apparent when performance is evaluated at the micro level. (SLD)
Descriptors: Arabs, Classification, Diagnostic Tests, Evaluation Methods
Peer reviewed Peer reviewed
Sireci, Stephen G. – Educational Measurement: Issues and Practice, 1997
Different methodologies for linking tests across languages are reviewed and evaluated, focusing on monolingual item response theory, bilingual group designs, and matched monolingual group designs. These methods, although not without weaknesses, are superior for promoting score comparability than methods that rely on translation or expert judgment…
Descriptors: Bilingualism, Comparative Analysis, Cross Cultural Studies, Educational Assessment
Peer reviewed Peer reviewed
Motl, Robert W.; DiStefano, Christine – Structural Equation Modeling, 2002
Examined the longitudinal invariance of method effects associated with negatively worded items on a self-report measure of global self-esteem. Data from the National Educational Longitudinal Study for 3,950 junior high school and high school students show that the method effects associated with negatively worded items exhibit invariance across…
Descriptors: High School Students, High Schools, Junior High School Students, Junior High Schools
Peer reviewed Peer reviewed
Kulhavy, Raymond W.; And Others – Contemporary Educational Psychology, 1990
Assumptions of a servocontrol model of test item feedback were tested in a study of 94 junior and senior high school students receiving feedback or no feedback with 2 retention intervals. Results support the assumption that response certitude is related to the learner's ability to comprehend a given item. (SLD)
Descriptors: Comparative Testing, Feedback, High School Students, Junior High School Students
Peer reviewed Peer reviewed
Vockell, Edward L.; Hall, Jane – Social Studies, 1989
Examines the ways in which computers can assist teachers in developing good tests. Describes the program TESTWORKS in detail and provides charts comparing this program with 11 others in the areas of price, type of questions generated, computer functions, and the usefulness of each. Discusses the use of word processors and databases. (KO)
Descriptors: Adaptive Testing, Computer Assisted Testing, Computer Software, Computer Uses in Education
Peer reviewed Peer reviewed
Hsu, Tse-chi; Yu, Lifa – Educational Measurement: Issues and Practice, 1989
How computers are used to analyze item data is reviewed, and the information that existing item-analysis programs provide is described. Summaries of studies comparing the performance of some of these packages reveal some of their current limitations. Emphasis is on the usefulness to educational practice of these packages. (SLD)
Descriptors: Computer Assisted Testing, Computer Software, Computer Software Reviews, Computer Uses in Education
Peer reviewed Peer reviewed
Smith, Richard M.; And Others – Journal of Dental Education, 1989
A study of gender bias in the Dental Admission Test's mathematics test and its validity in predicting dental school success found no significant difference between male and female performance and no significant difference in the predictive validity of items favoring males or females. (Author/MSE)
Descriptors: College Entrance Examinations, Dental Schools, Higher Education, Logical Thinking
Peer reviewed Peer reviewed
Gohmann, Stephan F.; Spector, Lee C. – Journal of Economic Education, 1989
Compares the effect of content ordering and scrambled ordering on examinations in courses, such as economics, that require quantitative skills. Empirical results suggest that students do no better if they are given a content-ordered rather than a scrambled examination as student performance is not adversely affected by scrambled ordered…
Descriptors: Cheating, Economics Education, Educational Research, Grading
Peer reviewed Peer reviewed
Way, Walter D.; And Others – Applied Measurement in Education, 1989
The effects of using item response theory (IRT) ability estimates based on customized tests formed by selecting areas from a nationally standardized achievement test were examined. For some populations, in some conditions, IRT ability estimates can be equivalent to scores based on full-length tests. (SLD)
Descriptors: Achievement Tests, Adaptive Testing, Content Validity, Elementary Education
Peer reviewed Peer reviewed
McKinley, Robert L. – Journal of Educational Measurement, 1988
Six procedures for combining sets of item response theory (IRT) item parameter estimates from different samples were evaluated using real and simulated response data. Results support use of covariance matrix-weighted averaging and a procedure using sample-size-weighted averaging of estimated item characteristic curves at the center of the ability…
Descriptors: College Entrance Examinations, Comparative Analysis, Computer Simulation, Estimation (Mathematics)
Peer reviewed Peer reviewed
Reynolds, Trudy; And Others – Language Testing, 1994
Presents a study conducted to provide a comparative analysis of five item analysis indices using both IRT and non-IRT indices to describe the characteristics of flagged items and to investigate the appropriateness of logistic regression as an item analysis technique for further studies. The performance of five item analysis indices was examined.…
Descriptors: College Students, Comparative Analysis, English (Second Language), Item Analysis
Peer reviewed Peer reviewed
Bordage, Georges; And Others – Academic Medicine, 1995
Three related Canadian studies assessed the content validity of 59 clinical problems designed as part of a test of medical decision-making skills. Focus was on the key features, i.e., the critical or essential steps in identification and management of the clinical problem. Results support content validity of the key features. (MSE)
Descriptors: Clinical Teaching (Health Professions), Content Validity, Decision Making, Foreign Countries
Peer reviewed Peer reviewed
Korashy, Abdel-Fattah El- – Educational and Psychological Measurement, 1995
The Rasch model was applied to selection of items for an Arabic version of the Otis-Lennon Mental Ability Test using a sample of 599 male and female Kuwaiti secondary school and university students. Results indicated that the test is suitable for the range of ability intended to be measured. (SLD)
Descriptors: Arabic, Cognitive Ability, College Students, Foreign Countries
Peer reviewed Peer reviewed
Black, Paul – Studies in Educational Evaluation, 1995
The role of assessment in science education is explored, focusing on summative assessment in British public certificate examinations. Examples of test items are presented to illustrate difficulties in making valid and reliable assessments, and issues with implications for formative assessment are discussed. (SLD)
Descriptors: Educational Assessment, Feedback, Foreign Countries, Formative Evaluation
Pages: 1  |  ...  |  518  |  519  |  520  |  521  |  522  |  523  |  524  |  525  |  526  |  ...  |  636