NotesFAQContact Us
Collection
Advanced
Search Tips
Showing 8,941 to 8,955 of 10,090 results Save | Export
North Carolina State Dept. of Public Instruction, Raleigh. Research and Testing Services. – 1988
General testing guidelines for experienced and new testing personnel, teachers, and school administrators are presented. The Testing Code of Ethics is intended to enhance the awareness of school personnel regarding proper testing procedures and to emphasize the unrelenting attention necessary to provide accurate test data for decision making.…
Descriptors: Administrator Responsibility, Codes of Ethics, Confidentiality, Elementary Secondary Education
Baron, Joan Boykoff; And Others – 1989
A description of ecologically valid, authentic, and integrated performance-based assessment instruments of different lengths was undertaken to measure the extent by which students meet Connecticut's Common Core of Learning (CCL) objectives. The description focuses on the students' knowledge and general understanding of science and mathematics;…
Descriptors: Educational Assessment, Educational Improvement, Elementary Secondary Education, Evaluation Criteria
Laurier, Michel – 1990
Computerized adaptive testing for language teaching and learning takes advantage of two properties of the computer: its number-crunching and multiple-branching capabilities. Adaptive testing has also been called tailored testing because it aims at presenting items that suit the student's competence and that are informative, using an item bank and…
Descriptors: Adaptive Testing, Computer Assisted Testing, Foreign Countries, French
Kippel, Gary M.; Shivakumar, K. R. – 1990
Evidence of the validity of the National Teacher Examinations' (NTEs') Early Childhood Education Test (ECET) as it is used in the New York City School System is presented. The NTE Core Battery and Specialty Area Tests are used as part of the alternative process for teacher licensure in New York City. The ECET measures knowledge and skills required…
Descriptors: Beginning Teachers, Early Childhood Education, Licensing Examinations (Professions), Minimum Competency Testing
Smith, Michael W. – 1990
The insights provided by Rasch analysis of results from a literature test were explored. Students in grades 9, 10, and 11 (n=261) responded to a 28-item test before they received one of three treatments: (1) direct instruction, based on research on metacognition in reading, which attempts to give conscious control of strategies used to understand…
Descriptors: Comparative Testing, High School Students, High Schools, Irony
Zwick, Rebecca – 1986
Although perfectly scalable items rarely occur in practice, Guttman's concept of a scale has proved to be valuable to the development of measurement theory. If the score distribution is uniform and there is an equal number of items at each difficulty level, both the elements and the eigenvalues of the Pearson correlation matrix of dichotomous…
Descriptors: Correlation, Difficulty Level, Item Analysis, Latent Trait Theory
Lockwood, Robert E.; And Others – 1986
Standards, passing scores, or cut scores have been seen as an element of criterion-referenced tests since their introduction. This paper discusses at least two issues surrounding the establishment of cut scores which appear to need clarification: (1) the theoretical definition of a cut score; and (2) decisions which must be made in selecting a…
Descriptors: Criterion Referenced Tests, Cutting Scores, Error of Measurement, High Schools
Weber, Jerry; Twing, Jon – 1986
This study examines the relative value of five empirical procedures for determining cutscores in placement of entering college freshmen. Placement test scores were obtained for all students entering in the fall, but tested in the summer of 1983 at a midwestern community college. Students who took at least one placement test in arithmetic, algebra,…
Descriptors: Achievement Tests, Basic Skills, College Freshmen, Comparative Analysis
Shale, Doug – 1986
This study is an attempt at a cohesive characterization of the concept of essay reliability. As such, it takes as a basic premise that previous and current practices in reporting reliability estimates for essay tests have certain shortcomings. The study provides an analysis of these shortcomings--partly to encourage a fuller understanding of the…
Descriptors: Analysis of Variance, Correlation, Error of Measurement, Essay Tests
Wise, Lauress L. – 1986
A primary goal of this study was to determine the extent to which item difficulty was related to item position and, if a significant relationship was found, to suggest adjustments to predicted item difficulty that reflect differences in item position. Item response data from the Medical College Admission Test (MCAT) were analyzed. A data set was…
Descriptors: College Entrance Examinations, Difficulty Level, Educational Research, Error of Measurement
Stansfield, Charles W.; Ross, Jacqueline – 1988
An overview of the research needed on the new Test of Written English (TWE), a section of the Test of English as a Foreign Language (TOEFL), looks at research needs in the areas of test validity, test reliability, topic development, and equating. Suggested topics for study include: the uniqueness of the construct measured by the test, in…
Descriptors: Construct Validity, English (Second Language), Essays, Language Tests
Nyberg, V. R.; Clarke, S. C. T. – 1983
The School Subjects Attitude Scales were developed to measure student attitudes toward school subjects. It is intended for use in grades 5-12. Reliability estimates were sufficiently high to warrant using the Scales for responses of groups, but not with individuals. Various criteria confirmed the scales' validity. Large groups of public school…
Descriptors: Attitude Measures, Elementary Secondary Education, Foreign Countries, Rating Scales
Ferrara, Steven F. – 1987
The necessity of controlling the order in which trained essay raters for a statewide writing assessment program receive student essays was studied. The underlying theoretical question concerns possible rater bias caused by raters reading long strings of essays of homogeneous quality; this problem is usually referred to as context effect or…
Descriptors: Context Effect, Essay Tests, Evaluators, Graduation Requirements
DeAyala, R. J.; Koch, William R. – 1986
A computerized flexilevel test was implemented and its ability estimates were compared with those of a Bayesian estimation based computerized adaptive test (CAT) as well as with known true ability estimates. Results showed that when the flexilevel test was terminated according to Lord's criterion, its ability estimates were highly and…
Descriptors: Ability, Adaptive Testing, Bayesian Statistics, Comparative Analysis
Drasgow, Fritz; And Others – 1987
This paper addresses the information revealed in incorrect option selection on multiple choice items. Multilinear Formula Scoring (MFS), a theory providing methods for solving psychological measurement problems of long standing, is first used to estimate option characteristic curves for the Armed Services Vocational Aptitude Battery Arithmetic…
Descriptors: Aptitude Tests, Item Analysis, Latent Trait Theory, Mathematical Models
Pages: 1  |  ...  |  593  |  594  |  595  |  596  |  597  |  598  |  599  |  600  |  601  |  ...  |  673