Showing 646 to 660 of 1,161 results
Peer reviewed
Stone, Clement A. – Educational Measurement: Issues and Practice, 1992
TESTAT is a supplementary module for the popular SYSTAT statistical package for the personal computer. The program performs test analyses based on classical test theory and item response theory. Limitations and advantages are discussed. (SLD)
Descriptors: Computer Assisted Testing, Computer Software Evaluation, Error of Measurement, Item Response Theory
Peer reviewed
Armstrong, Ronald D.; Jones, Douglas H. – Applied Psychological Measurement, 1992
Polynomial algorithms are presented that are used to solve selected problems in test theory, and computational results from sample problems with several hundred decision variables are provided that demonstrate the benefits of these algorithms. The algorithms are based on optimization theory in networks (graphs). (SLD)
Descriptors: Algorithms, Decision Making, Equations (Mathematics), Mathematical Models
Peer reviewed
Davidson, Fred – System, 2000
Statistical analysis tools in language testing are described, chiefly classical test theory and item response theory. Computer software for statistical analysis is briefly reviewed and divided into three tiers: commonly available; statistical packages; and specialty software. (Author/VWL)
Descriptors: Computer Software, Language Tests, Second Language Learning, Statistical Analysis
Peer reviewed
Brown, Roger – International Journal of Computer Algebra in Mathematics Education, 2001
Reviews what is happening in examination systems that have begun to allow the use of Computer Algebra Systems (CAS) in externally set 'high stakes' assessment regimes. Discusses possible options for the future with the intention of developing a dialogue on how assessment with a CAS can help develop the mathematical literacy of students. (Author/MM)
Descriptors: Calculators, Computer Uses in Education, Evaluation, High Stakes Tests
Sireci, Stephen G. – 1995
The purpose of this paper is to clarify the seemingly discrepant views of test theorists and test developers about terminology related to the evaluation of test content. The origin and evolution of the concept of content validity are traced, and the concept is reformulated in a way that emphasizes the notion that content domain definition,…
Descriptors: Construct Validity, Content Validity, Definitions, Item Analysis
Anderson, Margaret D. – 1996
An effective test and measurement course in psychology should expose students to a variety of available psychological tests, as well as to the mechanics of test construction and evaluation. In a test and measurement course at the State University of New York's College at Cortland, the course is divided into two components with an overlaying group…
Descriptors: Cooperative Learning, Group Activities, Higher Education, Psychological Testing
Linacre, John M.; Wright, Benjamin D. – 1987
The Mantel-Haenszel (MH) procedure attempts to identify and quantify differential item performance (item bias). This paper summarizes the MH statistics, and identifies the parameters they estimate. An equivalent procedure based on the Rasch model is described. The theoretical properties of the two approaches are compared and shown to require the…
Descriptors: Algorithms, Estimation (Mathematics), Item Analysis, Measurement Techniques
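As an illustration of the Mantel-Haenszel statistic mentioned in the abstract above, the following is a minimal sketch of the standard MH common odds ratio pooled over ability strata. It is not taken from the paper itself; the function name and the `(a, b, c, d)` table layout are assumptions made for this example.

```python
def mantel_haenszel_odds_ratio(tables):
    """Pooled Mantel-Haenszel odds ratio for one item.

    tables: iterable of (a, b, c, d) counts, one tuple per score stratum
    (hypothetical layout chosen for this sketch):
      a = reference group, correct    b = reference group, incorrect
      c = focal group, correct        d = focal group, incorrect
    """
    numerator = 0.0
    denominator = 0.0
    for a, b, c, d in tables:
        n = a + b + c + d
        if n == 0:
            continue  # an empty stratum carries no information
        numerator += a * d / n
        denominator += b * c / n
    if denominator == 0:
        raise ValueError("odds ratio undefined: sum of b*c/n is zero")
    return numerator / denominator


# Two strata with identical performance in both groups: ratio is 1 (no DIF signal).
no_dif = mantel_haenszel_odds_ratio([(10, 10, 10, 10), (20, 5, 20, 5)])
# One stratum where the reference group does better: ratio is above 1.
some_dif = mantel_haenszel_odds_ratio([(30, 10, 20, 20)])
```

A value near 1 suggests the item behaves similarly for both groups once ability is matched; values far from 1 flag the item for review.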
Norris, Stephen P. – 1989
This report describes a methodology for using verbal reports of thinking to develop and validate multiple-choice tests of critical thinking. These verbal reports of individuals' thinking on draft items of multiple-choice critical thinking tests can be used systematically to provide evidence of the thinking processes elicited by such tests, and in…
Descriptors: Critical Thinking, Educational Research, Multiple Choice Tests, Protocol Analysis
Reuman, David A.; And Others – 1982
According to classical test theory, the presence of random measurement error in a psychological test has important implications for validation studies. The more comprehensive application of classical test theory in construct validation is distinguished from that in criterion-oriented validation. Critics of thematic apperceptive measurement of the…
Descriptors: Academic Achievement, Achievement Need, Adults, Error of Measurement
Oxford-Carpenter, Rebecca L.; Schultz-Shiner, Linda J. – 1985
This paper addresses practical Army problems in reading assessment from a theory base reflecting the most recent research on reading comprehension. Military and occupational research shows that reading proficiency is related to job performance. Reading assessment is a key issue in the Army due to changes in the reading ability levels of the Army…
Descriptors: Armed Forces, Military Personnel, Postsecondary Education, Psychometrics
Hutchinson, T. P. – 1985
For over 50 years, the overwhelming weight of evidence has been that subjects are able to make use of partial information when responding to multiple-choice items. The subject chooses the alternative which has given rise to the lowest mismatch, except that if this minimum mismatch is larger than some threshold, the question is left unanswered.…
Descriptors: Guessing (Tests), Multiple Choice Tests, Predictive Measurement, Science Tests
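The response rule described in the abstract above (choose the alternative with the lowest mismatch, or omit the item if even that mismatch exceeds a threshold) can be sketched as follows. This is only an illustrative reading of the abstract; the function name, the numeric mismatch values, and the threshold are hypothetical.

```python
def choose_alternative(mismatches, threshold):
    """Return the index of the chosen alternative, or None if the item is omitted.

    mismatches: one non-negative mismatch value per alternative (hypothetical scale).
    threshold: largest minimum mismatch at which the subject still answers.
    """
    if not mismatches:
        raise ValueError("an item needs at least one alternative")
    # Index of the alternative with the smallest mismatch.
    best = min(range(len(mismatches)), key=lambda i: mismatches[i])
    if mismatches[best] > threshold:
        return None  # minimum mismatch too large: question left unanswered
    return best


answered = choose_alternative([0.9, 0.2, 0.5], threshold=0.4)
omitted = choose_alternative([0.9, 0.6, 0.7], threshold=0.4)
```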
Fremer, John J. – 1985
The author proposes a greater professional association role in establishing standards for quality assurance in testing. He presents his views as a test developer who dislikes the legal model for resolving professional issues. The use of publications and informational activities to make people aware of the professional standards and how they can be…
Descriptors: Professional Associations, Professional Continuing Education, Quality Control, Standards
Holmes, Susan E. – 1982
The purpose of the present study was to examine the accuracy of indirect trait estimates, i.e., estimates of some primary trait obtained from a second measure which have been equated to the first. The California Achievement Test in Reading was the primary measure and the Prescriptive Reading Inventory was the indirect measure. Four kinds of…
Descriptors: Content Analysis, Elementary Education, Equated Scores, Item Analysis
Shaycoft, Marion F. – 1979
This paper focuses on the use of "paper and pencil" criterion-referenced tests in educational measurement. To correct misconceptions, the definitions of basic terms and historical antecedents are discussed. Classifications of the tests are compared with other achievement tests. The phases in developing criterion-referenced tests are presented with the…
Descriptors: Achievement Tests, Criterion Referenced Tests, Educational Testing, Evaluation Methods
Yen, Wendy M. – 1982
Test scores that are not perfectly reliable cannot be strictly equated unless they are strictly parallel. This fact implies that tau equivalence can be lost if an equipercentile equating is applied to observed scores that are not strictly parallel. Thirty-six data sets are simulated to represent the equating of tests with different difficulties…
Descriptors: Difficulty Level, Equated Scores, Latent Trait Theory, Methods
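For readers unfamiliar with the equipercentile equating referred to in the abstract above, here is a minimal unsmoothed sketch: a score on form X is mapped to the form Y score with the same percentile rank. It is not the procedure used in the study; the function names and the midpoint percentile-rank convention are assumptions, and operational equating usually adds smoothing.

```python
from bisect import bisect_left, bisect_right


def percentile_rank(sorted_scores, x):
    """Midpoint percentile rank of x, as a proportion between 0 and 1."""
    below = bisect_left(sorted_scores, x)
    at = bisect_right(sorted_scores, x) - below
    return (below + 0.5 * at) / len(sorted_scores)


def equipercentile_equate(x, scores_x, scores_y):
    """Map score x on form X to the form Y score with the same percentile rank."""
    xs = sorted(scores_x)
    ys = sorted(scores_y)
    p = percentile_rank(xs, x)
    # Under the midpoint convention the i-th order statistic (0-based) of a
    # sample of size n has rank (i + 0.5) / n, so invert that relation.
    pos = p * len(ys) - 0.5
    pos = max(0.0, min(pos, len(ys) - 1.0))
    lo = int(pos)
    hi = min(lo + 1, len(ys) - 1)
    frac = pos - lo
    # Linear interpolation between neighbouring order statistics of form Y.
    return ys[lo] + frac * (ys[hi] - ys[lo])


form_x = [1, 2, 3, 4, 5]
form_y = [2, 4, 6, 8, 10]
equated_mid = equipercentile_equate(3, form_x, form_y)
equated_low = equipercentile_equate(1, form_x, form_y)
```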