Peer reviewed: Norcini, John J. – Journal of Educational Measurement, 1987
Answer keys for physician and teacher licensing examinations were studied. The impact of variability on total errors of measurement was examined for answer keys constructed using the aggregate method. Results indicated that, in some cases, scorers contributed to a sizable reduction in measurement error. (Author/GDC)
Descriptors: Adults, Answer Keys, Error of Measurement, Evaluators
Peer reviewed: Halpern, Honey G. – Reading World, 1984
Demonstrates that the reflection-impulsivity dimension is related to some reading tasks but not others. Finds in particular that not all word recognition tasks are affected by conceptual tempo. (FL)
Descriptors: Conceptual Tempo, Grade 2, Primary Education, Reading Research
Maddox, Taddy, Ed. – 2003
This directory describes tests available for use by psychologists, educators and human resource personnel in businesses. Each of the three main sections is divided into subsections. Psychology contains 21 subsections; Education, 49 subsections; and Business, 20 subsections. The tests within each subsection are listed alphabetically by title. Each…
Descriptors: Business, Elementary Secondary Education, Measures (Individuals), Psychology
Peer reviewed: Bardo, John W.; Yeager, Samuel J. – Perceptual and Motor Skills, 1982
Responses to various fixed test-response formats were examined for "reliability" due to systematic error; Cronbach's alphas up to .67 were obtained. Of formats tested, four-point Likert Scales were least affected while forms of lines and faces were most problematic. Possible modification in alpha to account for systematic bias is…
Descriptors: Higher Education, Measures (Individuals), Psychometrics, Response Style (Tests)
Peer reviewed: Albanese, Mark A. – Evaluation and the Health Professions, 1982
Findings regarding formats and scoring formulas for multiple-choice test items with more than one correct response are presented. Strong cluing effects in the Type K format, increasing the correct score percentage and reducing test reliability, recommend using the Type X format. Alternative scoring methods are discussed. (Author/CM)
Descriptors: Health Occupations, Multiple Choice Tests, Professional Education, Response Style (Tests)
Peer reviewed: Morgan, Anne; Wainer, Howard – Journal of Educational Statistics, 1980
Two estimation procedures for the Rasch Model of test analysis are reviewed in detail, particularly with respect to new developments that make the more statistically rigorous conditional maximum likelihood estimation practical for use with longish tests. (Author/JKS)
Descriptors: Error of Measurement, Latent Trait Theory, Maximum Likelihood Statistics, Psychometrics
Peer reviewed: Straton, Ralph G.; Catts, Ralph M. – Educational and Psychological Measurement, 1980
Multiple-choice tests composed entirely of two-, three-, or four-choice items were investigated. Results indicated that the number of alternatives per item was inversely related to item difficulty, but directly related to item discrimination. Reliability and standard error of measurement of three-choice item tests were equivalent or superior.…
Descriptors: Difficulty Level, Error of Measurement, Foreign Countries, Higher Education
Peer reviewed: Green, Kathy – Journal of Experimental Education, 1979
Reliabilities and concurrent validities of teacher-made multiple-choice and true-false tests were compared. No significant differences were found even when multiple-choice reliability was adjusted to equate testing time. (Author/MH)
Descriptors: Comparative Testing, Higher Education, Multiple Choice Tests, Test Format
Peer reviewed: Frisbie, David A.; Becker, Douglas F. – Applied Measurement in Education, 1990
Seventeen educational measurement textbooks were reviewed to analyze current perceptions regarding true-false achievement testing. A synthesis of the rules for item writing is presented, and the purported advantages and disadvantages of the true-false format derived from those texts are reviewed. (TJH)
Descriptors: Achievement Tests, Higher Education, Methods Courses, Objective Tests
Peer reviewed: Federico, Pat-Anthony – Behavior Research Methods, Instruments, and Computers, 1991
Using a within-subjects design, computer-based and paper-based tests of aircraft silhouette recognition were administered to 83 male naval pilots and flight officers to determine the relative reliabilities and validities of 2 measurement modes. Relative reliabilities and validities of the two modes were contingent on the multivariate measurement…
Descriptors: Aircraft Pilots, Comparative Testing, Computer Assisted Testing, Males
Peer reviewed: Saunders, Phillip – Journal of Economic Education, 1991
Discusses the content and cognitive specification of the third edition of the Test of Understanding in College Economics. Presents examples of the construction and sampling criteria employed in the latest and previous versions of the test. Explains that the test emphasizes recognition and understanding of basic terms, concepts, and principles with…
Descriptors: Economics Education, Educational Testing, Higher Education, Student Evaluation
Peer reviewed: Brown, James Dean; And Others – Written Communication, 1991
Investigates whether prompts and topic types affect writing performance of college freshmen taking the Manoa Writing Placement Examination (MWPE). Finds that the MWPE is reliable but that responses to prompts and prompt sets differ. Shows that differences arising in performance on prompts or topics can be minimized by examining mean scores and…
Descriptors: Freshman Composition, Higher Education, Test Format, Test Reliability
Peer reviewed: Straus, Murray A.; Hamby, Sherry L.; Finkelhor, David; Moore, David W.; Runyan, Desmond – Child Abuse & Neglect: The International Journal, 1998
A study of 1,000 children examined the effectiveness of the Parent-Child Conflict Tactics Scales (CTSPC) in measuring parental psychological and physical maltreatment of children, as well as nonviolent modes of discipline. The CTSPC was found to be better suited to measuring child maltreatment than the original Conflict Tactics Scales. (Author/CR)
Descriptors: Child Abuse, Child Neglect, Discipline, Evaluation Methods
Jacobs, Stanley S. – 1994
This study analyzed and tested the equivalence of Forms A and B of the California Critical Thinking Skills Test (CCTST). In designing the CCTST, Form A was composed of 34 items from a bank of 200. To develop a parallel measure, Form B was developed by rewriting 28 of the 34 items and rearranging their order. Study participants were all entering…
Descriptors: College Freshmen, Comparative Analysis, Critical Thinking, Higher Education
Goldstein, Irwin; And Others – 1979
The purpose of this test is to evaluate a non-native speaking student's speaking knowledge of the basic structures of English, using the most frequently used words in the English language. The test does not attempt to determine vocabulary level or the student's ability to learn vocabulary effectively; rather, the test focuses exclusively on aural/oral…
Descriptors: English (Second Language), Language Proficiency, Language Tests, Listening Comprehension


