Publication Date
| In 2026 | 0 |
| Since 2025 | 49 |
| Since 2022 (last 5 years) | 211 |
| Since 2017 (last 10 years) | 492 |
| Since 2007 (last 20 years) | 984 |
Descriptor
| Test Validity | 3908 |
| Test Reliability | 1517 |
| Testing | 1090 |
| Test Construction | 1014 |
| Testing Problems | 1008 |
| Computer Assisted Testing | 616 |
| Elementary Secondary Education | 553 |
| Foreign Countries | 494 |
| Higher Education | 490 |
| Standardized Tests | 488 |
| Test Interpretation | 433 |
| More ▼ | |
Source
Author
| Ebel, Robert L. | 16 |
| Hambleton, Ronald K. | 13 |
| Green, Donald Ross | 10 |
| Popham, W. James | 10 |
| Linn, Robert L. | 9 |
| Haney, Walt | 8 |
| Koretz, Daniel | 8 |
| Sireci, Stephen G. | 8 |
| Thompson, Bruce | 8 |
| Tindal, Gerald | 8 |
| Hilliard, Asa G., III | 7 |
| More ▼ | |
Publication Type
Education Level
Audience
| Practitioners | 137 |
| Researchers | 134 |
| Teachers | 51 |
| Administrators | 34 |
| Policymakers | 18 |
| Counselors | 11 |
| Students | 8 |
| Parents | 5 |
| Support Staff | 4 |
| Community | 2 |
Location
| Canada | 57 |
| Australia | 40 |
| California | 40 |
| China | 34 |
| United Kingdom (England) | 31 |
| United Kingdom | 29 |
| New York | 28 |
| United States | 26 |
| Florida | 22 |
| Germany | 21 |
| Turkey | 20 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Peer reviewedShapiro, Steven K.; And Others – Journal of School Psychology, 1995
Examines the performance characteristics of 83 school-identified learning-disabled children on the Differential Ability Scales. Sixty percent showed a significant standard score discrepancy between the General Conceptual Ability and at least one achievement test. Implications regarding the educational diagnostic and intervention processes…
Descriptors: Academic Ability, Achievement Tests, Cognitive Ability, Intelligence
Peer reviewedVance, Hubert; And Others – Psychology in the Schools, 1996
Investigated performance of 166 special education students (6 to 16 years) who had taken the Wechsler Intelligence Scale for Children-Revised (WISC-R) and later the Wechsler Intelligence Scale for Children-III (WISC-III). Results indicated a significant, positive correlation among global scales (p<.001). Findings suggest WISC-R and WISC-III…
Descriptors: Comparative Analysis, Comparative Testing, Elementary Secondary Education, Intelligence Tests
Koul, Ravinder; Clariana, Roy B.; Salehi, Roya – Journal of Educational Computing Research, 2005
This article reports the results of an investigation of the convergent criterion-related validity of two computer-based tools for scoring concept maps and essays as part of the ongoing formative evaluation of these tools. In pairs, participants researched a science topic online and created a concept map of the topic. Later, participants…
Descriptors: Scoring, Essay Tests, Test Validity, Formative Evaluation
Laux, John M.; Young, Jennifer L.; McLaughlin, Laura P.; Perera-Diltz, Dilani – Canadian Journal of Counselling, 2006
The Schwartz Outcome Scale-10 (SOS-10) is an effective measure of change in inpatient and outpatient populations, as well as student counseling centers, chemically dependent populations, and research projects. The utility of the SOS-10 in Canada is limited because there is currently no French version of this instrument. This study reports on a…
Descriptors: Foreign Countries, Translation, Measures (Individuals), Patients
Gipps, Caroline V. – Studies in Higher Education, 2005
This paper reviews the role of ICT-based assessment in the light of the growing use of virtual learning environments in universities. Issues of validity, efficiency, type of response, and scoring are addressed. A major area of research is the automated scoring of text. Claims for automated formative assessment are queried, since the feedback of…
Descriptors: Scoring, Evaluation Methods, Feedback, Formative Evaluation
Kong, Xiaojing J.; Wise, Steven L.; Bhola, Dennison S. – Educational and Psychological Measurement, 2007
This study compared four methods for setting item response time thresholds to differentiate rapid-guessing behavior from solution behavior. Thresholds were either (a) common for all test items, (b) based on item surface features such as the amount of reading required, (c) based on visually inspecting response time frequency distributions, or (d)…
Descriptors: Test Items, Reaction Time, Timed Tests, Item Response Theory
National Academy of Education, Stanford, CA. – 1993
The National Academy of Education Panel is providing the intellectual leadership and coordination for an independent evaluation of the 1990 and 1992 Trial State Assessments (TSA) of the National Assessment of Educational Progress (NAEP). These instruments provide state-by-state comparisons of educational achievement, which is the first such use of…
Descriptors: Academic Achievement, Comparative Analysis, Educational Assessment, Elementary School Students
ERIC Clearinghouse on Reading and Communication Skills, Urbana, IL. – 1984
This collection of abstracts is part of a continuing series providing information on recent doctoral dissertations. The 19 titles deal with a variety of topics, including the following: (1) the effectiveness of English placement examinations used at five junior colleges in California; (2) the direction and distance of context required by college…
Descriptors: Academic Achievement, Annotated Bibliographies, Doctoral Dissertations, Educational Assessment
Ray, John R.; Bowman, Harry L. – 1988
The licensure of public school personnel in Tennessee is a function of the State Department of Education (SDE). Additional requirements for this certification function were added by the Comprehensive Education Reform Act (Senate Bill Number 1) of the 1984 Tennessee General Assembly. The Act mandated that the SDE use tests to assess the competence…
Descriptors: Beginning Teachers, Communication Skills, Higher Education, Item Banks
van der Flier, Henk; Drenth, Pieter J. D. – 1977
The criterion-oriented problem of test bias and fairness in selection is compared to the construct-oriented problem of comparability of test scores in cross-cultural research. These problems are shown to have important similarities, and their studies may supplement each other. In a formulation of psychometric criteria for the comparability of test…
Descriptors: Admission Criteria, Comparative Testing, Criterion Referenced Tests, Cross Cultural Studies
Weiss, David J., Ed. – 1977
This symposium consists of five papers and presents some recent developments in adaptive testing which have applications to several military testing problems. The overview, by James R. McBride, defines adaptive testing and discusses some of its item selection and scoring strategies. Item response theory, or item characteristic curve theory, is…
Descriptors: Ability, Achievement Tests, Adaptive Testing, Bayesian Statistics
Cummins, Jim – 1980
Intelligence quotient (IQ) scores are widely accepted as measures of academic potential. However, both hereditary and environmental factors also play a role in performance. The limitations of IQ tests require that they be handled differently when administered to students from backgrounds other than the dominant cultural group. In addition,…
Descriptors: Cultural Context, Educational Environment, Educational Practices, Elementary Secondary Education
PDF pending restorationSilverman, Robert J.; Russell, Randall H. – 1977
Before a bilingual program can be set up, students who are potential candidates for such a program must be identified. A study was made to investigate the interrelationships of three commonly used measures of "language dominance": the Language Facility Test (LFT), the Home Bilingual Usage Estimate (HBUE), and the Teacher Judgment Questionnaire…
Descriptors: Achievement Tests, Bilingual Education, Bilingual Students, Bilingual Teachers
Steinberg, Jonathan; Cline, Frederick; Ling, Guangming; Cook, Linda; Tognatta, Namrata – Journal of Applied Testing Technology, 2009
This study examines the appropriateness of a large-scale state standards-based English-Language Arts (ELA) assessment for students who are deaf or hard of hearing by comparing the internal test structures for these students to students without disabilities. The Grade 4 and 8 ELA assessments were analyzed via a series of parcel-level exploratory…
Descriptors: Test Bias, Language Arts, State Standards, Partial Hearing
Kane, Michael T. – 1992
Valid assessment of professional competence has proven to be an elusive goal. Objective tests, direct observation of performance, overall ratings of competence, and simulations have been tried and found wanting in one way or another. Objective test items are criticized as being unrealistic and therefore invalid. Direct observation tends to be very…
Descriptors: Competence, Objective Tests, Observation, Performance Tests

Direct link
