Taylor, Ronald D. – 1993
A review of the recent literature has yielded a number of concerns about the validity, reliability, cost, efficiency, generalizability, utility, and cultural sensitivity of performance-based assessments. The resulting conclusion was that continuing the performance-based assessment initiative should be rethought. Suggested alternatives included…
Descriptors: Accountability, Cost Effectiveness, Cultural Awareness, Educational Assessment
Can Portfolios Assess Student Performance and Influence Instruction? The 1991-92 Vermont Experience.
Koretz, Daniel; And Others – 1992
The results of an evaluation of Vermont's statewide assessment initiative are presented, with information about the implementation of the program, its effects on educational practice, the analytic challenge presented by the portfolio scoring process, the reliability and validity of portfolio scores, and the tensions between assessment and…
Descriptors: Educational Assessment, Educational Change, Educational Practices, Elementary Education
Lehigh County Community Coll., Schnecksville, PA. – 1993
An experimental study compared the effectiveness of a traditional General Educational Development (GED) curriculum with a literacy curriculum based on applied literacy skills. An experimental group of 34 adult students received GED instruction emphasizing functional and workplace contexts and supplemental instruction, whereas the 35 students in…
Descriptors: Adult Basic Education, Comparative Analysis, Course Descriptions, Followup Studies
Croft, Cedric – 1993
Standards-based assessment, or at least the concept of standards-based assessment, will provide a key strategy for the implementation of the New Zealand National Qualifications Framework. This paper considers the meaning of standards-based assessment and its role in New Zealand's assessment for nationally recognized qualifications. Standards-based…
Descriptors: Academic Achievement, Competence, Criterion Referenced Tests, Educational Assessment
Hacker, Jacob; Hathaway, Walter – 1991
Testing and assessment that are "more authentic" (performance-based or alternative) represent the most pressing issue in education today. Some of the major criticisms leveled at standardized testing are examined, and the advantages and disadvantages of more authentic assessment are reviewed. A general direction for integrating traditional and…
Descriptors: Comparative Analysis, Cost Effectiveness, Educational Assessment, Educational Trends
Dolmans, Diana H. J. M.; And Others – 1991
In problem-based learning, an instrument is needed to measure students' actual learning activities. Mapping the domain of learning activities undertaken by students and improving problems is especially important in a problem-based curriculum because students choose their own learning objectives. Such an instrument, the Topic Evaluation…
Descriptors: Course Content, Course Evaluation, Curriculum Development, Curriculum Evaluation
Burke, J. Bruce; VanSusteren, Timothy J. – 1984
A study, conducted in a basic educational psychology course which serves as a gateway course to a teacher education program at Michigan State University, focused on the development of new knowledge while it taught about the educational research process and the subject being studied. A new format for objective testing, alternate-choice questions,…
Descriptors: Classroom Research, Education Courses, Education Majors, Educational Psychology
Denton, Cliff; Postlethwaite, Keith – 1982
In the second year of a project investigating the ability of secondary school teachers to identify high-ability students, two questions were addressed: what student characteristics influenced teachers' judgments, and why checklists appeared to have little impact on teachers' judgments. A structured approach was developed to study student…
Descriptors: Ability Grouping, Ability Identification, Academically Gifted, Check Lists
Lockhart, Madelyn M.; And Others – 1988
The report presents a summary of selected quantitative measures for August 1987 through July 1988 of students who applied for University of Florida graduate school admission, those who were accepted by academic departments, and those who subsequently enrolled. A brief introduction and listing of procedural assumptions precede the two figures and…
Descriptors: College Admission, College Applicants, College Entrance Examinations, Foreign Students
Des Marchais, Jacques E.; And Others – 1989
The design and evaluation of a rating scale for program evaluation in problem-based medical curricula are described. The design process was guided by a theory of how students learn in problem-based learning as modeled by W. H. Gijselaers and H. G. Schmidt (1988). The rating scale was used in an evaluation of a new problem-based, community-oriented…
Descriptors: Construct Validity, Curriculum Evaluation, Factor Analysis, Foreign Countries
Garrido, Mariquita; Payne, David A. – 1987
Minimum competency cut-off scores on a statistics exam were estimated under four conditions: the Angoff judging method with item data available (n=20) and without (n=19); and the Modified Angoff method with (n=19) and without (n=19) item data available to judges. The Angoff method required free response percentage estimates (0-100) percent,…
Descriptors: Academic Standards, Comparative Analysis, Criterion Referenced Tests, Cutting Scores
Ward, William C.; And Others – 1986
The keylist format (rather than the conventional multiple-choice format) for item presentation provides a machine-scorable surrogate for a truly free-response test. In this format, the examinee is required to think of an answer, look it up in a long ordered list, and enter its number on an answer sheet. The introduction of keylist items into…
Descriptors: Analogy, Aptitude Tests, Construct Validity, Correlation
Stanley, William B.; And Others – 1985
This study investigated a number of questions regarding the nature of social concept development in young children. Subjects were 64 kindergarten children and 65 first grade public school students from lower to upper middle class socioeconomic levels, of whom 66 were male, 63 were female, 78 were Caucasian, and 51 were black. Two assessment…
Descriptors: Age Differences, Blacks, Concept Formation, Difficulty Level
Knoop, Robert; Common, Ronald W. – 1985
The Performance Review, Analysis, and Improvement System for Educators (PRAISE) is a formative evaluation instrument designed to improve the performance of school principals. The system appears to be reliable and valid and is flexible enough to accommodate the needs of a variety of schools. Sample items and categories of the instrument include…
Descriptors: Administrator Evaluation, Computer Oriented Programs, Data Interpretation, Elementary Secondary Education
Legg, Sue M.; Algina, James – 1986
This paper focuses on the questions which arise as test practitioners monitor score scales derived from latent trait theory. Large scale assessment programs are dynamic and constantly challenge the assumptions and limits of latent trait models. Even though testing programs evolve, test scores must remain reliable indicators of progress.…
Descriptors: Difficulty Level, Educational Assessment, Elementary Secondary Education, Equated Scores


