Publication Date
In 2025 | 8 |
Since 2024 | 32 |
Since 2021 (last 5 years) | 122 |
Since 2016 (last 10 years) | 277 |
Since 2006 (last 20 years) | 469 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
Practitioners | 395 |
Teachers | 190 |
Administrators | 102 |
Researchers | 99 |
Policymakers | 57 |
Students | 48 |
Parents | 43 |
Counselors | 19 |
Community | 14 |
Support Staff | 3 |
Location
Canada | 83 |
Australia | 65 |
United States | 46 |
California | 35 |
United Kingdom (England) | 29 |
New York | 28 |
Texas | 27 |
Netherlands | 26 |
United Kingdom | 26 |
Kentucky | 23 |
Ohio | 22 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating

Sternberg, Robert J. – Educational Researcher, 1998
Links the literatures on human abilities and expertise, suggesting that human abilities are a form of developing expertise. Discusses the role of tests in a scheme that regards abilities as developing expertise and presents a model that implies a shift toward practice grounded in the development of knowledge-based expertise in all children.…
Descriptors: Ability, Children, Educational Assessment, Elementary Secondary Education

Phelps, Richard P. – Educational Measurement: Issues and Practice, 2000
Compiled information from 31 countries to study trends in large-scale testing. Shows a clear trend toward adding, not dropping, testing programs. Twenty-seven countries show a net increase in testing, while only three show a decrease. Fifty-nine testing programs have been added; only four have been dropped. (SLD)
Descriptors: Educational Trends, Foreign Countries, International Education, International Studies

MacKay, Gilbert; Lundie, Jennifer – International Journal of Disability, Development and Education, 1998
Recognizes the attraction of Goal Attainment Scaling (GAS), a technique that uses a scale to measure client's achievement, but suggests that there are concerns about the calculation of its standard scores. Examples show how GAS may be used in service development, whether or not numerical values are attached. (Author/CR)
Descriptors: Achievement Gains, Achievement Rating, Adults, Children

Graham, John R.; Ben-Porath, Yossef S.; McNulty, John L. – Psychological Assessment, 1997
The meaning of low scores on some Minnesota Multiphasic Personality Inventory-2 (MMPI-2) scales was examined by comparing therapists' descriptors of 669 mental health patients with high, normal, or low scores on each scale. Results show that for most scales both high and low scores provide potentially important information. (SLD)
Descriptors: Correlation, Mental Disorders, Patients, Personality Assessment

Posavac, E. J. – Evaluation and Program Planning, 1998
Misuses of null hypothesis significance testing are reviewed and alternative approaches are suggested for carrying out and reporting statistical tests that might be useful to program evaluators. Several themes, including the importance of respecting the magnitude of Type II errors and describing effect sizes in units stakeholders can understand,…
Descriptors: Effect Size, Evaluation Methods, Hypothesis Testing, Program Evaluation

Shavelson, Richard J.; Solano-Flores, Guillermo; Ruiz-Primo, Maria Araceli – Evaluation and Program Planning, 1998
Research on developing technology for large-scale performance assessments in science is reported briefly, and a conceptual framework is presented for defining, generating, and evaluating science performance assessments. Types of tasks are discussed, and the technical qualities of performance assessments are discussed in the context of…
Descriptors: Educational Technology, Generalizability Theory, Models, Performance Based Assessment

Bartley, Anthony W. – Evaluation and Program Planning, 1998
Outlines each of the papers presented in this special section, describes difficulties the arguments posed, and raises questions that might be put to the author of each of these discussions of new assessment methods in mathematics. Implications for the technology for the development of performance assessments are discussed. (SLD)
Descriptors: Educational Technology, Mathematics Tests, Science Education, Science Tests

Prieto, Luis; Roset, Montse; Badia, Xavier – Journal of Applied Measurement, 2001
Tested the metric properties of a Spanish version of the Assessment of Growth Hormone Deficiency in Adults (AGHDA) questionnaire through Rasch analysis with a sample of 356 adult patients in Spain. Results suggest that the Spanish AGHDA could be a useful complement of the clinical evaluation of growth hormone deficiency patients at group and…
Descriptors: Adults, Evaluation Methods, Foreign Countries, Individual Development
Olsen, Laurie – Leadership, 2001
Good reforms can have harmful results if equity effects are ignored. As California implements its accountability system, certain questions must be addressed concerning the system's data use, measurement features (consistency, meaningfulness, achievement growth, achievement gaps among groups), instructional improvement focus, incentives for…
Descriptors: Academic Achievement, Accountability, Data Collection, Educational Change

Cashel, Mary Louise; And Others – Assessment, 1995
The use of scales on the Personality Assessment Inventory (PAI) to detect defensiveness in criminal and nonclinical samples was evaluated with 45 male inmates and 38 male undergraduates under standard conditions or under instructions to feign a positive role. Results indicate that the PAI is susceptible to defensive dissimulation. (SLD)
Descriptors: Criminals, Higher Education, Identification, Multivariate Analysis

Reise, Steven P.; Flannery, Wm. Peter – Applied Measurement in Education, 1996
Statistical and theoretical issues that arise from assessing person-fit on measures of typical performance are discussed, including the frequent attenuation of detection of person-misfit, the need for methods of identifying sources of response aberrancy, and person-fit measures as moderators of trait-criterion relations. (SLD)
Descriptors: Item Response Theory, Measurement Techniques, Performance, Responses

Raven, John – Cognitive Psychology, 2000
Summarizes data related to the stability and variation in the norms for the Raven's Progressive Matrices Test (J. Raven, 1936), a measure of basic cognitive functioning, for different cultural, ethnic, and socioeconomic groups worldwide and within countries. Also considers variation over time and suggests an explanation for the variation in norms…
Descriptors: Change, Cognitive Tests, Ethnicity, Foreign Countries

Haertel, Edward H. – Educational Measurement: Issues and Practice, 1999
Discusses issues of validity in high-stakes testing, beginning with some purposes of a testing program and proceeding to some underlying assumptions about testing. Suggests four possible studies to address assumptions often ignored by asking various groups of people about testing. (SLD)
Descriptors: Elementary Secondary Education, High Stakes Tests, Research Needs, Surveys

Popham, W. James – Educational Measurement: Issues and Practice, 1999
Discusses the direction large-scale educational testing is heading, pointing out pitfalls in current and future use of such tests. The large-scale assessment community seems to be unconcerned about the central mission of education, the instruction of children. (SLD)
Descriptors: Educational Testing, Futures (of Society), Role of Education, Standardized Tests
Gose, Ben; Selingo, Jeffrey – Chronicle of Higher Education, 2001
Explores how social, legal, and demographic forces threaten to dethrone the most widely used college entrance exam. New criticism focuses on the use of what is essentially an IQ test to measure students' ability to learn. (EV)
Descriptors: College Admission, College Entrance Examinations, High Stakes Tests, Higher Education