Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 2 |
Since 2006 (last 20 years) | 9 |
Descriptor
Source
Author
Brown, James Dean | 1 |
Bruininks, Robert H. | 1 |
Calmettes, Guillaume | 1 |
Cohen, Allan S., Comp. | 1 |
Craig, Holly K. | 1 |
Drummond, Gordon B. | 1 |
Dunne, Michael P. | 1 |
Estes, Carole | 1 |
Estes, Gary D. | 1 |
Evans, C. | 1 |
Evans, Julia L. | 1 |
More ▼ |
Publication Type
Journal Articles | 9 |
Reports - Research | 6 |
Reports - Descriptive | 5 |
Guides - Non-Classroom | 2 |
Dissertations/Theses -… | 1 |
Information Analyses | 1 |
Reference Materials -… | 1 |
Speeches/Meeting Papers | 1 |
Tests/Questionnaires | 1 |
Education Level
Higher Education | 2 |
Elementary Secondary Education | 1 |
Postsecondary Education | 1 |
Audience
Policymakers | 1 |
Practitioners | 1 |
Researchers | 1 |
Location
Australia | 1 |
Colorado (Denver) | 1 |
Europe | 1 |
North Carolina (Charlotte) | 1 |
Pennsylvania (Pittsburgh) | 1 |
Tennessee (Memphis) | 1 |
United Kingdom | 1 |
United States | 1 |
Laws, Policies, & Programs
Elementary and Secondary… | 1 |
Assessments and Surveys
National Assessment of… | 1 |
What Works Clearinghouse Rating
Kelvin Terrell Pompey – ProQuest LLC, 2021
Many methods are used to measure interrater reliability for studies where each target receives ratings by a different set of judges. The purpose of this study is to explore the use of hierarchical modeling for estimating interrater reliability using the intraclass correlation coefficient. This study provides a description of how the ICC can be…
Descriptors: Interrater Reliability, Evaluation Methods, Test Reliability, Correlation
Raykov, Tenko; Marcoulides, George A. – Educational and Psychological Measurement, 2015
A latent variable modeling approach for scale reliability evaluation in heterogeneous populations is discussed. The method can be used for point and interval estimation of reliability of multicomponent measuring instruments in populations representing mixtures of an unknown number of latent classes or subpopulations. The procedure is helpful also…
Descriptors: Test Reliability, Evaluation Methods, Measurement Techniques, Computation
Evans, C.; Kandiko Howson, C.; Forsythe, A. – Higher Education Pedagogies, 2018
Internationally, the political appetite for educational measurement capable of capturing a metric of value for money and effectiveness has momentum. While most would agree with the need to assess costs relevant to quality to help support better governmental policy decisions about public spending, poorly understood measurement comes with unintended…
Descriptors: Higher Education, Achievement Gains, Political Issues, Quality Assurance
Phillips, Gary W. – Applied Measurement in Education, 2015
This article proposes that sampling design effects have potentially huge unrecognized impacts on the results reported by large-scale district and state assessments in the United States. When design effects are unrecognized and unaccounted for they lead to underestimating the sampling error in item and test statistics. Underestimating the sampling…
Descriptors: State Programs, Sampling, Research Design, Error of Measurement
Calmettes, Guillaume; Drummond, Gordon B.; Vowler, Sarah L. – Advances in Physiology Education, 2012
A jack knife is a pocket knife that is put to many tasks, because it's ready to hand. Often there could be a better tool for the job, such as a screwdriver, a scraper, or a can-opener, but these are not usually pocket items. In statistical terms, the expression implies making do with what's available. Another simile, of an extreme situation, is…
Descriptors: Statistical Analysis, Computation, Population Distribution, Evaluation Methods
Bill & Melinda Gates Foundation, 2012
No one has a bigger stake in teaching effectiveness than students. Nor are there any better experts on how teaching is experienced by its intended beneficiaries. Only recently have many policymakers and practitioners come to recognize that--when asked the right questions, in the right ways--students can be an important source of information on the…
Descriptors: Student Surveys, Student Attitudes, Feedback (Response), Test Validity
Runyan, Desmond K.; Dunne, Michael P.; Zolotor, Adam J. – Child Abuse & Neglect: The International Journal, 2009
The "World Report on Children and Violence", (Pinheiro, 2006) was produced at the request of the UN Secretary General and the UN General Assembly. This report recommended improvement in research on child abuse. ISPCAN representatives took this charge and developed 3 new instruments. We describe this background and introduce three new measures…
Descriptors: Child Abuse, Screening Tests, Child Welfare, Test Construction

Estes, Carole; Estes, Gary D. – 1980
Multiple matrix sampling is a sampling design in which both test items and examinees are randomly sampled from their respective populations. This study was designed to develop and assess a method for computing an estimate of a correlation coefficient when a multiple matrix sampling design is used. The examinee populations included 212 third-grade…
Descriptors: Correlation, Elementary Secondary Education, Evaluation Methods, Grade 3
Gottfredson, Stephen D.; Moriarty, Laura J. – Crime & Delinquency, 2006
Statistically based risk assessment devices are widely used in criminal justice settings. Their promise remains largely unfulfilled, however, because assumptions and premises requisite to their development and application are routinely ignored and/or violated. This article provides a brief review of the most salient of these assumptions and…
Descriptors: Risk, Justice, Criminals, Crime

Evans, Julia L.; Craig, Holly K. – Journal of Speech and Hearing Research, 1992
Analysis of spontaneous language samples of 10 children (ages 8-9) with specific language impairments found that interviews were a reliable, valid, and efficient assessment context, eliciting the same profile of behaviors as a freeplay context without altering diagnostic classifications. (Author/JDD)
Descriptors: Data Collection, Discourse Analysis, Educational Diagnosis, Efficiency
Meyer, Kevin D.; Foster, Jeff L. – International Journal of Testing, 2008
With the increasing globalization of human resources practices, a commensurate increase in demand has occurred for multi-language ("global") personality norms for use in selection and development efforts. The combination of data from multiple translations of a personality assessment into a single norm engenders error from multiple sources. This…
Descriptors: Global Approach, Cultural Differences, Norms, Human Resources
Womer, Frank B. – 1971
This symposium deals with recent issues in the development of the National Assessment model. General goals are outlined and the following topics are discussed: "Objectives and Exercises" (Jack C. Merwin); "Sampling" (A. Finkner); and "Data Analysis" (John Milholland). (CK) Aspect of National Assessment (NAEP) dealt…
Descriptors: Academic Achievement, Conferences, Data Analysis, Demonstration Programs

Brown, James Dean – Annual Review of Applied Linguistics, 1995
Discusses the evaluation of second and foreign language programs, focusing on whether such evaluations should be summative or formative; use outside experts or program staff; emphasize qualitative or quantitative data; and concentrate on the process or the product. An annotated bibliography discusses six important works in the field. (78…
Descriptors: Annotated Bibliographies, Evaluation Methods, Evaluation Problems, Politics of Education
Bruininks, Robert H.; And Others – 1989
This paper examines issues in designing post-school follow-up studies in special education. The examination focuses on survey research techniques, which are widely used in the investigation of post-school adjustment of former students with handicaps. In special education, survey research studies are used commonly to address many important…
Descriptors: Data Collection, Elementary Secondary Education, Evaluation Methods, Followup Studies
Morris, Lynn Lyons; Fitz-Gibbon, Carol Taylor – 1978
Measuring attainment of the program's objectives and describing the program's implementation are listed as two of the evaluator's major responsibilities. The description should include an explanation of the context in which the program was initiated, as well as the component materials and activities. This booklet has three purposes: (1) suggesting…
Descriptors: Data Analysis, Data Collection, Educational Assessment, Evaluation Methods
Previous Page | Next Page ยป
Pages: 1 | 2