Publication Date
| In 2026 | 3 |
| Since 2025 | 691 |
| Since 2022 (last 5 years) | 4073 |
| Since 2017 (last 10 years) | 11864 |
| Since 2007 (last 20 years) | 29261 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Policymakers | 492 |
| Practitioners | 488 |
| Researchers | 349 |
| Teachers | 336 |
| Administrators | 189 |
| Parents | 68 |
| Community | 67 |
| Students | 45 |
| Counselors | 33 |
| Media Staff | 7 |
| Support Staff | 4 |
| More ▼ | |
Location
| Turkey | 1166 |
| Texas | 791 |
| California | 740 |
| Florida | 603 |
| United States | 572 |
| Canada | 516 |
| Australia | 504 |
| China | 490 |
| North Carolina | 441 |
| New York | 384 |
| United Kingdom | 381 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 65 |
| Meets WWC Standards with or without Reservations | 112 |
| Does not meet standards | 116 |
Pannell, Summer; White, Lisa; McBrayer, Juliann Sergi – School Leadership Review, 2018
The impact school leaders have on student achievement is prominent in the national conversation regarding educational reform. Perhaps, one of the most highly debated topics is how to assess their impact, and recent legislation tasked every state with determining how to evaluate principal effectiveness. Any new or customized evaluation tool…
Descriptors: Principals, Self Efficacy, Administrator Evaluation, Feedback (Response)
Rispoli, Matthew; Hadley, Pamela A. – Journal of Speech, Language, and Hearing Research, 2018
Purpose: The purpose of this letter is to clarify the psycholinguistic underpinnings of the tense marker total and tense agreement productivity score and to extend the discussion of when composite diversity and productivity measures are best used. Conclusion: We encourage the use of composite diversity and productivity measures when assessing…
Descriptors: Psycholinguistics, Morphemes, Accuracy, Grammar
Looney, Marilyn A. – Measurement in Physical Education and Exercise Science, 2018
The purpose of this article was two-fold (1) provide an overview of the commonly reported and under-reported absolute agreement indices in the kinesiology literature for continuous data; and (2) present examples of these indices for hypothetical data along with recommendations for future use. It is recommended that three types of information be…
Descriptors: Interrater Reliability, Evaluation Methods, Kinetics, Indexes
Dorans, Neil J. – ETS Research Report Series, 2018
A distinction is made between scores as measures of a construct and predictions of a criterion or outcome variable. The interpretation attached to predictions of criteria, such as job performance or college grade point average (GPA), differs from that attached to scores that are measures of a construct, such as reading proficiency or knowledge…
Descriptors: Job Performance, Scores, Data Interpretation, Statistical Distributions
Akin Arikan, Çigdem; Gelbal, Selahattin – International Journal of Assessment Tools in Education, 2018
In this study, the equated score results of the kernel equating (KE) method compared with the results of traditional equating methods--equipercentile and linear equating and 9th grade 2009 ÖBBS Form B of Social Sciences and 2009 ÖBBS Form D of Social Sciences was used under an equivalent groups (EG) design. Study sample consists of 16.249 students…
Descriptors: Equated Scores, Methods, Foreign Countries, National Competency Tests
Wyse, Adam E. – Measurement: Interdisciplinary Research and Perspectives, 2018
A key part of determining cut-scores when performing Angoff standard setting is utilizing equating methods to place standard-setting ratings onto the scale used to report scores to examinees. This article describes three equating methods that can be employed to place Angoff ratings onto the scale used to report scores to examinees when applying…
Descriptors: Standard Setting (Scoring), Equated Scores, Probability, Regression (Statistics)
Gotch, Chad M.; Roduta Roberts, Mary – Educational Measurement: Issues and Practice, 2018
As the primary interface between test developers and multiple educational stakeholders, score reports are a critical component to the success (or failure) of any assessment program. The purpose of this review is to document recent research on individual-level score reporting to advance the research and practice of score reporting. We conducted a…
Descriptors: Scores, Models, Correlation, Stakeholders
Center on Standards and Assessments Implementation, 2018
Reliability is a measure of consistency. It is the degree to which student results are the same when they take the same test on different occasions, when different scorers score the same item or task, and when different but equivalent tests are taken at the same time or at different times. Reliability is about making sure that different test forms…
Descriptors: Test Reliability, Test Validity, Student Evaluation, Test Bias
Bernard, Trevor Marshall – ProQuest LLC, 2018
The purpose of this study was to identify perceptions of environmental changes that promote self-directed learning in the workplace by Human Resources Development (HRD) practitioners and to investigate possible differences of the dependent LPA score variables to independent variables of highest level of education achieved, race/ethnicity, age,…
Descriptors: Independent Study, Workplace Learning, Labor Force Development, Correlation
Mantecon, Jesus Gerardo Alvarado; Ghavidel, Hadi Abdi; Zouaq, Amal; Jovanovic, Jelena; McDonald, Jenny – International Educational Data Mining Society, 2018
The automatic evaluation of text-based assessment items, such as short answers or essays, is an open and important research challenge. In this paper, we compare several features for the classification of short open-ended responses to questions related to a large first-year health sciences course. These features include a) traditional n-gram…
Descriptors: Questioning Techniques, Comparative Analysis, Models, Semantics
Li, Tingxuan; Clase, Kari L.; Li, Weiling; Traynor, Anne – Journal of Baltic Science Education, 2019
This research is motivated by the perspective that when empirical studies and assessment frameworks inform each other, assessments can enrich science education and strengthen its connections to modern science. The research proposes a bioenergy competency assessment for science education. It uses an argument-based approach to validation. Multiple…
Descriptors: Energy, Achievement Tests, Science Tests, Test Validity
Beaujean, A. Alexander; Benson, Nicholas F. – Applied Measurement in Education, 2019
Charles Spearman and L. L. Thurstone were pioneers in the field of intelligence. They not only developed methods to assess and understand intelligence, but also developed theories about its structure and function. Methodologically, their approaches were not that distinct, but their theories of intelligence were philosophically very different --…
Descriptors: Psychologists, Intelligence Tests, Scores, Theories
Sheppard, Beth – ORTESOL Journal, 2019
In this research note, the author checks for correlations between different dimensions in an analytic rubric used for scoring discussion performance. Highly correlated dimensions can be cause for concern that the different aspects of performance are not well defined or not adequately observed. The author's analysis showed some weak to moderate…
Descriptors: Scoring Rubrics, Correlation, Bias, Student Evaluation
Cook, Emily E.; Turner, Sarah – AERA Open, 2019
When students with the capacity to succeed in a 4-year college do not take a college admission test, this represents a potential loss of opportunity for students and colleges alike. However, the costs of testing--both pecuniary and nonpecuniary--may exceed the benefits for students who lack the interest in or qualifications for college attendance.…
Descriptors: College Entrance Examinations, High School Graduates, Aptitude Tests, High Schools
Akin Arikan, Cigdem – Eurasian Journal of Educational Research, 2019
Problem Statement: Equating can be defined as a statistical process that allows modifying the differences between test forms with similar content and difficulty so that the scores obtained from these forms can be used interchangeably. In the literature, there are many equating methods, one of which is Kernel equating. Trends in International…
Descriptors: Equated Scores, Foreign Countries, Achievement Tests, International Assessment

Peer reviewed
Direct link
