Publication Date
| In 2026 | 0 |
| Since 2025 | 17 |
| Since 2022 (last 5 years) | 74 |
| Since 2017 (last 10 years) | 189 |
| Since 2007 (last 20 years) | 384 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 274 |
| Researchers | 122 |
| Teachers | 102 |
| Administrators | 63 |
| Counselors | 28 |
| Parents | 21 |
| Policymakers | 21 |
| Students | 15 |
| Community | 8 |
Location
| Canada | 45 |
| Australia | 33 |
| California | 33 |
| United Kingdom | 23 |
| United States | 20 |
| Pennsylvania | 18 |
| United Kingdom (England) | 17 |
| New York | 15 |
| Japan | 14 |
| Michigan | 14 |
| New Jersey | 12 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Nolan, Meaghan M.; Beran, Tanya; Hecker, Kent G. – Statistics Education Research Journal, 2012
Students with positive attitudes toward statistics are likely to show strong academic performance in statistics courses. Multiple surveys measuring students' attitudes toward statistics exist; however, a comparison of the validity and reliability of interpretations based on their scores is needed. A systematic review of relevant electronic…
Descriptors: Student Attitudes, Statistics, Attitude Measures, Student Surveys
Whittaker, Tiffany A.; Williams, Natasha J.; Dodd, Barbara G. – Educational Assessment, 2011
This study assessed the interpretability of scaled scores based on either number correct (NC) scoring for a paper-and-pencil test or one of two methods of scoring computer-based tests: an item pattern (IP) scoring method and a method based on equated NC scoring. The equated NC scoring method for computer-based tests was proposed as an alternative…
Descriptors: Computer Assisted Testing, Scoring, Test Interpretation, Equated Scores
Gandy, Sandra E. – Reading & Writing Quarterly, 2013
With the increasing amount of testing taking place in classrooms, teachers may question how appropriate those assessments are for the growing numbers of English language learners (ELLs) in the United States. One of the assessment options for classroom teachers is the informal reading inventory (IRI), which is the most frequently used assessment…
Descriptors: Informal Reading Inventories, English Language Learners, Student Evaluation, Standardized Tests
Buschang, Rebecca E.; Chung, Gregory K. W. K.; Delacruz, Girlie C.; Baker, Eva L. – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2012
The purpose of this study was to validate inferences about scores of one task designed to measure subject matter knowledge and three tasks designed to measure aspects of pedagogical content knowledge. Evidence for the validity of inferences was based on two expectations. First, if tasks were sensitive to expertise, we would find group differences.…
Descriptors: Validity, Measures (Individuals), Test Interpretation, Algebra
Pommerich, Mary – Educational Measurement: Issues and Practice, 2012
Neil Dorans has made a career of advocating for the examinee. He continues to do so in his NCME career award address, providing a thought-provoking commentary on some current trends in educational measurement that could potentially affect the integrity of test scores. Concerns expressed in the address call attention to a conundrum that faces…
Descriptors: Testing, Scores, Measurement, Test Construction
O'Reilly, Tenaha; Sabatini, John – ETS Research Report Series, 2013
This paper represents the third installment of the Reading for Understanding (RfU) assessment framework. This paper builds upon the two prior installments (Sabatini & O'Reilly, 2013; Sabatini, O'Reilly, & Deane, 2013) by discussing the role of performance moderators in the test design and how scenario-based assessment can be used as a tool…
Descriptors: Reading Comprehension, Reading Tests, Test Construction, Student Characteristics
Mori, Kazuo; Uchida, Akitoshi – Research in Education, 2012
Longitudinal change in the average Z scores for four groups of pupils sorted by quartiles was examined for its stability over three years. The data, collected from 1998 to 2009, was obtained from nine cohorts of Japanese junior high school pupils totaling 1,962 subjects. It showed illusionary declines among the mid-range pupils but improvements…
Descriptors: Foreign Countries, Junior High School Students, Cohort Analysis, Evaluation Problems
Al-Shara'H, Nayel Darweesh – Education, 2013
The study aimed at investigating Jordanian EFL teachers' self-reported frequencies of using the procedures of preparing, correcting, analyzing, interpreting an achievement test, and discussing its results with students. To achieve this, a 31-item questionnaire was used. The questionnaire was administered to 118 basic stage EFL teachers after…
Descriptors: Foreign Countries, English (Second Language), Second Language Instruction, Test Construction
Johnson, Sandra – Routledge, Taylor & Francis Group, 2011
"Assessing Learning in the Primary Classroom" is an accessible introduction to the concepts critical to a professional understanding of this vital aspect of a teacher's role. It comprehensively considers the principles underpinning effective assessment, the different forms it can take and the different purposes it serves, both within and beyond…
Descriptors: Student Evaluation, Elementary Education, Educational Assessment, Validity
Dorans, Neil J. – Educational Measurement: Issues and Practice, 2012
Views on testing--its purpose and uses and how its data are analyzed--are related to one's perspective on test takers. Test takers can be viewed as learners, examinees, or contestants. I briefly discuss the perspective of test takers as learners. I maintain that much of psychometrics views test takers as examinees. I discuss test takers as a…
Descriptors: Testing, Test Theory, Item Response Theory, Test Reliability
Dimoliatis, Ioannis D. K.; Jelastopulu, Eleni – Universal Journal of Educational Research, 2013
The surgical theatre educational environment measures STEEM, OREEM and mini-STEEM for students (student-STEEM) comprise an up to now disregarded systematic overestimation (OE) due to inaccurate percentage calculation. The aim of the present study was to investigate the magnitude of and suggest a correction for this systematic bias. After an…
Descriptors: Educational Environment, Scores, Grade Prediction, Academic Standards
Tuccitto, Daniel E.; Giacobbi, Peter R., Jr.; Leite, Walter L. – Educational and Psychological Measurement, 2010
This study tested five confirmatory factor analytic (CFA) models of the Positive Affect Negative Affect Schedule (PANAS) to provide validity evidence based on its internal structure. A sample of 223 club sport athletes indicated their emotions during the past week. Results revealed that an orthogonal two-factor CFA model, specifying error…
Descriptors: Factor Analysis, Models, Affective Measures, Validity
Yorke, Mantz; Orr, Susan; Blair, Bernadette – Studies in Higher Education, 2014
There has long been the suspicion amongst staff in Art & Design that the ratings given to their subject disciplines in the UK's National Student Survey are adversely affected by a combination of circumstances--a "perfect storm". The "perfect storm" proposition is tested by comparing ratings for Art & Design with those…
Descriptors: Student Surveys, National Surveys, Art Education, Design
Wolf, Raffaela; Zahner, Doris; Kostoris, Fiorella; Benjamin, Roger – Council for Aid to Education, 2014
The measurement of higher-order competencies within a tertiary education system across countries presents methodological challenges due to differences in educational systems, socio-economic factors, and perceptions as to which constructs should be assessed (Blömeke, Zlatkin-Troitschanskaia, Kuhn, & Fege, 2013). According to Hart Research…
Descriptors: Case Studies, International Assessment, Performance Based Assessment, Critical Thinking
Choi, Ick Kyu – ProQuest LLC, 2013
At the University of California, Los Angeles, the Test of Oral Proficiency (TOP), an internally developed oral proficiency test, is administered to international teaching assistant (ITA) candidates to ensure an appropriate level of academic oral English proficiency. Test taker performances are rated live by two raters according to four subscales.…
Descriptors: Screening Tests, Profiles, Oral Language, English

Peer reviewed
Direct link
