Publication Date
| In 2026 | 0 |
| Since 2025 | 220 |
| Since 2022 (last 5 years) | 1089 |
| Since 2017 (last 10 years) | 2599 |
| Since 2007 (last 20 years) | 4960 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 653 |
| Teachers | 563 |
| Researchers | 250 |
| Students | 201 |
| Administrators | 81 |
| Policymakers | 22 |
| Parents | 17 |
| Counselors | 8 |
| Community | 7 |
| Support Staff | 3 |
| Media Staff | 1 |
| More ▼ | |
Location
| Turkey | 226 |
| Canada | 223 |
| Australia | 155 |
| Germany | 116 |
| United States | 99 |
| China | 90 |
| Florida | 86 |
| Indonesia | 82 |
| Taiwan | 78 |
| United Kingdom | 73 |
| California | 66 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 4 |
| Meets WWC Standards with or without Reservations | 4 |
| Does not meet standards | 1 |
Khoshaim, Heba Bakr; Rashid, Saima – International Journal of Instruction, 2016
Assessment is one of the vital steps in the teaching and learning process. The reported action research examines the effectiveness of an assessment process and inspects the validity of exam questions used for the assessment purpose. The instructors of a college-level mathematics course studied questions used in the final exams during the academic…
Descriptors: Item Analysis, Test Items, Mathematics Tests, Difficulty Level
Benítez, Isabel; Padilla, José-Luis; Hidalgo Montesinos, María Dolores; Sireci, Stephen G. – Applied Measurement in Education, 2016
Analysis of differential item functioning (DIF) is often used to determine if cross-lingual assessments are equivalent across languages. However, evidence on the causes of cross-lingual DIF is still evasive. Expert appraisal is a qualitative method useful for obtaining detailed information about problematic elements in the different linguistic…
Descriptors: Test Bias, Mixed Methods Research, Questionnaires, International Assessment
Bochner, Joseph H.; Samar, Vincent J.; Hauser, Peter C.; Garrison, Wayne M.; Searls, J. Matt; Sanders, Cynthia A. – Language Testing, 2016
American Sign Language (ASL) is one of the most commonly taught languages in North America. Yet, few assessment instruments for ASL proficiency have been developed, none of which have adequately demonstrated validity. We propose that the American Sign Language Discrimination Test (ASL-DT), a recently developed measure of learners' ability to…
Descriptors: American Sign Language, Test Validity, Language Proficiency, Phonological Awareness
Zembat, Rengin; Turasli, Nalan Kuru; Güven, Gülçin; Sezer, Türker; Aksin, Ezgi; Yilmaz, Elif; Bayindir, Dilan – Journal of Education and Training Studies, 2016
The aim of this study is to investigate the reliability and validity of the DeMoulin Self-Concept Developmental Scale for 36-72 month old children. In addition, it has been attempted to examine the effects of age and gender variables on the self-concept of children. The study is in survey method. The sample consists of 810 children who attend…
Descriptors: Test Validity, Test Reliability, Self Concept Measures, Age Differences
Guo, Hongwen; Zu, Jiyun; Kyllonen, Patrick; Schmitt, Neal – ETS Research Report Series, 2016
In this report, systematic applications of statistical and psychometric methods are used to develop and evaluate scoring rules in terms of test reliability. Data collected from a situational judgment test are used to facilitate the comparison. For a well-developed item with appropriate keys (i.e., the correct answers), agreement among various…
Descriptors: Scoring, Test Reliability, Statistical Analysis, Psychometrics
Andersson, Björn – Journal of Educational Measurement, 2016
In observed-score equipercentile equating, the goal is to make scores on two scales or tests measuring the same construct comparable by matching the percentiles of the respective score distributions. If the tests consist of different items with multiple categories for each item, a suitable model for the responses is a polytomous item response…
Descriptors: Equated Scores, Item Response Theory, Error of Measurement, Tests
Seker, Hasan – Universal Journal of Educational Research, 2016
In the present study, some of the pre-service teachers' criticisms against their exams were investigated. Moreover, as an alternative, to what extent philosophical, romantic and mythic questions could be used was also looked at. The study group consists of 117 pre-service teachers from the classroom teacher education. In the study, it was…
Descriptors: Test Items, Test Content, Preservice Teachers, Criticism
Ojerinde, Dibu; Popoola, Omokunmi; Onyeneho, Patrick; Egberongbe, Aminat – Perspectives in Education, 2016
Statistical procedure used in adjusting test score difficulties on test forms is known as "equating". Equating makes it possible for various test forms to be used interchangeably. In terms of where the equating method fits in the assessment cycle, there are pre-equating and post-equating methods. The major benefits of pre-equating, when…
Descriptors: Measurement, Comparative Analysis, High Stakes Tests, Pretests Posttests
Haugen, Heidi; Stevenson, Anne; Meyer, Rebecca L. – Journal of Extension, 2016
This article explores how a one-time training designed to support learning transfer affected 4-H volunteers' comfort levels with the training content and how comfort levels, in turn, affected the volunteers' application of tools and techniques learned during the training. Results of a follow-up survey suggest that the training participants…
Descriptors: Active Learning, Inquiry, Volunteer Training, Participation
Qian, Hong; Staniewska, Dorota; Reckase, Mark; Woo, Ada – Educational Measurement: Issues and Practice, 2016
This article addresses the issue of how to detect item preknowledge using item response time data in two computer-based large-scale licensure examinations. Item preknowledge is indicated by an unexpected short response time and a correct response. Two samples were used for detecting item preknowledge for each examination. The first sample was from…
Descriptors: Reaction Time, Licensing Examinations (Professions), Computer Assisted Testing, Prior Learning
Soysal, Sümeyra; Arikan, Çigdem Akin; Inal, Hatice – Online Submission, 2016
This study aims to investigate the effect of methods to deal with missing data on item difficulty estimations under different test length conditions and sampling sizes. In this line, a data set including 10, 20 and 40 items with 100 and 5000 sampling size was prepared. Deletion process was applied at the rates of 5%, 10% and 20% under conditions…
Descriptors: Research Problems, Data Analysis, Item Response Theory, Test Items
Klotz, Viola Katharina; Winther, Esther; Marx, Christian; Goeze, Annika; Fischer, Christoph; Sangmeister, Julia – AERA Online Paper Repository, 2016
Apprentices' performance after vocational educational training (VET) is commonly attributed to more or less effective training. This implies the assumption that learning is significantly affected by vocational instruction (instructional sensitivity). However, the question has not been investigated yet if VETs are effective, i.e., that they foster…
Descriptors: Vocational Education, Apprenticeships, Performance Based Assessment, Item Analysis
Ma, Yijun; Agnihotri, Lalitha; Baker, Ryan; Mojarad, Shirin – International Educational Data Mining Society, 2016
Time has become a standard feature used in EDM models, and is used in models of meta-cognitive strategies to models of disengagement. Most of these models consider whether a student action is "too fast" or "too slow". However, an open question remains on how we define and select these cut-offs. Moreover, it is not clear that…
Descriptors: Difficulty Level, Reaction Time, Student Reaction, Academic Ability
Morrison, Kristin M.; Schwartz, Robert Andrew – AERA Online Paper Repository, 2016
Certain item features can be used to explain the difference between the difficulties of items. These item features can relate to the steps necessary to solve the problems or ways in which a increased understanding of material is acquired. This study will examine the relationship between item difficulty and the process skills associated with these…
Descriptors: Student Evaluation, Alternative Assessment, Standardized Tests, Test Items
Krstic, Ksenija; Šoškic, Andela; Kovic, Vanja; Holmqvist, Kenneth – European Journal of Psychology of Education, 2018
PISA results show that a considerable number of 15-year-old pupils after 8 to 10 years of schooling have a low level of functional reading literacy, as defined in the PISA framework. While PISA results help identify the level of reading competency, they do not reveal what might be the reasons why some students fail to solve the tasks. One way to…
Descriptors: Eye Movements, Reading Processes, Achievement Tests, Foreign Countries

Peer reviewed
Direct link
