Publication Date
| In 2026 | 0 |
| Since 2025 | 197 |
| Since 2022 (last 5 years) | 1067 |
| Since 2017 (last 10 years) | 2577 |
| Since 2007 (last 20 years) | 4938 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 653 |
| Teachers | 563 |
| Researchers | 250 |
| Students | 201 |
| Administrators | 81 |
| Policymakers | 22 |
| Parents | 17 |
| Counselors | 8 |
| Community | 7 |
| Support Staff | 3 |
| Media Staff | 1 |
| More ▼ | |
Location
| Turkey | 225 |
| Canada | 223 |
| Australia | 155 |
| Germany | 116 |
| United States | 99 |
| China | 90 |
| Florida | 86 |
| Indonesia | 82 |
| Taiwan | 78 |
| United Kingdom | 73 |
| California | 65 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 4 |
| Meets WWC Standards with or without Reservations | 4 |
| Does not meet standards | 1 |
Zhu, Mengxiao; Shu, Zhan; von Davier, Alina A. – Journal of Educational Measurement, 2016
New technology enables interactive and adaptive scenario-based tasks (SBTs) to be adopted in educational measurement. At the same time, it is a challenging problem to build appropriate psychometric models to analyze data collected from these tasks, due to the complexity of the data. This study focuses on process data collected from SBTs. We…
Descriptors: Measurement, Data Collection, National Competency Tests, Scoring Rubrics
Çokluk, Ömay; Gül, Emrah; Dogan-Gül, Çilem – Educational Sciences: Theory and Practice, 2016
The study aims to examine whether differential item function is displayed in three different test forms that have item orders of random and sequential versions (easy-to-hard and hard-to-easy), based on Classical Test Theory (CTT) and Item Response Theory (IRT) methods and bearing item difficulty levels in mind. In the correlational research, the…
Descriptors: Test Bias, Test Items, Difficulty Level, Test Theory
Gliniecka, Martyna – EURASIA Journal of Mathematics, Science & Technology Education, 2016
Process of communication can be challenging. At first participants must standardize their concepts of things to hold them close enough to others' concepts, then it's crucial to use appropriate expressions to verbalize those concepts to ensure the mutual understanding. Therefore, it can be problematic when cognitive constructs are hard to…
Descriptors: Creativity, Creative Thinking, Questionnaires, Questioning Techniques
Andrich, David; Marais, Ida; Humphry, Stephen Mark – Educational and Psychological Measurement, 2016
Recent research has shown how the statistical bias in Rasch model difficulty estimates induced by guessing in multiple-choice items can be eliminated. Using vertical scaling of a high-profile national reading test, it is shown that the dominant effect of removing such bias is a nonlinear change in the unit of scale across the continuum. The…
Descriptors: Guessing (Tests), Statistical Bias, Item Response Theory, Multiple Choice Tests
Guskey, Thomas R. – Journal of Staff Development, 2016
Effective professional learning evaluation requires consideration of five critical stages or levels of information. These five levels, which are presented in this article, represent an adaptation of an evaluation model developed by Kirkpatrick (1959, 1998) for judging the value of supervisory training programs in business and industry.…
Descriptors: Hierarchical Linear Modeling, Outcomes of Education, Supervisory Training, Faculty Development
Smith, Mike U.; Snyder, Scott W.; Devereaux, Randolph S. – Journal of Research in Science Teaching, 2016
The present study reports the development of a brief, quantitative, web-based, psychometrically sound measure--the Generalized Acceptance of EvolutioN Evaluation (GAENE, pronounced "gene") in a format that is useful in large and small groups, in research, and in classroom settings. The measure was designed to measure only evolution…
Descriptors: Test Construction, Evolution, Student Attitudes, Test Items
Dag, Funda – Educational Sciences: Theory and Practice, 2016
The purpose of this study is to determine the language equivalence and the validity and reliability of the Turkish version of the "Web-Based Learning Platform Evaluation Scale" ("Web Tabanli Ögrenme Ortami Degerlendirme Ölçegi" [WTÖODÖ]) used in the selection and evaluation of web-based learning environments. Within this scope,…
Descriptors: Foreign Countries, Web Based Instruction, Electronic Learning, Internet
Dennis, Alan R.; Abaci, Serdar; Morrone, Anastasia S.; Plaskoff, Joshua; McNamara, Kelly O. – Journal of Computing in Higher Education, 2016
With additional features and increasing cost advantages, e-textbooks are becoming a viable alternative to paper textbooks. One important feature offered by enhanced e-textbooks (e-textbooks with interactive functionality) is the ability for instructors to annotate passages with additional insights. This paper describes a pilot study that examines…
Descriptors: Textbooks, Electronic Publishing, Pilot Projects, Multiple Choice Tests
Gierl, Mark J.; Lai, Hollis – Educational Measurement: Issues and Practice, 2016
Testing organization needs large numbers of high-quality items due to the proliferation of alternative test administration methods and modern test designs. But the current demand for items far exceeds the supply. Test items, as they are currently written, evoke a process that is both time-consuming and expensive because each item is written,…
Descriptors: Test Items, Test Construction, Psychometrics, Models
Bridgeman, Brent – Educational Measurement: Issues and Practice, 2016
Scores on essay-based assessments that are part of standardized admissions tests are typically given relatively little weight in admissions decisions compared to the weight given to scores from multiple-choice assessments. Evidence is presented to suggest that more weight should be given to these assessments. The reliability of the writing scores…
Descriptors: Multiple Choice Tests, Scores, Standardized Tests, Comparative Analysis
Couchman, Justin J.; Miller, Noelle E.; Zmuda, Shaun J.; Feather, Kathryn; Schwartzmeyer, Tina – Metacognition and Learning, 2016
Students often gauge their performance before and after an exam, usually in the form of rough grade estimates or general feelings. Are these estimates accurate? Should they form the basis for decisions about study time, test-taking strategies, revisions, subject mastery, or even general competence? In two studies, undergraduates took a real…
Descriptors: Higher Education, College Students, Tests, Metacognition
Rutkowski, Leslie; Rutkowski, David; Zhou, Yan – International Journal of Testing, 2016
Using an empirically-based simulation study, we show that typically used methods of choosing an item calibration sample have significant impacts on achievement bias and system rankings. We examine whether recent PISA accommodations, especially for lower performing participants, can mitigate some of this bias. Our findings indicate that standard…
Descriptors: Simulation, International Programs, Adolescents, Student Evaluation
Vadivu, P. Pandia; Sridhar, R.; Kumar, B. Mohan – Online Submission, 2016
The main objective of the present pilot study was to construct and standardize a Scientific Aptitude Test (SAT) for the secondary school science students (Grade 9 and 10). The test items (30 questions) were prepared by the researchers based on the five important aspects in scientific thinking such as Analogy, Scientific reasoning, Numerical…
Descriptors: Secondary School Students, Pilot Projects, Science Tests, Grade 9
O'Keeffe, Lisa – Mathematics Education Research Group of Australasia, 2016
Language is frequently discussed as barrier to mathematics word problems. Hence this paper presents the initial findings of a linguistic analysis of numeracy skills test sample items. The theoretical perspective of multi-modal text analysis underpinned this study, in which data was extracted from the ten sample numeracy test items released by the…
Descriptors: Numeracy, Mathematics Skills, Test Items, Preservice Teachers
De Backer, Fauve; Baele, Judith; van Avermaet, Piet; Slembrouck, Stef – Language Assessment Quarterly, 2019
One of the greatest assessment challenges is providing fair and valid tests for multilingual pupils. Since language proficiency impacts their results on content tests, accommodations are suggested to help in solving the validity issues. In this mixed methods study on multilingual assessment, pupils were divided according to three testing…
Descriptors: Multilingualism, Testing Accommodations, Student Attitudes, Language Proficiency

Peer reviewed
Direct link
