Publication Date
| In 2026 | 0 |
| Since 2025 | 197 |
| Since 2022 (last 5 years) | 1067 |
| Since 2017 (last 10 years) | 2577 |
| Since 2007 (last 20 years) | 4938 |
Audience
| Practitioners | 653 |
| Teachers | 563 |
| Researchers | 250 |
| Students | 201 |
| Administrators | 81 |
| Policymakers | 22 |
| Parents | 17 |
| Counselors | 8 |
| Community | 7 |
| Support Staff | 3 |
| Media Staff | 1 |
Location
| Turkey | 225 |
| Canada | 223 |
| Australia | 155 |
| Germany | 116 |
| United States | 99 |
| China | 90 |
| Florida | 86 |
| Indonesia | 82 |
| Taiwan | 78 |
| United Kingdom | 73 |
| California | 65 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 4 |
| Meets WWC Standards with or without Reservations | 4 |
| Does not meet standards | 1 |
Yasuno, Fumiko; Nishimura, Keiichi; Negami, Seiya; Namikawa, Yukihiko – International Journal for Technology in Mathematics Education, 2019
Our study is on developing mathematics items for Computer-Based Testing (CBT) using Tablet PC. These items are subject-based items using interactive dynamic objects. The purpose of this study is to obtain some suggestions for further tasks drawing on field test results for developed items. First, we clarified the role of the interactive dynamic…
Descriptors: Mathematics Instruction, Mathematics Tests, Test Items, Computer Assisted Testing
Nakata, Tatsuya; Suzuki, Yuichi – Studies in Second Language Acquisition, 2019
Although researchers argue that studying semantically related words simultaneously (semantic clustering) inhibits vocabulary acquisition, recent studies have yielded inconsistent results. This study examined the effects of semantic clustering while addressing the limitations of previous studies (e.g., confounding of semantic relatedness with other…
Descriptors: Semantics, Vocabulary Development, Interference (Language), Learning Processes
Allen, David – Language Assessment Quarterly, 2019
Cross-linguistic lexical similarity in the form of cognates and loanwords has been shown to positively impact second language learning and use, as well as performance on tests of lexical knowledge, even when the learners' languages differ in script. The present study utilizes Japanese cognate frequency, as an indication of cognate knowledge, to…
Descriptors: Foreign Countries, Language Tests, Vocabulary Skills, Second Language Learning
Sporre, Karin – Journal of Curriculum Studies, 2019
The purpose of this study is to describe, critically analyse and discuss the Swedish system of assessing ethics education in compulsory school through national tests. The publicly available tests from 2013 for grades six and nine have been studied as have the assessment instructions for teachers. Staff responsible for the test construction have…
Descriptors: Ethical Instruction, Compulsory Education, Foreign Countries, National Competency Tests
Toker, Deniz – TESL-EJ, 2019
The central purpose of this paper is to examine validity problems arising from the multiple-choice items and technical passages in the Test of English as a Foreign Language Internet-based Test (TOEFL iBT) reading section, primarily concentrating on construct-irrelevant variance (Messick, 1989). My personal TOEFL iBT experience, along with my…
Descriptors: English (Second Language), Language Tests, Second Language Learning, Computer Assisted Testing
Fukuzawa, Sherry; deBraga, Michael – Journal of Curriculum and Teaching, 2019
Graded Response Method (GRM) is an alternative to multiple-choice testing where students rank options according to their relevance to the question. GRM requires discrimination and inference between statements and is a cost-effective critical thinking assessment in large courses where open-ended answers are not feasible. This study examined…
Descriptors: Alternative Assessment, Multiple Choice Tests, Test Items, Test Format
Beck, Christina; Nerdel, Claudia – Contributions from Science Education Research, 2019
Dealing with multiple external representations (MERs) in science education is the key to students' understanding of science communication and becoming scientifically literate. It is generally accepted that learning scientific concepts, processes, and principles requires understanding and interacting with MERs. Science can be understood as a…
Descriptors: Biology, Science Instruction, Models, Visual Aids
Lorenceau, Adrien; Marec, Camille; Mostafa, Tarek – OECD Publishing, 2019
This paper explains the rationale for updating the OECD Programme for International Student Assessment (PISA) 2021 questionnaire on Information and Communication Technology (ICT) and shows how it covers policy topics of current relevance. After presenting key findings based on previous ICT-related PISA data, the paper provides a summary of the…
Descriptors: Questionnaires, Information Technology, Achievement Tests, Secondary School Students
Sarah Lindstrom Johnson; Ray E. Reichenberg; Kathan Shukla; Tracy E. Waasdorp; Catherine P. Bradshaw – Grantee Submission, 2019
The United States government has become increasingly focused on school climate, as recently evidenced by its inclusion as an accountability indicator in the "Every Student Succeeds Act". Yet, there remains considerable variability in both conceptualizing and measuring school climate. To better inform the research and practice related to…
Descriptors: Item Response Theory, Educational Environment, Accountability, Educational Legislation
Pishghadam, Reza; Baghaei, Purya; Seyednozadi, Zahra – International Journal of Testing, 2017
This article attempts to present emotioncy as a potential source of test bias to inform the analysis of test item performance. Emotioncy is defined as a hierarchy, ranging from "exvolvement" (auditory, visual, and kinesthetic) to "involvement" (inner and arch), to emphasize the emotions evoked by the senses. This study…
Descriptors: Test Bias, Item Response Theory, Test Items, Psychological Patterns
Loukina, Anastassia; Zechner, Klaus; Yoon, Su-Youn; Zhang, Mo; Tao, Jidong; Wang, Xinhao; Lee, Chong Min; Mulholland, Matthew – ETS Research Report Series, 2017
This report presents an overview of the "SpeechRater" automated scoring engine model building and evaluation process for several item types with a focus on a low-English-proficiency test-taker population. We discuss each stage of speech scoring, including automatic speech recognition, filtering models for nonscorable responses, and…
Descriptors: Automation, Scoring, Speech Tests, Test Items
Hohensinn, Christine; Baghaei, Purya – Psicologica: International Journal of Methodology and Experimental Psychology, 2017
In large-scale multiple-choice (MC) tests, alternate forms of a test may be developed to prevent cheating by changing the order of items or by changing the position of the response options. The assumption is that since the content of the test forms is the same, the order of items or the positions of the response options do not have any effect on…
Descriptors: Multiple Choice Tests, Test Format, Test Items, Difficulty Level
Wise, Steven L. – Educational Measurement: Issues and Practice, 2017
The rise of computer-based testing has brought with it the capability to measure more aspects of a test event than simply the answers selected or constructed by the test taker. One behavior that has drawn much research interest is the time test takers spend responding to individual multiple-choice items. In particular, very short response…
Descriptors: Guessing (Tests), Multiple Choice Tests, Test Items, Reaction Time
Carney, Michele B.; Cavey, Laurie; Hughes, Gwyneth – Elementary School Journal, 2017
This article illustrates an argument-based approach to presenting validity evidence for assessment items intended to measure a complex construct. Our focus is developing a measure of teachers' ability to analyze and respond to students' mathematical thinking for the purpose of program evaluation. Our validity argument consists of claims addressing…
Descriptors: Mathematics Instruction, Mathematical Logic, Thinking Skills, Evidence
Foss, Donald J.; Pirozzolo, Joseph W. – Journal of Educational Psychology, 2017
We carried out 4 semester-long studies of student performance in a college research methods course (total N = 588). Two sections of it were taught each semester with systematic and controlled differences between them. Key manipulations were repeated (with some variation) across the 4 terms, allowing assessment of replicability of effects.…
Descriptors: Undergraduate Students, Student Evaluation, Testing, Incidence

