Publication Date
| In 2026 | 0 |
| Since 2025 | 451 |
| Since 2022 (last 5 years) | 2409 |
| Since 2017 (last 10 years) | 6589 |
| Since 2007 (last 20 years) | 17993 |
Audience
| Practitioners | 2140 |
| Teachers | 1216 |
| Researchers | 1054 |
| Administrators | 483 |
| Policymakers | 453 |
| Students | 176 |
| Parents | 147 |
| Counselors | 100 |
| Community | 61 |
| Media Staff | 17 |
| Support Staff | 15 |
Location
| Canada | 784 |
| Australia | 690 |
| United States | 582 |
| California | 569 |
| United Kingdom | 479 |
| Texas | 413 |
| Florida | 403 |
| Germany | 391 |
| New York | 378 |
| United Kingdom (England) | 369 |
| China | 361 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 17 |
| Meets WWC Standards with or without Reservations | 22 |
| Does not meet standards | 21 |
Akbay, Tuncer; Akbay, Lokman; Erol, Osman – Malaysian Online Journal of Educational Technology, 2021
Integration of e-learning and computerized assessments into many levels of educational programs has been increasing as digital technology progresses. Due to a handful of prominent advantages of computer-based-testing (CBT), a rapid transition in test administration mode from paper-based-testing (PBT) to CBT has emerged. Recently, many national and…
Descriptors: Computer Assisted Testing, Testing, High Stakes Tests, International Assessment
Alemi, Minoo; Miri, Mola; Mozafarnezhad, Alemeh – International Journal of Language Testing, 2019
Although group dynamic assessment (GDA) has been gaining attention over the recent decade, its applicability in online context has been left rather underexplored. Hence, the current study examined the effects of GDA on developing EFL learners' written grammatical accuracy in the online context of 'Telegram'. To this aim, 60 Iranian EFL students…
Descriptors: Alternative Assessment, Group Testing, English (Second Language), Second Language Learning
Jonson, Jessica L.; Trantham, Pamela; Usher-Tate, Betty Jean – Educational Measurement: Issues and Practice, 2019
One of the substantive changes in the 2014 Standards for Educational and Psychological Testing was the elevation of fairness in testing as a foundational element of practice in addition to validity and reliability. Previous research indicates that testing practices often do not align with professional standards and guidelines. Therefore, to raise…
Descriptors: Culture Fair Tests, Test Validity, Test Reliability, Intelligence Tests
Buek, Katharine; Barghaus, Katherine; Fantuzzo, John – AERA Online Paper Repository, 2017
Quality assessment is an essential component of preschool education. The Standards for Educational and Psychological Testing provide benchmarks for evaluating the validity of inferences made from assessment or test results (AERA, APA & NCME, 2014). According to the Standards, test developers should investigate and document information related…
Descriptors: Preschool Education, Test Validity, Preschool Children, Standards
Florida Department of Education, 2020
Throughout the years, career and technical education (CTE) has focused on teaching technical competencies and related academic skills that prepare students to enter and advance in a variety of career fields and postsecondary education. Program and course descriptions are reviewed on a regular basis to ensure that the technical and academic skills…
Descriptors: Vocational Education, Basic Skills, Student Evaluation, Evaluation Methods
Jaeger, Antônio; Queiroz, Morgana C.; Selmeczy, Diana; Dobbins, Ian G. – Journal of Experimental Psychology: Learning, Memory, and Cognition, 2020
During recognition memory decisions, external hints or cues alter the accuracy and confidence of correct rejections (valid > uncued > invalid). In contrast, although hits show analogous accuracy effects, hit confidence remains largely unaffected by cue validity. Prior research suggested this confidence validity dissociation (CVD) may depend…
Descriptors: Recognition (Psychology), Cues, Accuracy, Validity
Nikolakopoulos, Stavros – Research Synthesis Methods, 2020
In narrative synthesis of evidence, it can be the case that the only quantitative measures available concerning the efficacy of an intervention is the direction of the effect, that is, whether it is positive or negative. In such situations, the sign test has been proposed in the literature and in recent Cochrane guidelines as a way to test whether…
Descriptors: Synthesis, Evidence, Statistical Analysis, Nonparametric Statistics
Langenfeld, Thomas – Educational Measurement: Issues and Practice, 2020
The COVID-19 pandemic has accelerated the shift toward online learning solutions necessitating the need for developing online assessment solutions. Vendors offer online assessment delivery systems with varying security levels designed to minimize unauthorized behaviors. Combating cheating and securing assessment content, however, is not solely the…
Descriptors: Computer Assisted Testing, Justice, COVID-19, Pandemics
Pugh, Debra; De Champlain, André; Gierl, Mark; Lai, Hollis; Touchie, Claire – Research and Practice in Technology Enhanced Learning, 2020
The purpose of this study was to compare the quality of multiple choice questions (MCQs) developed using automated item generation (AIG) versus traditional methods, as judged by a panel of experts. The quality of MCQs developed using two methods (i.e., AIG or traditional) was evaluated by a panel of content experts in a blinded study. Participants…
Descriptors: Computer Assisted Testing, Test Construction, Multiple Choice Tests, Test Items
Bardovi-Harlig, Kathleen; Comajoan-Colomé, Llorenç – Studies in Second Language Acquisition, 2020
Twenty years ago, a state-of-the-art review in "SSLA" marked the coming of age of the study of temporality in second language acquisition. This was followed by three monographs on tense and aspect the next year. This article presents a state-of-the-scholarship review of the last 20 years of research addressing the aspect hypothesis (AH)…
Descriptors: Second Language Learning, Language Acquisition, Morphology (Languages), Hypothesis Testing
Luecht, Richard M. – Educational Measurement: Issues and Practice, 2020
The educational testing landscape is changing in many significant ways as evidence-based, principled assessment design (PAD) approaches are formally adopted. This article discusses the challenges and presents some score scale- and task-focused strategies for developing useful performance-level descriptors (PLDs) under a PAD approach. Details of…
Descriptors: Test Construction, Academic Standards, Science Education, Educational Testing
Duncan, Helen; Purcell, Catherine – Journal of Further and Higher Education, 2020
Timed examinations continue to be a common educational assessment method employed to evaluate a student's knowledge, ability and skills in their subject area. Extra time is the most common adjustment that students with specific learning difficulties (SpLD) are granted in these exams. This adjustment aims to provide students with SpLD parity with…
Descriptors: Learning Disabilities, Students with Disabilities, Testing Accommodations, Time
Skaggs, Gary; Hein, Serge F.; Wilkins, Jesse L. M. – Educational Measurement: Issues and Practice, 2020
In test-centered standard-setting methods, borderline performance can be represented by many different profiles of strengths and weaknesses. As a result, asking panelists to estimate item or test performance for a hypothetical group study of borderline examinees, or a typical borderline examinee, may be an extremely difficult task and one that can…
Descriptors: Standard Setting (Scoring), Cutting Scores, Testing Problems, Profiles
Metsämuuronen, Jari – International Journal of Educational Methodology, 2020
Pearson product-moment correlation coefficient between item g and test score X, known as item-test or item-total correlation ("Rit"), and item-rest correlation ("Rir") are two of the most used classical estimators for item discrimination power (IDP). Both "Rit" and "Rir" underestimate IDP caused by the…
Descriptors: Correlation, Test Items, Scores, Difficulty Level
Erdemir, Mustafa; Akyuz, Halil Ibrahim – Journal on School Educational Technology, 2020
The purpose of this study is to reduce ethics violations such as tricking and cheating that may occur in the online assessment of undergraduate level Physics-II (Electricity) course subjects. The study is significant for the reliable and ethical evaluation of the Internet and computer-based educational process. Thirty-eight pre-service teachers…
Descriptors: Test Reliability, Ethics, Cheating, Undergraduate Students

