Publication Date
| In 2026 | 0 |
| Since 2025 | 8 |
| Since 2022 (last 5 years) | 44 |
| Since 2017 (last 10 years) | 110 |
| Since 2007 (last 20 years) | 202 |
Descriptor
| Test Construction | 734 |
| Test Reliability | 734 |
| Test Validity | 513 |
| Testing | 236 |
| Testing Problems | 156 |
| Computer Assisted Testing | 142 |
| Test Items | 120 |
| Higher Education | 115 |
| Foreign Countries | 102 |
| Test Interpretation | 102 |
| Scoring | 97 |
Author
| Hambleton, Ronald K. | 8 |
| Reckase, Mark D. | 8 |
| Weiss, David J. | 6 |
| Alonzo, Julie | 5 |
| Tindal, Gerald | 5 |
| Ebel, Robert L. | 4 |
| Eignor, Daniel R. | 4 |
| Irvin, P. Shawn | 4 |
| Lai, Cheng-Fei | 4 |
| Park, Bitnara Jasmine | 4 |
| Capie, William | 3 |
Audience
| Practitioners | 38 |
| Teachers | 26 |
| Researchers | 22 |
| Administrators | 11 |
| Policymakers | 4 |
| Support Staff | 4 |
| Counselors | 3 |
| Students | 3 |
| Community | 1 |
Location
| Canada | 14 |
| Australia | 9 |
| Nebraska | 9 |
| Indonesia | 8 |
| New York | 8 |
| United Kingdom (England) | 7 |
| China | 6 |
| Turkey | 6 |
| United Kingdom | 6 |
| Illinois | 5 |
| Japan | 5 |
Laws, Policies, & Programs
| Every Student Succeeds Act… | 5 |
| Individuals with Disabilities… | 3 |
| Rehabilitation Act 1973… | 3 |
| Elementary and Secondary… | 2 |
| No Child Left Behind Act 2001 | 2 |
| Education Consolidation… | 1 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 1 |
| Meets WWC Standards with or without Reservations | 1 |
W. James Popham – Pearson, 2024
"Classroom Assessment" shows pre- and in-service teachers how to use classroom testing accurately and formatively to dramatically increase their teaching effectiveness and promote student learning. In addition to clear and concise guidelines on how to develop and use quality classroom assessments, the author also focuses on the teaching…
Descriptors: Student Evaluation, Testing, Teacher Effectiveness, Test Construction
Jyun-Hong Chen; Hsiu-Yi Chao – Journal of Educational and Behavioral Statistics, 2024
To solve the attenuation paradox in computerized adaptive testing (CAT), this study proposes an item selection method, the integer programming approach based on real-time test data (IPRD), to improve test efficiency. The IPRD method turns information regarding the ability distribution of the population from real-time test data into feasible test…
Descriptors: Data Use, Computer Assisted Testing, Adaptive Testing, Design
Wim J. van der Linden; Luping Niu; Seung W. Choi – Journal of Educational and Behavioral Statistics, 2024
A test battery with two different levels of adaptation is presented: a within-subtest level for the selection of the items in the subtests and a between-subtest level to move from one subtest to the next. The battery runs on a two-level model consisting of a regular response model for each of the subtests extended with a second level for the joint…
Descriptors: Adaptive Testing, Test Construction, Test Format, Test Reliability
Hwanggyu Lim; Danqi Zhu; Edison M. Choe; Kyung T. Han – Journal of Educational Measurement, 2024
This study presents a generalized version of the residual differential item functioning (RDIF) detection framework in item response theory, named GRDIF, to analyze differential item functioning (DIF) in multiple groups. The GRDIF framework retains the advantages of the original RDIF framework, such as computational efficiency and ease of…
Descriptors: Item Response Theory, Test Bias, Test Reliability, Test Construction
Shangchao Min; Kyoungwon Bishop – Language Testing, 2024
This paper evaluates the multistage adaptive test (MST) design of a large-scale academic language assessment (ACCESS) for Grades 1-12, with an aim to simplify the current MST design, using both operational and simulated test data. Study 1 explored the operational population data (1,456,287 test-takers) of the listening and reading tests of MST…
Descriptors: Adaptive Testing, Test Construction, Language Tests, English Language Learners
Patrisius Istiarto Djiwandono; Daniel Ginting – Language Education & Assessment, 2025
The teaching of English as a foreign language in Indonesia has a long history, and it is always important to ask whether the assessment of the students' language skills has been valid and reliable. A screening of many articles in several prominent databases reveals that a number of evaluation studies have been done by Indonesian scholars in the…
Descriptors: Foreign Countries, Language Tests, English (Second Language), Second Language Learning
Vinay Kumar Yadav; Shakti Prasad – Measurement: Interdisciplinary Research and Perspectives, 2024
In sample survey analysis, accurate population mean estimation is an important task, but traditional approaches frequently ignore the intricacies of real-world data, leading to biased results. In order to handle uncertainties, indeterminacies, and ambiguity, this work presents an innovative approach based on neutrosophic statistics. We proposed…
Descriptors: Sampling, Statistical Bias, Predictor Variables, Predictive Measurement
Ágnes Hódi; Edit Tóth – International Journal of Early Childhood, 2024
Phonological awareness plays a key role in learning to read; therefore, its assessment has received a lot of attention. Research in the domain of phonological awareness has been characterized by attempts to develop reliable and valid assessment tools for diverse populations. Over the past few decades, phonological awareness assessment has gone…
Descriptors: Phonological Awareness, Computer Assisted Testing, Hungarian, Native Language
Areekkuzhiyil, Santhosh – Online Submission, 2021
Assessment is an integral part of any teaching-learning process. Assessment has a large number of functions to perform, whether it is formative or summative. This paper analyses the issues involved and the areas of concern in classroom assessment practice and discusses the recent reforms taking place. [This paper was published in Edutracks v20 n8…
Descriptors: Student Evaluation, Formative Evaluation, Summative Evaluation, Test Validity
Aditya Shah; Ajay Devmane; Mehul Ranka; Prathamesh Churi – Education and Information Technologies, 2024
Online learning has grown due to the advancement of technology and flexibility. Online examinations measure students' knowledge and skills. Traditional question papers include inconsistent difficulty levels, arbitrary question allocations, and poor grading. The suggested model calibrates question paper difficulty based on student performance to…
Descriptors: Computer Assisted Testing, Difficulty Level, Grading, Test Construction
Tom Benton – Research Matters, 2024
Educational assessment is used throughout the world for a range of different formative and summative purposes. Wherever an assessment is developed, whether by a teacher creating a quiz for their class, or by a testing company creating a high stakes assessment, it is necessary to decide how long the test should be. Specifically, how many questions…
Descriptors: Foreign Countries, High Stakes Tests, Test Length, Test Construction
Patrick Kyllonen; Amit Sevak; Teresa Ober; Ikkyu Choi; Jesse Sparks; Daniel Fishtein – ETS Research Report Series, 2024
Assessment refers to a broad array of approaches for measuring or evaluating a person's (or group of persons') skills, behaviors, dispositions, or other attributes. Assessments range from standardized tests used in admissions, employee selection, licensure examinations, and domestic and international large-scale assessments of cognitive and…
Descriptors: Assessment Literacy, Testing, Test Bias, Test Construction
Osman Tat; Abdullah Faruk Kilic – Turkish Online Journal of Distance Education, 2024
The widespread availability of internet access in daily life has resulted in a greater acceptance of online assessment methods. E-assessment platforms offer various features such as randomizing questions and answers, utilizing extensive question banks, setting time limits, and managing access during online exams. Electronic assessment enables…
Descriptors: Test Construction, Test Validity, Test Reliability, Anxiety
Delphine Franco; Ruben Vanderlinde; Martin Valcke – European Journal of Education, 2025
Complex competences, such as managing students' aggressive behaviour, are challenging to develop during teacher training. Recently, video-based simulations have been considered promising, yet suitable assessment instruments remain scarce. This paper reports on the design and evaluation of a video-based assessment tool tailored to measure…
Descriptors: Preservice Teachers, Preservice Teacher Education, Student Behavior, Aggression
Venessa F. Manna; Shuhong Li; Spiros Papageorgiou; Lixiong Gu – ETS Research Report Series, 2025
This technical manual describes the purpose and intended uses of the TOEFL iBT test, its target test-taker population, and relevant language use domains. The test design and scoring procedures are presented first, followed by a research agenda intended to support the interpretation and use of test scores. Given the updates to the test starting…
Descriptors: Second Language Learning, English (Second Language), Language Tests, Test Construction

