Publication Date
In 2025 | 1 |
Since 2024 | 4 |
Since 2021 (last 5 years) | 21 |
Since 2016 (last 10 years) | 40 |
Since 2006 (last 20 years) | 74 |
Descriptor
Guidelines | 156 |
Test Validity | 156 |
Test Reliability | 60 |
Test Construction | 59 |
Language Tests | 39 |
Foreign Countries | 38 |
Second Language Learning | 37 |
Student Evaluation | 29 |
English (Second Language) | 28 |
Test Items | 28 |
Testing | 25 |
More ▼ |
Source
Author
Sireci, Stephen G. | 4 |
Hambleton, Ronald K. | 3 |
Alonzo, Julie | 2 |
Cox, Troy L. | 2 |
Erickson, Harley E. | 2 |
Lindheim, Elaine | 2 |
Malone, Margaret E. | 2 |
Miller, Patrick W. | 2 |
Popham, W. James | 2 |
Ravand, Hamdollah | 2 |
Tindal, Gerald | 2 |
More ▼ |
Publication Type
Education Level
Location
Europe | 8 |
China | 5 |
Japan | 3 |
Norway | 2 |
South Korea | 2 |
Spain | 2 |
Thailand | 2 |
United Kingdom | 2 |
Australia | 1 |
Bulgaria | 1 |
Canada | 1 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Benjawan Plengkham; Sonthaya Rattanasak; Patsawut Sukserm – Journal of Education and Learning, 2025
This academic article provides the essential steps for designing an effective English questionnaire in social science research, with a focus on ensuring clarity, cultural sensitivity and ethical integrity. Developed from key insights from related studies, it outlines potential practice in questionnaire design, item development and the importance…
Descriptors: Guidelines, Test Construction, Questionnaires, Surveys
Indiana Department of Education, 2024
The Indiana Department of Education's (IDOE's) Accessibility and Accommodations Information for Statewide Assessments is a document intended for school-level personnel and decision-making teams as they prepare for and implement Indiana statewide assessments. Information is provided for school personnel as a reference to inform guidance on…
Descriptors: Measurement, Statewide Planning, Standardized Tests, Academic Accommodations (Disabilities)
de Ruiter, Laura E.; Bers, Marina U. – Computer Science Education, 2022
Background and Context: Despite the increasing implementation of coding in early curricula, there are few valid and reliable assessments of coding abilities for young children. This impedes studying learning outcomes and the development and evaluation of curricula. Objective: Developing and validating a new instrument for assessing young…
Descriptors: Programming Languages, Computer Software, Coding, Computer Science Education
W. James Popham – Pearson, 2024
"Classroom Assessment" shows pre- and in-service teachers how to use classroom testing accurately and formatively to dramatically increase their teaching effectiveness and promote student learning. In addition to clear and concise guidelines on how to develop and use quality classroom assessments, the author also focuses on the teaching…
Descriptors: Student Evaluation, Testing, Teacher Effectiveness, Test Construction
Hae In Park – English Teaching, 2024
The present study aimed to validate a 70-item Korean bilingual version of the Vocabulary Size Test (VST) using Rasch modeling. The goal was to assess the applicability of this Korean version of the VST for Korean learners of English in an English as a foreign language (EFL) context by examining validity evidence based on Messick's framework.…
Descriptors: Korean, Bilingualism, English (Second Language), Second Language Learning
Im, Gwan-Hyeok; Shin, Dongil; Park, Soohyeon – Current Issues in Language Planning, 2022
This study suggests a conceptual framework for policy-driven test development and validation, using the Test of Proficiency in Korean (TOPIK) as an example context. By linking the literature on policy analysis and argument structure in the validation of testing, the strong relationships between policy and testing are illustrated. This rationalizes…
Descriptors: Language Proficiency, Language Tests, Korean, Test Construction
Articulating and Evaluating Validity Arguments for the "TOEIC"® Tests. Research Report. ETS RR-17-51
Schmidgall, Jonathan E. – ETS Research Report Series, 2017
This report provides a brief overview of how the "TOEIC"® program has adopted an argument-based approach to validity in order to support the use of the TOEIC tests. This approach emphasizes the need to explicitly state claims about the measurement quality and intended use of a test and to support those claims with evidence. This report…
Descriptors: English (Second Language), Language Tests, Second Language Learning, Test Use
Ketabi, Somaye; Alavi, Seyyed Mohammed; Ravand, Hamdollah – International Journal of Language Testing, 2021
Although Diagnostic Classification Models (DCMs) were introduced to education system decades ago, it seems that these models were not employed for the original aims upon which they had been designed. Using DCMs has been mostly common in analyzing large-scale non-diagnostic tests and these models have been rarely used in developing Cognitive…
Descriptors: Diagnostic Tests, Test Construction, Goodness of Fit, Classification
Mohammed Ambusaidi – ProQuest LLC, 2022
There is an increased demand on nursing faculty to provide quality teaching and assessment. Nursing faculty are required to ensure accurate assessment of learning through testing and outcome measurement that are critical elements of the evaluation process. Likewise, nursing faculty should implement a logical evaluation system. However, the…
Descriptors: Nursing Education, College Faculty, Test Construction, Test Validity
Feranchak, Bret; Deiger, Megan – AERA Online Paper Repository, 2017
Increasingly content area projects and programs at the K-12 level, such as in mathematics, involve a programmatic component or project emphasis on developing "teacher leadership". However, there is no consistent definition or framework for this construct and even fewer validated tools for measuring it. This paper describes our efforts in…
Descriptors: Teacher Leadership, Mathematics Instruction, Guidelines, Elementary Secondary Education
International Journal of Testing, 2018
The second edition of the International Test Commission Guidelines for Translating and Adapting Tests was prepared between 2005 and 2015 to improve upon the first edition, and to respond to advances in testing technology and practices. The 18 guidelines are organized into six categories to facilitate their use: pre-condition (3), test development…
Descriptors: Translation, Test Construction, Testing, Scoring
Huang, Heng-Tsung Danny; Hung, Shao-Ting Alan; Chao, Hsiu-Yi; Chen, Jyun-Hong; Lin, Tsui-Peng; Shih, Ching-Lin – Language Assessment Quarterly, 2022
Prompted by Taiwanese university students' increasing demand for English proficiency assessment, the absence of a test designed specifically for this demographic subgroup, and the lack of a localized and freely-accessible proficiency measure, this project set out to develop and validate a computerized adaptive English proficiency testing (E-CAT)…
Descriptors: Computer Assisted Testing, English (Second Language), Second Language Learning, Second Language Instruction
McKenna, Meaghan; Soto-Boykin, Xigrid; Cheng, Ke; Haynes, Elizabeth; Osorio, Amanda; Altshuler, Joan – Grantee Submission, 2021
This article describes the development and administration of a survey to identify early childhood educators' successes and barriers when delivering remote instruction (e.g., online whole or small group instruction) during the COVID-19 pandemic to children 2-5 years old. The survey was developed using procedures outlined by the commonly accepted…
Descriptors: Test Construction, Testing, Surveys, Early Childhood Teachers
Burton, J. Dylan – Language Assessment Quarterly, 2023
The effects of question or task complexity on second language speaking have traditionally been investigated using complexity, accuracy, and fluency measures. Response processes in speaking tests, however, may manifest in other ways, such as through nonverbal behavior. Eye behavior, in the form of averted gaze or blinking frequency, has been found…
Descriptors: Oral Language, Speech Communication, Language Tests, Eye Movements
Khabbazbashi, Nahal; Galaczi, Evelina D. – Language Testing, 2020
This mixed methods study examined holistic, analytic, and part marking models (MMs) in terms of their measurement properties and impact on candidate CEFR classifications in a semi-direct online speaking test. Speaking performances of 240 candidates were first marked holistically and by part (phase 1). On the basis of phase 1 findings--which…
Descriptors: Holistic Approach, Classification, Grading, Language Tests