Publication Date
In 2025 | 1 |
Since 2024 | 2 |
Since 2021 (last 5 years) | 9 |
Since 2016 (last 10 years) | 25 |
Since 2006 (last 20 years) | 39 |
Descriptor
Difficulty Level | 47 |
Scores | 47 |
Test Validity | 47 |
Test Items | 31 |
Foreign Countries | 22 |
Test Reliability | 20 |
Test Construction | 17 |
Item Analysis | 11 |
Language Tests | 10 |
Second Language Learning | 9 |
English (Second Language) | 8 |
More ▼ |
Source
Author
Chen, Jing | 2 |
Pollock, Steven J. | 2 |
Rock, Donald A. | 2 |
Agawa, Toshie | 1 |
Akhtar, Hanif | 1 |
Ali, Syed Haris | 1 |
Alonazi, Zaha | 1 |
Apichat Khamboonruang | 1 |
Asano, Keiko | 1 |
Asikainen, Mervi A. | 1 |
Bansilal, Sarah | 1 |
More ▼ |
Publication Type
Education Level
Higher Education | 18 |
Postsecondary Education | 15 |
Elementary Education | 10 |
Secondary Education | 4 |
Early Childhood Education | 3 |
High Schools | 3 |
Primary Education | 3 |
Grade 1 | 2 |
Grade 4 | 2 |
Grade 5 | 2 |
Middle Schools | 2 |
More ▼ |
Audience
Location
Turkey | 3 |
Canada | 2 |
Colorado | 2 |
South Africa | 2 |
Asia | 1 |
Colombia | 1 |
Finland | 1 |
Indonesia | 1 |
Iran | 1 |
Israel | 1 |
Italy | 1 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Hojung Kim; Changkyung Song; Jiyoung Kim; Hyeyun Jeong; Jisoo Park – Language Testing in Asia, 2024
This study presents a modified version of the Korean Elicited Imitation (EI) test, designed to resemble natural spoken language, and validates its reliability as a measure of proficiency. The study assesses the correlation between average test scores and Test of Proficiency in Korean (TOPIK) levels, examining score distributions among beginner,…
Descriptors: Korean, Test Validity, Test Reliability, Imitation
Apichat Khamboonruang – Language Testing in Asia, 2025
Chulalongkorn University Language Institute (CULI) test was developed as a local standardised test of English for professional and international communication. To ensure that the CULI test fulfils its intended purposes, this study employed Kane's argument-based validation and Rasch measurement approaches to construct the validity argument for the…
Descriptors: Universities, Second Language Learning, Second Language Instruction, Language Tests
Lee, Shinhye – ETS Research Report Series, 2022
In response to the calls for making key stakeholders' perspectives relevant in the test validation process, the study discussed in this report sought test-taker feedback as part of collecting validity evidence and supporting the ongoing field testing efforts of the new "TOEFL ITP"® Speaking section. Specifically, I aimed to investigate…
Descriptors: English (Second Language), Second Language Learning, Language Tests, Test Validity
Tim Jacobbe; Bob delMas; Brad Hartlaub; Jeff Haberstroh; Catherine Case; Steven Foti; Douglas Whitaker – Numeracy, 2023
The development of assessments as part of the funded LOCUS project is described. The assessments measure students' conceptual understanding of statistics as outlined in the GAISE PreK-12 Framework. Results are reported from a large-scale administration to 3,430 students in grades 6 through 12 in the United States. Items were designed to assess…
Descriptors: Statistics Education, Common Core State Standards, Student Evaluation, Elementary School Students
Emily Tucker – ProQuest LLC, 2021
To better understand Tennessee's new standardized science assessments, this quantitative study utilized a nonexperimental, descriptive-comparative design to compare the readability of the long-used science TCAP assessment with the newly created science TNReady assessment in grades three, four, and five. As new standards in the state boast higher…
Descriptors: Science Tests, Standardized Tests, Achievement Tests, Readability
Akhtar, Hanif – International Association for Development of the Information Society, 2022
When examinees perceive a test as low stakes, it is logical to assume that some of them will not put out their maximum effort. This condition makes the validity of the test results more complicated. Although many studies have investigated motivational fluctuation across tests during a testing session, only a small number of studies have…
Descriptors: Intelligence Tests, Student Motivation, Test Validity, Student Attitudes
Mohammed Ambusaidi – ProQuest LLC, 2022
There is an increased demand on nursing faculty to provide quality teaching and assessment. Nursing faculty are required to ensure accurate assessment of learning through testing and outcome measurement that are critical elements of the evaluation process. Likewise, nursing faculty should implement a logical evaluation system. However, the…
Descriptors: Nursing Education, College Faculty, Test Construction, Test Validity
Guven Demir, Elif; Öksuz, Yücel – Participatory Educational Research, 2022
This research aimed to investigate animation-based achievement tests according to the item format, psychometric features, students' performance, and gender. The study sample consisted of 52 fifth-grade students in Samsun/Turkey in 2017-2018. Measures of the research were open-ended (OE), animation-based open-ended (AOE), multiple-choice (MC), and…
Descriptors: Animation, Achievement Tests, Test Items, Psychometrics
Wu, Amery D.; Chen, Michelle Y.; Stone, Jake E. – International Journal of Testing, 2018
This article investigates how test-takers change their strategies to handle increased test difficulty. An adult sample reported their test-taking strategies immediately after completing the tasks in a reading test. Data were analyzed using structural equation modeling specifying a measurement-invariant, ability-moderated, latent transition…
Descriptors: Test Wiseness, Reading Tests, Reading Comprehension, Difficulty Level
Yuksel, Ibrahim; Savas, Muhammed Ali – Asian Journal of Education and Training, 2019
In this research, it is aimed to develop a valid and reliable test to determine the drawing a shape-schema and making a table levels of prospective teachers at Mathematics and Science Education, Turkish and Social Sciences Education and Basic Education Departments. In this process, a comprehensive item pool has been prepared with the table of…
Descriptors: Preservice Teachers, Item Banks, Test Validity, Foreign Countries
Bastianello, Tamara; Brondino, Margherita; Persici, Valentina; Majorano, Marinella – Journal of Research in Childhood Education, 2023
The present contribution aims at presenting an assessment tool (i.e., the TALK-assessment) built to evaluate the language development and school readiness of Italian preschoolers before they enter primary school, and its predictive validity for the children's reading and writing skills at the end of the first year of primary school. The early…
Descriptors: Literacy, Computer Assisted Testing, Italian, Language Acquisition
Alonazi, Zaha – ProQuest LLC, 2019
Achieving sufficient proficiency in academic writing is critical in university level setting. It is not surprising hence, that the admission to English speaking universities is usually conditioned not only by a particular total score from Standardized tests of English proficiency, e.g., TOEFL or IELTs but also a specific band score in writing. To…
Descriptors: Writing Tests, Placement Tests, Language Tests, College Admission
Shakhman, Larisa; Barak, Moshe – EURASIA Journal of Mathematics, Science and Technology Education, 2019
This study addresses the development and evaluation of the Physics Problem-Solving Taxonomy (PPST), comprising five levels: retrieval, diagnosis, strategy, conceptual, and creative thinking. The taxonomy draws on Bloom's revised taxonomy in the cognitive domain, the Types of Knowledge Taxonomy, and the Problem-Solving Taxonomy in engineering. The…
Descriptors: Foreign Countries, Physics, Problem Solving, Taxonomy
Chen, Pei-Hua; Fu, Jen-Tso – Language Assessment Quarterly, 2018
The Revised Preschool Language Assessment (RPLA) is a standardized measure for examining the language status of and determining potential language difficulties among preschoolers between 3 and 6 years old. To facilitate the applicability of the RPLA for use with Mandarin-speaking children, the present study adopted exploratory factor analysis…
Descriptors: Mandarin Chinese, Preschool Children, Language Tests, Standardized Tests
Relkin, Emily; de Ruiter, Laura; Bers, Marina Umaschi – Journal of Science Education and Technology, 2020
There is a need for developmentally appropriate Computational Thinking (CT) assessments that can be implemented in early childhood classrooms. We developed a new instrument called "TechCheck" for assessing CT skills in young children that does not require prior knowledge of computer programming. "TechCheck" is based on…
Descriptors: Developmentally Appropriate Practices, Computation, Thinking Skills, Early Childhood Education