Publication Date
In 2025 | 1 |
Since 2024 | 2 |
Since 2021 (last 5 years) | 11 |
Since 2016 (last 10 years) | 22 |
Since 2006 (last 20 years) | 29 |
Descriptor
Difficulty Level | 29 |
Test Reliability | 29 |
Test Items | 22 |
Foreign Countries | 17 |
Test Validity | 15 |
Test Construction | 11 |
Middle School Students | 10 |
Grade 7 | 9 |
Grade 8 | 8 |
Psychometrics | 8 |
Grade 5 | 7 |
More ▼ |
Source
Author
Publication Type
Reports - Research | 27 |
Journal Articles | 24 |
Numerical/Quantitative Data | 2 |
Dissertations/Theses -… | 1 |
Reports - Evaluative | 1 |
Speeches/Meeting Papers | 1 |
Tests/Questionnaires | 1 |
Education Level
Middle Schools | 29 |
Elementary Education | 21 |
Secondary Education | 20 |
Junior High Schools | 18 |
Intermediate Grades | 12 |
Grade 5 | 8 |
Grade 7 | 8 |
Grade 8 | 8 |
Grade 6 | 7 |
High Schools | 5 |
Grade 9 | 4 |
More ▼ |
Audience
Location
Turkey | 7 |
Indonesia | 5 |
Turkey (Ankara) | 2 |
Turkey (Istanbul) | 2 |
Arizona | 1 |
Canada (Montreal) | 1 |
Georgia | 1 |
Germany | 1 |
Hawaii | 1 |
Idaho | 1 |
Indiana | 1 |
More ▼ |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
Gates MacGinitie Reading Tests | 1 |
Program for International… | 1 |
Trends in International… | 1 |
What Works Clearinghouse Rating
Ruying Li; Gaofeng Li – International Journal of Science and Mathematics Education, 2025
Systems thinking (ST) is an essential competence for future life and biology learning. Appropriate assessment is critical for collecting sufficient information to develop ST in biology education. This research offers an ST framework based on a comprehensive understanding of biological systems, encompassing four skills across three complexity…
Descriptors: Test Construction, Test Validity, Science Tests, Cognitive Tests
Deniz, Kaan Zulfikar; Ilican, Emel – International Journal of Assessment Tools in Education, 2021
This study aims to compare the G and Phi coefficients as estimated by D studies for a measurement tool with the G and Phi coefficients obtained from real cases in which items of differing difficulty levels were added and also to determine the conditions under which the D studies estimated reliability coefficients closer to reality. The study group…
Descriptors: Generalizability Theory, Test Items, Difficulty Level, Test Reliability
Büsra Kilinç; Mehmet Diyaddin Yasar – Science Insights Education Frontiers, 2024
In this study, it was aimed to develop an achievement test taking into account the subject acquisitions of the sound and properties unit in the sixth-grade science course. In the test development phase, firstly, literature review for the study was conducted. Then, 30 multiple choice questions in align with the subject acquisition in the 2018…
Descriptors: Science Tests, Test Construction, Grade 6, Science Instruction
Munawarah; Thalhah, Siti Zuhaerah; Angriani, Andi Dian; Nur, Fitriani; Kusumayanti, Andi – Mathematics Teaching Research Journal, 2021
The increase in the need for critical and analytical thinking among students to boost their confidence in dealing with complex and difficult problems has led to the development of computational skills. Therefore, this study aims to develop an instrument test for computational thinking (CT) skills in the mathematics-based RME (Realistic Mathematics…
Descriptors: Test Construction, Mathematics Tests, Computation, Thinking Skills
Sparks, Jesse R.; van Rijn, Peter W.; Deane, Paul – Educational Assessment, 2021
Effectively evaluating the credibility and accuracy of multiple sources is critical for college readiness. We developed 24 source evaluation tasks spanning four predicted difficulty levels of a hypothesized learning progression (LP) and piloted these tasks to evaluate the utility of an LP-based approach to designing formative literacy assessments.…
Descriptors: Middle School Students, Information Sources, Grade 6, Grade 7
Jéldrez, Elvira; Cain, Kate; Silva, Macarena; Strasser, Katherine – Reading Psychology, 2023
Reading motivation is multidimensional and a critical contributor to students' reading comprehension skill. Its multidimensionality is problematic, as there is currently no consensus on the dimensions underlying reading motivation, which are being tested through a variety of instruments that lack statistical validation. Our goal was to discuss the…
Descriptors: Reading Motivation, Test Validity, Test Reliability, Factor Structure
Aleyna Altan; Zehra Taspinar Sener – Online Submission, 2023
This research aimed to develop a valid and reliable test to be used to detect sixth grade students' misconceptions and errors regarding the subject of fractions. A misconception diagnostic test has been developed that includes the concept of fractions, different representations of fractions, ordering and comparing fractions, equivalence of…
Descriptors: Diagnostic Tests, Mathematics Tests, Fractions, Misconceptions
Bozdag, Hüseyin Cihan; Türkoguz, Suat – International Online Journal of Primary Education, 2021
The study determines the conceptual understanding levels of primary school students on the concept of light according to the Rasch Model with a Four-tier Light Conceptual Understanding Test (LCUT). The participants were 355 (164 girls and 191 boys) primary school students studying at a public school in Izmir city center. In the study, the Rasch…
Descriptors: Foreign Countries, Elementary School Students, Grade 5, Item Response Theory
Guven Demir, Elif; Öksuz, Yücel – Participatory Educational Research, 2022
This research aimed to investigate animation-based achievement tests according to the item format, psychometric features, students' performance, and gender. The study sample consisted of 52 fifth-grade students in Samsun/Turkey in 2017-2018. Measures of the research were open-ended (OE), animation-based open-ended (AOE), multiple-choice (MC), and…
Descriptors: Animation, Achievement Tests, Test Items, Psychometrics
Alnasraween, Moen Salman; Almughrabi, Ayat Mohammad; Ammari, Raeda Mofid; Alkaramneh, Mohammad Saleh – Cypriot Journal of Educational Sciences, 2021
The purpose of this study is to construct a digital culture test in light of the Item Response Theory and to investigate its psychometric properties. The study sample consisted of six hundred fifty (650) male and female students in the eighth grade from the Directorate of Education and Teaching of Salt District. To obtain the results, the…
Descriptors: Foreign Countries, Technological Literacy, Tests, Psychometrics
Basaraba, Deni L.; Yovanoff, Paul; Shivraj, Pooja; Ketterlin-Geller, Leanne R. – Practical Assessment, Research & Evaluation, 2020
Stopping rules for fixed-form tests with graduated item difficulty are intended to stop administration of a test at the point where students are sufficiently unlikely to provide a correct response following a pattern of incorrect responses. Although widely employed in fixed-form tests in education, little research has been done to empirically…
Descriptors: Formative Evaluation, Test Format, Test Items, Difficulty Level
Chastenay, Pierre; Riopel, Martin – Physical Review Physics Education Research, 2020
We present the development and validation of a new assessment tool, the Moon Phases Concept Inventory for Middle School (MPCI-MS), a concept inventory about the phases of the moon targeting students aged 10 to 14 years old. Items in the questionnaire are based on a careful examination of the concept domain of phases of the moon, ideas and concepts…
Descriptors: Test Construction, Test Validity, Astronomy, Scientific Concepts
Ölmez, Ibrahim Burak; Ölmez, Safiye Bahar – Mathematics Education Research Journal, 2019
The purpose of this study was to investigate the psychometric characteristics of the Math Anxiety Scale (MANX; Erol 1989, Unpublished master thesis, Bogazici University) with data collected from 952 middle school students in Turkey. The Rasch Rating Scale model was used to examine the MANX at the item level. The results revealed that although the…
Descriptors: Middle School Students, Mathematics Instruction, Foreign Countries, Difficulty Level
Semiun, Thresia Trivict; Luruk, Fransiska Densiana – English Language Teaching Educational Journal, 2020
This study aimed at examining the quality of an English summative test of grade VII in a public school located in Kupang. Particularly, this study examined content validity, reliability, and conducted item analysis including item validity, item difficulty, item discrimination, and distracter effectiveness. This study was descriptive evaluative…
Descriptors: Summative Evaluation, Language Tests, English (Second Language), Content Validity
Atalmis, Erkan Hasan – International Journal of Assessment Tools in Education, 2018
Although multiple-choice items (MCIs) are widely used for classroom assessment, designing MCIs with sufficient number of plausible distracters is very challenging for teachers. In this regard, previous empirical studies reveal that using three-option MCIs provides various advantages when compared to four-option MCIs due to less preparation and…
Descriptors: Multiple Choice Tests, Test Items, Difficulty Level, Test Reliability
Previous Page | Next Page »
Pages: 1 | 2