Sachin Nedungadi; Corina E. Brown; Sue Hyeon Paek – Journal of Chemical Education, 2022
The Fundamental Concepts for Organic Reaction Mechanisms Inventory (FC-ORMI) is a concept inventory with most items in a two-tier design in which an answer tier is followed by a reasoning tier. Statistical results provided strong evidence for the validity and reliability of the data obtained using the FC-ORMI. In this study, differential item…
Descriptors: Test Bias, Test Validity, Test Reliability, Gender Differences
Malik, Umairia; Low, David; Wilson, Kate – Physics Teacher, 2021
We ask questions of students in order to probe their understanding. We design our questions in such a way that we can assess a student's progress towards an accurate worldview. However, there is a consensus that a performance gap exists in many physics assessments, where male students outperform their female peers. While early work in this area…
Descriptors: Physics, Science Instruction, World Views, Science Tests
Guven Demir, Elif; Öksuz, Yücel – Participatory Educational Research, 2022
This research aimed to investigate animation-based achievement tests according to the item format, psychometric features, students' performance, and gender. The study sample consisted of 52 fifth-grade students in Samsun/Turkey in 2017-2018. Measures of the research were open-ended (OE), animation-based open-ended (AOE), multiple-choice (MC), and…
Descriptors: Animation, Achievement Tests, Test Items, Psychometrics
Moyer, Eric L.; Galindo, Jennifer – National Assessment Governing Board, 2023
The National Assessment Governing Board (the Board) contracted with Pearson to design and implement a review of the achievement level descriptions (ALDs) for National Assessment of Educational Progress (NAEP) Grade 8 assessments in Science, U.S. History, and Civics. This document describes the procedural and technical aspects and outcomes of the…
Descriptors: National Competency Tests, Student Evaluation, Grade 8, Academic Achievement
Hrouzková, Tereza; Richterek, Lukáš – International Baltic Symposium on Science and Technology Education, 2021
The Lawson classroom test of scientific reasoning is a popular and widely used tool that measures the level and development of students' scientific reasoning skills. In this contribution, the results of this test for N=446 students of the Faculty of Science, Palacký University Olomouc, from the years 2018-2020 at the beginning of their…
Descriptors: Science Tests, Thinking Skills, Undergraduate Students, Science Education
Luo, Wei; Smith, Thomas J.; Whalley, Kyle; Darling, Andrew; Ormand, Carol; Hung, Wei-Chen; Chiang, Jui-Ling; Pelletier, Jon; Duffin, Kirk – British Journal of Educational Technology, 2019
This paper presents results from a randomized experimental design replicated over four semesters that compared students' performance in understanding landform evolution processes as measured by the pretest to posttest score growth between two treatment methods: an online interactive simulation tool and a paper-based exercise. While both methods…
Descriptors: Earth Science, Models, Science Tests, Computer Simulation
Steedle, Jeffrey; Pashley, Peter; Cho, YoungWoo – ACT, Inc., 2020
Three mode comparability studies were conducted on the following Saturday national ACT test dates: October 26, 2019, December 14, 2019, and February 8, 2020. The primary goal of these studies was to evaluate whether ACT scores exhibited mode effects between paper and online testing that would necessitate statistical adjustments to the online…
Descriptors: Test Format, Computer Assisted Testing, College Entrance Examinations, Scores
Traxler, Adrienne; Henderson, Rachel; Stewart, John; Stewart, Gay; Papak, Alexis; Lindell, Rebecca – Physical Review Physics Education Research, 2018
Research on the test structure of the Force Concept Inventory (FCI) has largely ignored gender, and research on FCI gender effects (often reported as "gender gaps") has seldom interrogated the structure of the test. These rarely crossed streams of research leave open the possibility that the FCI may not be structurally valid across…
Descriptors: Physics, Science Instruction, Sex Fairness, Gender Differences
Nagy, Gabriel; Nagengast, Benjamin; Frey, Andreas; Becker, Michael; Rose, Norman – Assessment in Education: Principles, Policy & Practice, 2019
Position effects (PE) cause decreasing probabilities of correct item responses towards the end of a test. We analysed PEs in science, mathematics and reading tests administered in the German extension to the PISA 2006 study with respect to their variability at the student- and school-level. PEs were strongest in reading and weakest in mathematics.…
Descriptors: Achievement Tests, Foreign Countries, Secondary School Students, International Assessment
Yalcin, Seher – Eurasian Journal of Educational Research, 2018
Purpose: Studies in the literature have generally demonstrated that the causes of differential item functioning (DIF) are complex and not directly related to defined groups. The purpose of this study is to determine the DIF according to the mixture item response theory (MixIRT) model, based on the latent group approach, as well as the…
Descriptors: Item Response Theory, Test Items, Test Bias, Error of Measurement
Shah, Lisa; Hao, Jie; Schneider, Jeremy; Fallin, Rebekah; Cortes, Kimberly Linenberger; Ray, Herman E.; Rushton, Gregory T. – Journal of Chemical Education, 2018
Teachers play a critical role in the preparation of future science, technology, engineering, and mathematics majors and professionals. What teachers know about their discipline (i.e., content knowledge) has been identified as an important aspect of instructional effectiveness; however, studies have not yet assessed the content knowledge of…
Descriptors: Science Teachers, Science Instruction, Chemistry, Introductory Courses
Kaliski, Pamela K.; Wind, Stefanie A.; Engelhard, George, Jr.; Morgan, Deanna L.; Plake, Barbara S.; Reshetar, Rosemary A. – Educational and Psychological Measurement, 2013
The many-faceted Rasch (MFR) model has been used to evaluate the quality of ratings on constructed response assessments; however, it can also be used to evaluate the quality of judgments from panel-based standard setting procedures. The current study illustrates the use of the MFR model for examining the quality of ratings obtained from a standard…
Descriptors: Item Response Theory, Models, Standard Setting (Scoring), Science Tests
Qian, Xiaoyu; Nandakumar, Ratna; Glutting, Joseph; Ford, Danielle; Fifield, Steve – ETS Research Report Series, 2017
In this study, we investigated gender and minority achievement gaps on 8th-grade science items employing a multilevel item response methodology. Both gaps were wider on physics and earth science items than on biology and chemistry items. Larger gender gaps were found on items with specific topics favoring male students than other items, for…
Descriptors: Item Analysis, Gender Differences, Achievement Gap, Grade 8
Liu, Ou Lydia; Ryoo, Kihyun; Linn, Marcia C.; Sato, Elissa; Svihla, Vanessa – International Journal of Science Education, 2015
Although researchers call for inquiry learning in science, science assessments rarely capture the impact of inquiry instruction. This paper reports on the development and validation of assessments designed to measure middle-school students' progress in gaining integrated understanding of energy while studying an inquiry-oriented curriculum. The…
Descriptors: Energy, Science Education, Psychometrics, Case Studies
Kim, Sooyeon; Walker, Michael E. – Educational Testing Service, 2011
This study examines the use of subpopulation invariance indices to evaluate the appropriateness of using a multiple-choice (MC) item anchor in mixed-format tests, which include both MC and constructed-response (CR) items. Linking functions were derived in the nonequivalent groups with anchor test (NEAT) design using an MC-only anchor set for 4…
Descriptors: Test Format, Multiple Choice Tests, Test Items, Gender Differences