ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	3
Since 2016 (last 10 years)	5
Since 2006 (last 20 years)	13

Descriptor

Difficulty Level	13
Test Items	13
Test Validity	13
Grade 8	10
Item Response Theory	7
Middle School Students	6
Foreign Countries	5
Mathematics Tests	5
Test Construction	5
Grade 7	4
Multiple Choice Tests	4
Grade 4	3
Science Tests	3
Student Evaluation	3
Test Reliability	3
Achievement Tests	2
Computer Assisted Testing	2
Grade 6	2
International Assessment	2
Item Analysis	2
Language Usage	2
National Competency Tests	2
Psychometrics	2
Science Achievement	2
Science Instruction	2
More ▼

Source

Behavioral Research and…	2
Educational Assessment	2
American Institutes for…	1
Cypriot Journal of…	1
ETS Research Report Series	1
International Journal of…	1
Journal of Chemical Education	1
Journal of Research on…	1
Large-scale Assessments in…	1
Online Submission	1
Participatory Educational…	1
More ▼

Publication Type

Reports - Research	11
Journal Articles	9
Numerical/Quantitative Data	2
Reports - Descriptive	1
Reports - Evaluative	1
Speeches/Meeting Papers	1

Education Level

Grade 8	13
Middle Schools	12
Elementary Education	10
Junior High Schools	8
Secondary Education	8
Grade 7	6
Grade 4	5
Elementary Secondary Education	4
Grade 6	4
Grade 3	2
Grade 5	2
Grade 9	2
High Schools	2
Intermediate Grades	2
Grade 10	1
Grade 11	1
Grade 12	1
More ▼

Audience

Location

California	2
Georgia	2
Turkey	2
Alabama	1
Arizona	1
Arkansas	1
Connecticut	1
Florida	1
Germany	1
Idaho	1
Illinois	1
Indiana	1
Iowa	1
Jordan	1
Kentucky	1
Nevada	1
Singapore	1
South Africa	1
Tennessee	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

Trends in International…	3
National Assessment of…	1

What Works Clearinghouse Rating

Showing all 13 results Save | Export

Investigating Item Complexity as a Source of Cross-National DIF in TIMSS Math and Science

Peer reviewed

Direct link

Qi Huang; Daniel M. Bolt; Weicong Lyu – Large-scale Assessments in Education, 2024

Large scale international assessments depend on invariance of measurement across countries. An important consideration when observing cross-national differential item functioning (DIF) is whether the DIF actually reflects a source of bias, or might instead be a methodological artifact reflecting item response theory (IRT) model misspecification.…

Descriptors: Test Items, Item Response Theory, Test Bias, Test Validity

Assessing Source Evaluation Skills of Middle School Students Using Learning Progressions

Peer reviewed

Direct link

Sparks, Jesse R.; van Rijn, Peter W.; Deane, Paul – Educational Assessment, 2021

Effectively evaluating the credibility and accuracy of multiple sources is critical for college readiness. We developed 24 source evaluation tasks spanning four predicted difficulty levels of a hypothesized learning progression (LP) and piloted these tasks to evaluate the utility of an LP-based approach to designing formative literacy assessments.…

Descriptors: Middle School Students, Information Sources, Grade 6, Grade 7

Validity and Reliability of Eight-Grade Digital Culture Test in Light of Item Response Theory

Peer reviewed
PDF on ERIC

Download full text

Alnasraween, Moen Salman; Almughrabi, Ayat Mohammad; Ammari, Raeda Mofid; Alkaramneh, Mohammad Saleh – Cypriot Journal of Educational Sciences, 2021

The purpose of this study is to construct a digital culture test in light of the Item Response Theory and to investigate its psychometric properties. The study sample consisted of six hundred fifty (650) male and female students in the eighth grade from the Directorate of Education and Teaching of Salt District. To obtain the results, the…

Descriptors: Foreign Countries, Technological Literacy, Tests, Psychometrics

A Study on the Identification of Latent Classes Using Mixture Item Response Theory Models: TIMSS 2015 Case

Peer reviewed
PDF on ERIC

Download full text

Saatçioglu, Fatima Münevver; Atar, Hakan Yavuz – Participatory Educational Research, 2020

This study examined the existence of latent classes in TIMSS 2015 data from three countries, Singapure, Turkey and South Africa, were analyzed using Mixture Item Response Theory (MixIRT) models (Rasch, 1PL, 2PL and 3PL) on 18 multiple-choice items in the science subtest. Based on the findings, it was concluded that the data obtained from TIMSS…

Descriptors: Foreign Countries, Item Response Theory, Achievement Tests, International Assessment

The Impact of Sub-Skills and Item Content on Students' Skills with Regard to the Control-of-Variables Strategy

Peer reviewed

Direct link

Schwichow, Martin; Christoph, Simon; Boone, William J.; Härtig, Hendrik – International Journal of Science Education, 2016

The so-called control-of-variables strategy (CVS) incorporates the important scientific reasoning skills of designing controlled experiments and interpreting experimental outcomes. As CVS is a prominent component of science standards appropriate assessment instruments are required to measure these scientific reasoning skills and to evaluate the…

Descriptors: Thinking Skills, Science Instruction, Science Experiments, Science Tests

Using Ordered Multiple-Choice Items to Assess Students' Understanding of the Structure and Composition of Matter

Peer reviewed

Direct link

Hadenfeldt, Jan C.; Bernholt, Sascha; Liu, Xiufeng; Neumann, Knut; Parchmann, Ilka – Journal of Chemical Education, 2013

Helping students develop a sound understanding of scientific concepts can be a major challenge. Lately, learning progressions have received increasing attention as a means to support students in developing understanding of core scientific concepts. At the center of a learning progression is a sequence of developmental levels reflecting an…

Descriptors: Elementary School Science, Secondary School Science, Science Instruction, Chemistry

Study of the Feasibility of a NAEP Mathematics Accessible Block Alternative

Download full text

DeStefano, Lizanne; Johnson, Jeremiah – American Institutes for Research, 2013

This paper describes one of the first efforts by the National Assessment of Educational Progress (NAEP) to improve measurement at the lower end of the distribution, including measurement for students with disabilities (SD) and English language learners (ELLs). One way to improve measurement at the lower end is to introduce one or more…

Descriptors: National Competency Tests, Measures (Individuals), Disabilities, English Language Learners

An Application of Cognitive Diagnostic Assessment on TIMMS-2007 8th Grade Mathematics Items

Download full text

Toker, Turker; Green, Kathy – Online Submission, 2012

The least squares distance method (LSDM) was used in a cognitive diagnostic analysis of TIMSS (Trends in International Mathematics and Science Study) items administered to 4,498 8th-grade students from seven geographical regions of Turkey, extending analysis of attributes from content to process and skill attributes. Logit item positions were…

Descriptors: Foreign Countries, Least Squares Statistics, Grade 8, Mathematics Tests

Creating Vocabulary Item Types That Measure Students' Depth of Semantic Knowledge. Research Report. ETS RR-14-02

Peer reviewed
PDF on ERIC

Download full text

Deane, Paul; Lawless, René R.; Li, Chen; Sabatini, John; Bejar, Isaac I.; O'Reilly, Tenaha – ETS Research Report Series, 2014

We expect that word knowledge accumulates gradually. This article draws on earlier approaches to assessing depth, but focuses on one dimension: richness of semantic knowledge. We present results from a study in which three distinct item types were developed at three levels of depth: knowledge of common usage patterns, knowledge of broad topical…

Descriptors: Vocabulary, Test Items, Language Tests, Semantics

Development and Validation of the Student Tool for Technology Literacy (ST[superscript 2]L)

Peer reviewed
PDF on ERIC

Download full text

Hohlfeld, Tina N.; Ritzhaupt, Albert D.; Barron, Ann E. – Journal of Research on Technology in Education, 2010

This article provides an overview of the development and validation of the Student Tool for Technology Literacy (ST[superscript 2]L). Developing valid and reliable objective performance measures for monitoring technology literacy is important to all organizations charged with equipping students with the technology skills needed to successfully…

Descriptors: Test Validity, Ability Grouping, Grade 8, Test Construction

Computer Testing as a Form of Accommodation for English Language Learners

Peer reviewed

Direct link

Abedi, Jamal – Educational Assessment, 2009

This study compared performance of both English language learners (ELLs) and non-ELL students in Grades 4 and 8 under accommodated and nonaccommodated testing conditions. The accommodations used in this study included a computerized administration of a math test with a pop-up glossary, a customized English dictionary, extra testing time, and…

Descriptors: Computer Assisted Testing, Testing Accommodations, Mathematics Tests, Grade 4

Instrument Development Procedures for Mathematics Measures. Technical Report Number 08-02

Download full text

Jung, Eunju; Liu, Kimy; Ketterlin-Geller, Leanne R.; Tindal, Gerald – Behavioral Research and Teaching, 2008

The purpose of this study was to develop general outcome measures (GOM) in mathematics so that teachers could focus their instruction on needed prerequisite skills. We describe in detail, the manner in which content-related evidence was established and then present a number of statistical analyses conducted to evaluate the technical adequacy of…

Descriptors: Item Analysis, Test Construction, Test Theory, Mathematics Tests

Instrument Development Procedures for Silent Reading Measures. Technical Report Number 08-03

Download full text

Liu, Kimy; Sundstrom-Hebert, Krystal; Ketterlin-Geller, Leanne R.; Tindal, Gerald – Behavioral Research and Teaching, 2008

The purpose of this study was to develop and gather validity evidence for silent reading fluency passages. A number of passages were written following a traditional story grammar structure (character, setting, events) and placed on a computer for students to read silently. We describe in detail, the manner in which content-related evidence was…

Descriptors: Silent Reading, Reading Fluency, Reading Tests, Test Validity

Deane, Paul	2
Ketterlin-Geller, Leanne R.	2
Liu, Kimy	2
Tindal, Gerald	2
Abedi, Jamal	1
Alkaramneh, Mohammad Saleh	1
Almughrabi, Ayat Mohammad	1
Alnasraween, Moen Salman	1
Ammari, Raeda Mofid	1
Atar, Hakan Yavuz	1
Barron, Ann E.	1
Bejar, Isaac I.	1
Bernholt, Sascha	1
Boone, William J.	1
Christoph, Simon	1
Daniel M. Bolt	1
DeStefano, Lizanne	1
Green, Kathy	1
Hadenfeldt, Jan C.	1
Hohlfeld, Tina N.	1
Härtig, Hendrik	1
Johnson, Jeremiah	1
Jung, Eunju	1
Lawless, René R.	1
Li, Chen	1
More ▼