Publication Date
| In 2026 | 0 |
| Since 2025 | 220 |
| Since 2022 (last 5 years) | 1089 |
| Since 2017 (last 10 years) | 2599 |
| Since 2007 (last 20 years) | 4960 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 653 |
| Teachers | 563 |
| Researchers | 250 |
| Students | 201 |
| Administrators | 81 |
| Policymakers | 22 |
| Parents | 17 |
| Counselors | 8 |
| Community | 7 |
| Support Staff | 3 |
| Media Staff | 1 |
| More ▼ | |
Location
| Turkey | 226 |
| Canada | 223 |
| Australia | 155 |
| Germany | 116 |
| United States | 99 |
| China | 90 |
| Florida | 86 |
| Indonesia | 82 |
| Taiwan | 78 |
| United Kingdom | 73 |
| California | 66 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 4 |
| Meets WWC Standards with or without Reservations | 4 |
| Does not meet standards | 1 |
Using Necessary Information to Identify Item Dependence in Passage-Based Reading Comprehension Tests
Baldonado, Angela Argo; Svetina, Dubravka; Gorin, Joanna – Applied Measurement in Education, 2015
Applications of traditional unidimensional item response theory models to passage-based reading comprehension assessment data have been criticized based on potential violations of local independence. However, simple rules for determining dependency, such as including all items associated with a particular passage, may overestimate the dependency…
Descriptors: Reading Tests, Reading Comprehension, Test Items, Item Response Theory
Baghaei, Purya; Kubinger, Klaus D. – Practical Assessment, Research & Evaluation, 2015
The present paper gives a general introduction to the linear logistic test model (Fischer, 1973), an extension of the Rasch model with linear constraints on item parameters, along with eRm (an R package to estimate different types of Rasch models; Mair, Hatzinger, & Mair, 2014) functions to estimate the model and interpret its parameters. The…
Descriptors: Item Response Theory, Models, Test Validity, Hypothesis Testing
Custer, Michael – Online Submission, 2015
This study examines the relationship between sample size and item parameter estimation precision when utilizing the one-parameter model. Item parameter estimates are examined relative to "true" values by evaluating the decline in root mean squared deviation (RMSD) and the number of outliers as sample size increases. This occurs across…
Descriptors: Sample Size, Item Response Theory, Computation, Accuracy
Cao, Yi; Lu, Ru; Tao, Wei – ETS Research Report Series, 2014
The local item independence assumption underlying traditional item response theory (IRT) models is often not met for tests composed of testlets. There are 3 major approaches to addressing this issue: (a) ignore the violation and use a dichotomous IRT model (e.g., the 2-parameter logistic [2PL] model), (b) combine the interdependent items to form a…
Descriptors: Item Response Theory, Equated Scores, Test Items, Simulation
Carlson, James E. – ETS Research Report Series, 2014
A little-known theorem, a generalization of Pythagoras's theorem, due to Pappus, is used to present a geometric explanation of various definitions of the contribution of component tests to their composite. I show that an unambiguous definition of the unique contribution of a component to the composite score variance is present if and only if the…
Descriptors: Geometric Concepts, Scores, Validity, Reliability
Yao, Lihua – Journal of Educational Measurement, 2014
The intent of this research was to find an item selection procedure in the multidimensional computer adaptive testing (CAT) framework that yielded higher precision for both the domain and composite abilities, had a higher usage of the item pool, and controlled the exposure rate. Five multidimensional CAT item selection procedures (minimum angle;…
Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Selection
Oliveri, María Elena; Ercikan, Kadriye; Zumbo, Bruno D. – Applied Measurement in Education, 2014
Heterogeneity within English language learners (ELLs) groups has been documented. Previous research on differential item functioning (DIF) analyses suggests that accurate DIF detection rates are reduced greatly when groups are heterogeneous. In this simulation study, we investigated the effects of heterogeneity within linguistic (ELL) groups on…
Descriptors: Test Bias, Accuracy, English Language Learners, Simulation
Hidalgo, Mª Dolores; Gómez-Benito, Juana; Zumbo, Bruno D. – Educational and Psychological Measurement, 2014
The authors analyze the effectiveness of the R[superscript 2] and delta log odds ratio effect size measures when using logistic regression analysis to detect differential item functioning (DIF) in dichotomous items. A simulation study was carried out, and the Type I error rate and power estimates under conditions in which only statistical testing…
Descriptors: Regression (Statistics), Test Bias, Effect Size, Test Items
Yakisan, Mehmet – International Journal of Environmental and Science Education, 2016
Biology that is a branch of science examining organisms in every aspect has a very wide content. Besides this wide content, there are abstract concepts in some subjects. Various alternative conceptions are determined in different education levels especially in abstract and microscopic biology subjects. The aim of this study is to determine the…
Descriptors: Preservice Teachers, Preservice Teacher Education, Genetics, Cytology
Slappendel, Geerte; Mandy, William; van der Ende, Jan; Verhulst, Frank C.; van der Sijde, Ad; Duvekot, Jorieke; Skuse, David; Greaves-Lord, Kirstin – Journal of Autism and Developmental Disorders, 2016
The Developmental Diagnostic Dimensional Interview-short version (3Di-sv) provides a brief standardized parental interview for diagnosing autism spectrum disorder (ASD). This study explored its validity, and compatibility with DSM-5 ASD. 3Di-sv classifications showed good sensitivity but low specificity when compared to ADOS-2-confirmed clinical…
Descriptors: Diagnostic Tests, Pervasive Developmental Disorders, Interviews, Factor Analysis
Qudah, Ahmad Hassan – Journal of Education and Practice, 2016
This study aimed at identify the effect of using a proposed teaching strategy based on the selective thinking in acquire mathematical concepts by Classroom Teacher Students at Al- al- Bayt University, The sample of the study consisted of (74) students, equally distributed into a control group and an experimental group. The selective thinking…
Descriptors: Mathematics, Mathematics Instruction, Mathematical Concepts, Teaching Methods
Eleje, Lydia I.; Esomonu, Nkechi P. M.; Agu, Ngozi N.; Okoye, Romy O.; Obasi, Emma; Onah, Frederick E. – World Journal of Education, 2016
A diagnostic test in economics to aid the teachers determine student's specific weak content areas was developed and validated. Five research questions guided the study. Preliminary validation was done by two experienced teachers in the content area of secondary economics and two experts in test construction. The pilot testing was conducted for…
Descriptors: Secondary School Students, Economics Education, Educational Diagnosis, Test Construction
Sadler, Philip M.; Sonnert, Gerhard; Coyle, Harold P.; Miller, Kelly A. – Educational Assessment, 2016
The psychometrically sound development of assessment instruments requires pilot testing of candidate items as a first step in gauging their quality, typically a time-consuming and costly effort. Crowdsourcing offers the opportunity for gathering data much more quickly and inexpensively than from most targeted populations. In a simulation of a…
Descriptors: Test Items, Test Construction, Psychometrics, Biological Sciences
Costa Carreras, Joan – Journal of Multilingual and Multicultural Development, 2016
In this paper, we analyse the relationship between use and perception of five Catalan variants of the variable "subject coreferential with an antecedent." This will be done by examining the results of a cloze test and three perception surveys answered in 1997 by 26 respondents. Two different correct constructions were then presented as…
Descriptors: Language Usage, Romance Languages, Language Variation, Surveys
Margolis, Melissa J.; Mee, Janet; Clauser, Brian E.; Winward, Marcia; Clauser, Jerome C. – Educational Measurement: Issues and Practice, 2016
Evidence to support the credibility of standard setting procedures is a critical part of the validity argument for decisions made based on tests that are used for classification. One area in which there has been limited empirical study is the impact of standard setting judge selection on the resulting cut score. One important issue related to…
Descriptors: Academic Standards, Standard Setting (Scoring), Cutting Scores, Credibility

Peer reviewed
Direct link
