Publication Date
| In 2026 | 0 |
| Since 2025 | 220 |
| Since 2022 (last 5 years) | 1089 |
| Since 2017 (last 10 years) | 2599 |
| Since 2007 (last 20 years) | 4960 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 653 |
| Teachers | 563 |
| Researchers | 250 |
| Students | 201 |
| Administrators | 81 |
| Policymakers | 22 |
| Parents | 17 |
| Counselors | 8 |
| Community | 7 |
| Support Staff | 3 |
| Media Staff | 1 |
| More ▼ | |
Location
| Turkey | 226 |
| Canada | 223 |
| Australia | 155 |
| Germany | 116 |
| United States | 99 |
| China | 90 |
| Florida | 86 |
| Indonesia | 82 |
| Taiwan | 78 |
| United Kingdom | 73 |
| California | 66 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 4 |
| Meets WWC Standards with or without Reservations | 4 |
| Does not meet standards | 1 |
Peer reviewedRudner, Lawrence M. – Educational Measurement: Issues and Practice, 2001
Identifies and evaluates alternative methods for weighting tests. Presents formulas for composite reliability and validity as a function of component weights and suggests a rational process that identifies and considers trade-offs in determining weights. Discusses drawbacks to implicit weighting and explicit weighting and the difficulty of…
Descriptors: Reliability, Test Construction, Test Items, Validity
Peer reviewedvan der Linden, Wim J. – Applied Psychological Measurement, 2000
Presents six computational methods based on mixed-integer programming for assembling tests from a bank with an item-set structure and evaluated these methods using mathematical programming feasibiity and expected solution times. Illustrates these methods with two data sets from the Law School Admission Test. Discusses the best approximations to…
Descriptors: Item Banks, Test Construction, Test Items
Peer reviewedMeara, Kevin; Robin, Frederic; Sireci, Stephen G. – Multivariate Behavioral Research, 2000
Investigated the usefulness of multidimensional scaling (MDS) for assessing the dimensionality of dichotomous test data. Focused on two MDS proximity measures, one based on the PC statistic (T. Chen and M. Davidson, 1996) and other, on interitem Euclidean distances. Simulation results show that both MDS procedures correctly identify…
Descriptors: Correlation, Multidimensional Scaling, Simulation, Test Items
Bryant, Damon U. – Psychometrika, 2005
The purpose of this note is twofold: (a) to present the formula for the item information function (IIF) in any direction for the Multidimensional 3-Parameter Logistic (M3-PL) model and (b) to give the equation for the location of maximum item information (theta[sub max]) in the direction of the item discrimination vector. Several corollaries are…
Descriptors: Test Items, Models, Psychometrics, Equations (Mathematics)
Ferdous, Abdullah A.; Plake, Barbara S. – Educational and Psychological Measurement, 2005
In an Angoff standard-setting procedure, judges estimate the probability that a hypothetical randomly selected minimally competent candidate will answer correctly each item constituting the test. In many cases, these item performance estimates are made twice, with information shared with the judges between estimates. Especially for long tests,…
Descriptors: Test Items, Probability, Standard Setting (Scoring)
Hagtvet, Knut A.; Solhaug, Trond – Scandinavian Journal of Educational Research, 2005
Recent literature on parcel indicators in measurement models used in covariance structural modelling has mainly been concerned with statistical properties of parameter estimates. Less attention has been paid to measurement properties for inferring the assumed latent construct. The present study illustrates a two-facet measurement model that…
Descriptors: Secondary School Students, Methods, Test Items
Johnson, Elizabeth K.; Jusczyk, Peter W.; Cutler, Anne; Norris, Dennis – Cognitive Psychology, 2003
The Possible Word Constraint limits the number of lexical candidates considered in speech recognition by stipulating that input should be parsed into a string of lexically viable chunks. For instance, an isolated single consonant is not a feasible word candidate. Any segmentation containing such a chunk is disfavored. Five experiments using the…
Descriptors: Test Items, Infants, Word Recognition, Experiments
Borsboom, Denny; Mellenbergh, Gideon J.; Van Heerden, Jaap – Applied Psychological Measurement, 2002
In this article, a distinction is made between absolute and relative measurement. Absolute measurement refers to the measurement of traits on a group-invariant scale, and relative measurement refers to the within-group measurement of traits, where the scale of measurement is expressed in terms of the within-group position on a trait. Relative…
Descriptors: Test Items, Measures (Individuals), Test Theory
Kasintorn, Tanachit – ProQuest LLC, 2009
The purpose of this study was to develop a test of academic readiness for first grade instruction in Thailand. Test of Academic Readiness (TAR) consists of six domains: verbal, visual, memory, math, logical, and general knowledge. Two pilot studies were carried out and a main study tested items in those domains. Rasch model was used to assess the…
Descriptors: Content Validity, Reading Readiness Tests, Doctoral Dissertations, Foreign Countries
Ives, Sarah Elizabeth – ProQuest LLC, 2009
The purposes of this study were to investigate preservice mathematics teachers' orientations, content knowledge, and pedagogical content knowledge of probability; the relationships among these three aspects; and the usefulness of tasks with respect to examining these aspects of knowledge. The design of the study was a multi-case study of five…
Descriptors: Preservice Teachers, Test Items, Mathematics Teachers, Probability
Wang, Jing – ProQuest LLC, 2009
The ultimate goal of physics education research (PER) is to develop a theoretical framework to understand and improve the learning process. In this journey of discovery, assessment serves as our headlamp and alpenstock. It sometimes detects signals in student mental structures, and sometimes presents the difference between expert understanding and…
Descriptors: Test Items, Mathematical Models, Educational Testing, Physics
National Assessment Governing Board, 2009
As the ongoing national indicator of what American students know and can do, the National Assessment of Educational Progress (NAEP) in Reading regularly collects achievement information on representative samples of students in grades 4, 8, and 12. The information that NAEP provides about student achievement helps the public, educators, and…
Descriptors: National Competency Tests, Reading Tests, Test Items, Test Format
Huang, Chiungjung – Educational and Psychological Measurement, 2009
This study examined the percentage of task-sampling variability in performance assessment via a meta-analysis. In total, 50 studies containing 130 independent data sets were analyzed. Overall results indicate that the percentage of variance for (a) differential difficulty of task was roughly 12% and (b) examinee's differential performance of the…
Descriptors: Test Bias, Research Design, Performance Based Assessment, Performance Tests
Jang, Eunice Eunhee – Language Testing, 2009
With recent statistical advances in cognitive diagnostic assessment (CDA), the CDA approach has been increasingly applied to non-diagnostic tests partly to meet accountability demands for student achievement. The study aimed to evaluate critically the validity of the CDA application to an existing non-diagnostic L2 reading comprehension test and…
Descriptors: Feedback (Response), Reading Comprehension, Test Items, Validity
Paek, Insu; Lee, Jihyun; Stankov, Lazar; Wilson, Mark – ETS Research Report Series, 2008
This study investigated the relationship between students' actual performance (accuracy) and their subjective judgments of accuracy (confidence) on selected English language proficiency tests. The unidimensional and multidimensional IRT Rasch approaches were used to model the discrepancy between confidence and accuracy at the item and test level…
Descriptors: Self Esteem, Accuracy, Item Response Theory, English

Direct link
