Hagtvet, Knut A.; Solhaug, Trond – Scandinavian Journal of Educational Research, 2005
Recent literature on parcel indicators in measurement models used in covariance structural modelling has mainly been concerned with statistical properties of parameter estimates. Less attention has been paid to measurement properties for inferring the assumed latent construct. The present study illustrates a two-facet measurement model that…
Descriptors: Secondary School Students, Methods, Test Items
Johnson, Elizabeth K.; Jusczyk, Peter W.; Cutler, Anne; Norris, Dennis – Cognitive Psychology, 2003
The Possible Word Constraint limits the number of lexical candidates considered in speech recognition by stipulating that input should be parsed into a string of lexically viable chunks. For instance, an isolated single consonant is not a feasible word candidate. Any segmentation containing such a chunk is disfavored. Five experiments using the…
Descriptors: Test Items, Infants, Word Recognition, Experiments
Borsboom, Denny; Mellenbergh, Gideon J.; Van Heerden, Jaap – Applied Psychological Measurement, 2002
In this article, a distinction is made between absolute and relative measurement. Absolute measurement refers to the measurement of traits on a group-invariant scale, and relative measurement refers to the within-group measurement of traits, where the scale of measurement is expressed in terms of the within-group position on a trait. Relative…
Descriptors: Test Items, Measures (Individuals), Test Theory
Kasintorn, Tanachit – ProQuest LLC, 2009
The purpose of this study was to develop a test of academic readiness for first-grade instruction in Thailand. The Test of Academic Readiness (TAR) consists of six domains: verbal, visual, memory, math, logical, and general knowledge. Two pilot studies were carried out and a main study tested items in those domains. The Rasch model was used to assess the…
Descriptors: Content Validity, Reading Readiness Tests, Doctoral Dissertations, Foreign Countries
Ives, Sarah Elizabeth – ProQuest LLC, 2009
The purposes of this study were to investigate preservice mathematics teachers' orientations, content knowledge, and pedagogical content knowledge of probability; the relationships among these three aspects; and the usefulness of tasks with respect to examining these aspects of knowledge. The design of the study was a multi-case study of five…
Descriptors: Preservice Teachers, Test Items, Mathematics Teachers, Probability
Wang, Jing – ProQuest LLC, 2009
The ultimate goal of physics education research (PER) is to develop a theoretical framework to understand and improve the learning process. In this journey of discovery, assessment serves as our headlamp and alpenstock. It sometimes detects signals in student mental structures, and sometimes presents the difference between expert understanding and…
Descriptors: Test Items, Mathematical Models, Educational Testing, Physics
National Assessment Governing Board, 2009
As the ongoing national indicator of what American students know and can do, the National Assessment of Educational Progress (NAEP) in Reading regularly collects achievement information on representative samples of students in grades 4, 8, and 12. The information that NAEP provides about student achievement helps the public, educators, and…
Descriptors: National Competency Tests, Reading Tests, Test Items, Test Format
Huang, Chiungjung – Educational and Psychological Measurement, 2009
This study examined the percentage of task-sampling variability in performance assessment via a meta-analysis. In total, 50 studies containing 130 independent data sets were analyzed. Overall results indicate that the percentage of variance for (a) differential difficulty of task was roughly 12% and (b) examinee's differential performance of the…
Descriptors: Test Bias, Research Design, Performance Based Assessment, Performance Tests
Jang, Eunice Eunhee – Language Testing, 2009
With recent statistical advances in cognitive diagnostic assessment (CDA), the CDA approach has been increasingly applied to non-diagnostic tests partly to meet accountability demands for student achievement. The study aimed to evaluate critically the validity of the CDA application to an existing non-diagnostic L2 reading comprehension test and…
Descriptors: Feedback (Response), Reading Comprehension, Test Items, Validity
Paek, Insu; Lee, Jihyun; Stankov, Lazar; Wilson, Mark – ETS Research Report Series, 2008
This study investigated the relationship between students' actual performance (accuracy) and their subjective judgments of accuracy (confidence) on selected English language proficiency tests. The unidimensional and multidimensional IRT Rasch approaches were used to model the discrepancy between confidence and accuracy at the item and test level…
Descriptors: Self Esteem, Accuracy, Item Response Theory, English
Ballou, Dale – National Center on Performance Incentives, 2008
As currently practiced, value-added assessment relies on a strong assumption about the scales used to measure student achievement, namely that these are interval scales, with equal-sized gains at all points on the scale representing the same increment of learning. Many of the metrics in which test results are expressed do not have this property…
Descriptors: Test Items, Intervals, Data Analysis, Item Response Theory
Singh, Ashvind Nand – ProQuest LLC, 2008
Due to the relative inability of individuals with intellectual disabilities (ID) to provide an accurate and reliable self-report, assessment in this population is more difficult than with individuals in the general population. As such, assessment procedures must be adjusted to compensate for the relative lack of information that the individual can…
Descriptors: Test Items, Item Analysis, Test Construction, Behavior Rating Scales
Liu, Ou Lydia; Lee, Hee-Sun; Hofstetter, Carolyn; Linn, Marcia C. – Educational Assessment, 2008
In response to the demand for sound science assessments, this article presents the development of a latent construct called knowledge integration as an effective measure of science inquiry. Knowledge integration assessments ask students to link, distinguish, evaluate, and organize their ideas about complex scientific topics. The article focuses on…
Descriptors: Standardized Tests, Scoring Rubrics, Psychometrics, Concept Mapping
Lee, Young-Sun; Grossman, Jennifer; Krishnan, Anita – Educational and Psychological Measurement, 2008
This study examined the cultural relevance of adult attachment within a Korean sample (N = 390) using Rasch rating scale modeling. The psychometric properties of scores from the Korean version of the Revised Experiences in Close Relationships, comprised of two subscales of Anxiety (self) and Avoidance (other), were assessed. Results obtained from…
Descriptors: Cultural Relevance, Attachment Behavior, Rating Scales, Psychometrics
Belov, Dmitry I.; Armstrong, Ronald D.; Weissman, Alexander – Applied Psychological Measurement, 2008
This article presents a new algorithm for computerized adaptive testing (CAT) when content constraints are present. The algorithm is based on shadow CAT methodology to meet content constraints but applies Monte Carlo methods and provides the following advantages over shadow CAT: (a) lower maximum item exposure rates, (b) higher utilization of the…
Descriptors: Test Items, Monte Carlo Methods, Law Schools, Adaptive Testing

