Publication Date
| In 2026 | 0 |
| Since 2025 | 220 |
| Since 2022 (last 5 years) | 1089 |
| Since 2017 (last 10 years) | 2599 |
| Since 2007 (last 20 years) | 4960 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 653 |
| Teachers | 563 |
| Researchers | 250 |
| Students | 201 |
| Administrators | 81 |
| Policymakers | 22 |
| Parents | 17 |
| Counselors | 8 |
| Community | 7 |
| Support Staff | 3 |
| Media Staff | 1 |
| More ▼ | |
Location
| Turkey | 226 |
| Canada | 223 |
| Australia | 155 |
| Germany | 116 |
| United States | 99 |
| China | 90 |
| Florida | 86 |
| Indonesia | 82 |
| Taiwan | 78 |
| United Kingdom | 73 |
| California | 66 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 4 |
| Meets WWC Standards with or without Reservations | 4 |
| Does not meet standards | 1 |
Noble, Tracy; Rosebery, Ann; Suarez, Catherine; Warren, Beth; O'Connor, Mary Catherine – Applied Measurement in Education, 2014
English language learners (ELLs) and their teachers, schools, and communities face increasingly high-stakes consequences due to test score gaps between ELLs and non-ELLs. It is essential that the field of educational assessment continue to investigate the meaning of these test score gaps. This article discusses the findings of an exploratory study…
Descriptors: English Language Learners, Evidence, Educational Assessment, Achievement Gap
Jennings, Jennifer L.; Bearak, Jonathan Marc – Educational Researcher, 2014
What is "teaching to the test," and can one detect evidence of this practice in state test scores? This paper unpacks this concept and empirically investigates one variant of it by analyzing test item--level data from three states' mathematics and reading tests. We show that NCLB-era state tests predictably emphasized some state…
Descriptors: Federal Legislation, Educational Legislation, Standards, Scores
Nation, Paul; Coxhead, Averil – Language Teaching, 2014
The English Language Institute (now the School of Linguistics and Applied Language Studies) at Victoria University of Wellington has a long history of corpus-based vocabulary research, especially after the arrival of the second director of the institute, H. V. George, and the appointment of Helen Barnard, whom George knew in India. George's…
Descriptors: Foreign Countries, Computational Linguistics, Vocabulary, Receptive Language
Huynh, Huynh; Meyer, Patrick – Practical Assessment, Research & Evaluation, 2010
The first part of this paper describes the use of the robust z[subscript R] statistic to link test forms using the Rasch (or one-parameter logistic) model. The procedure is then extended to the two-parameter and three-parameter logistic and two-parameter partial credit (2PPC) models. A real set of data was used to illustrate the extension. The…
Descriptors: Item Response Theory, Statistics, Educational Assessment, Test Items
Wainer, Howard; Bradlow, Eric; Wang, Xiaohui – Journal of Educational and Behavioral Statistics, 2010
Confucius pointed out that the first step toward wisdom is calling things by the right name. The term "Differential Item Functioning" (DIF) did not arise fully formed from the miasma of psychometrics, it evolved from a variety of less accurate terms. Among its forebears was "item bias" but that term has a pejorative connotation…
Descriptors: Test Bias, Difficulty Level, Test Items, Statistical Analysis
Penfield, Randall D. – Applied Psychological Measurement, 2010
In 2008, Penfield showed that measurement invariance across all response options of a multiple-choice item (correct option and the "J" distractors) can be modeled using a nominal response model that included a differential distractor functioning (DDF) effect for each of the "J" distractors. This article extends this concept to consider how the…
Descriptors: Test Bias, Test Items, Models, Multiple Choice Tests
Kilinc, Emin – ProQuest LLC, 2012
We are living in a conceptual world which we build through both informal and systematic interaction. Concepts enable us to simplify and organize our environment and communicate efficiently with others. The learning of concepts is represented by a general idea, usually expressed by a word, which represent a class or group of things or actions…
Descriptors: Concept Formation, Secondary School Students, Social Studies, Academic Achievement
Gallagher, Nancy – Delta Publishing Company, 2012
Six Practice Tests for the iBT gives students plenty of practice as they prepare for the Internet-based TOEFL (iBT) or the new form of the institutional TOEFL (ITP). This new book/audio set contains a concise description of the TOEFL and the types of questions in each section, as well as six full-length tests that have not been published before.…
Descriptors: English (Second Language), Language Tests, Internet, Computer Assisted Testing
Freeman, Andrew J.; Youngstrom, Eric A.; Frazier, Thomas W.; Youngstrom, Jennifer Kogos; Demeter, Christine; Findling, Robert L. – Psychological Assessment, 2012
Robust screening measures that perform well in different populations could help improve the accuracy of diagnosis of pediatric bipolar disorder. Changes in sampling could influence the performance of items and potentially influence total scores enough to alter the predictive utility of scores. Additionally, creating a brief version of a measure by…
Descriptors: Test Items, Sampling, Caregivers, Test Bias
Suh, Youngsuk; Cho, Sun-Joo; Wollack, James A. – Journal of Educational Measurement, 2012
In the presence of test speededness, the parameter estimates of item response theory models can be poorly estimated due to conditional dependencies among items, particularly for end-of-test items (i.e., speeded items). This article conducted a systematic comparison of five-item calibration procedures--a two-parameter logistic (2PL) model, a…
Descriptors: Response Style (Tests), Timed Tests, Test Items, Item Response Theory
Han, Kyung T. – Journal of Educational Measurement, 2012
Successful administration of computerized adaptive testing (CAT) programs in educational settings requires that test security and item exposure control issues be taken seriously. Developing an item selection algorithm that strikes the right balance between test precision and level of item pool utilization is the key to successful implementation…
Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Selection
Cawthon, Stephanie W.; Kaye, Alyssa D.; Lockhart, L. Leland; Beretvas, S. Natasha – Journal of School Psychology, 2012
Many students with learning disabilities (SLD) participate in standardized assessments using test accommodations such as extended time, having the test items read aloud, or taking the test in a separate setting. Yet there are also aspects of the test items themselves, particularly the language demand, which may contribute to the effects of test…
Descriptors: Testing Accommodations, Test Items, Linguistics, Learning Disabilities
Quinnell, Lorna; Carter, Merilyn – Australian Mathematics Teacher, 2012
The language of mathematics is unique and complex. One feature of the mathematical register is the use of symbols and abbreviations. Whilst it may be possible for a student to think mathematically in the absence of symbols, the written communication of mathematical ideas cannot be achieved concisely without the use of mathematical symbols.…
Descriptors: Test Items, Symbols (Mathematics), Numeracy, Mathematics Instruction
Wakita, Takafumi; Ueshima, Natsumi; Noguchi, Hiroyuki – Educational and Psychological Measurement, 2012
This study examined whether the number of options in the Likert scale influences the psychological distance between categories. The most important assumption when using the Likert scale is that the psychological distance between options is equal. The authors proposed a new algorithm for calculating the scale values of options by applying item…
Descriptors: Likert Scales, Test Items, Personality Measures, Item Response Theory
Arce, Alvaro J.; Wang, Ze – International Journal of Testing, 2012
The traditional approach to scale modified-Angoff cut scores transfers the raw cuts to an existing raw-to-scale score conversion table. Under the traditional approach, cut scores and conversion table raw scores are not only seen as interchangeable but also as originating from a common scaling process. In this article, we propose an alternative…
Descriptors: Generalizability Theory, Item Response Theory, Cutting Scores, Scaling

Peer reviewed
Direct link
