Publication Date
In 2025 | 1 |
Since 2024 | 9 |
Since 2021 (last 5 years) | 42 |
Since 2016 (last 10 years) | 123 |
Since 2006 (last 20 years) | 758 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
Practitioners | 453 |
Teachers | 184 |
Policymakers | 145 |
Researchers | 145 |
Administrators | 125 |
Students | 55 |
Parents | 49 |
Community | 31 |
Counselors | 9 |
Support Staff | 1 |
Location
Canada | 138 |
California | 104 |
New York | 102 |
United States | 94 |
Florida | 90 |
North Carolina | 90 |
Texas | 78 |
New Jersey | 62 |
Australia | 55 |
Pennsylvania | 55 |
Arizona | 51 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Does not meet standards | 1 |
Shohamy, Elana – Language and Intercultural Communication, 2013
While much of the work in language testing is concerned with constructing quality tests in order to measure language knowledge in reliable and valid ways, there has been a significant movement in language testing research that examines tests in the context of their use in education and society. This line of research exits from the notion that…
Descriptors: Language Tests, Testing, Evaluation Research, Ideology
Willner, Lynn Shafer; Rivera, Charlene; Acosta, Barbara D. – Reading Teacher, 2009
This column offers an overview of the requirements for including and accommodating English-language learners (ELLs) in content assessments and an explanation of how accommodations for ELLs work. It concludes with recommendations drawn from research and practice to ensure accommodations are assigned and implemented in ways that are likely to…
Descriptors: English (Second Language), Limited English Speaking, Testing Accommodations, Testing Programs
Ferrara, Steve; Svetina, Dubravka; Skucha, Sylvia; Davidson, Anne H. – Educational Measurement: Issues and Practice, 2011
Items on test score scales located at and below the Proficient cut score define the content area knowledge and skills required to achieve proficiency. Alternately, examinees who perform at the Proficient level on a test can be expected to be able to demonstrate that they have mastered most of the knowledge and skills represented by the items at…
Descriptors: Knowledge Level, Mathematics Tests, Program Effectiveness, Inferences
Utah State Office of Education, 2012
Utah has successfully implemented a variety of endeavors to ensure literacy for all students. Proficiency rates in language arts in Utah have improved in all grade levels since 2005. Emphasis has been placed on grades K-3 and early intervention for students at risk. Resources available to these students include optional extended-day kindergarten,…
Descriptors: At Risk Students, Reading Programs, Reading Instruction, Early Intervention
Elliott, Stephen N.; Davies, Michael; Kettler, Ryan J. – International Journal of Disability, Development and Education, 2012
Australian legislation and educational policies may espouse, but not yet fully enact, inclusive assessments for all. In relation to the National Assessment Program for Literacy and Numeracy (NAPLAN), for example, almost 5% of students are either exempt or withdrawn. The achievement levels of these students, many of whom have disabilities, are not…
Descriptors: Academic Achievement, Achievement Rating, Comparative Analysis, Comparative Education
Filipi, Anna – Language Testing, 2012
The Assessment of Language Competence (ALC) certificates is an annual, international testing program developed by the Australian Council for Educational Research to test the listening and reading comprehension skills of lower to middle year levels of secondary school. The tests are developed for three levels in French, German, Italian and…
Descriptors: Listening Comprehension Tests, Item Response Theory, Statistical Analysis, Foreign Countries
Chen, Hanwei; Cui, Zhongmin; Zhu, Rongchun; Gao, Xiaohong – ACT, Inc., 2010
The most critical feature of a common-item nonequivalent groups equating design is that the average score difference between the new and old groups can be accurately decomposed into a group ability difference and a form difficulty difference. Two widely used observed-score linear equating methods, the Tucker and the Levine observed-score methods,…
Descriptors: Equated Scores, Groups, Ability Grouping, Difficulty Level
Thurlow, Martha L.; Bremer, Chris; Albus, Deb – National Center on Educational Outcomes, University of Minnesota, 2011
This is the thirteenth annual report by the National Center on Educational Outcomes (NCEO) that analyzes public reporting practices of assessment data for students with disabilities in K-12 schools in the United States. The Individuals with Disabilities Education Act (IDEA) required states to disaggregate performance data at the state and district…
Descriptors: Elementary Secondary Education, Disabilities, Data Analysis, Accountability
Jacobsen, Jared; Ackermann, Richard; Eguez, Jane; Ganguli, Debalina; Rickard, Patricia; Taylor, Linda – Journal of Applied Testing Technology, 2011
A computer adaptive test (CAT) is a delivery methodology that serves the larger goals of the assessment system in which it is embedded. A thorough analysis of the assessment system for which a CAT is being designed is critical to ensure that the delivery platform is appropriate and addresses all relevant complexities. As such, a CAT engine must be…
Descriptors: Delivery Systems, Testing Programs, Computer Assisted Testing, Foreign Countries
Hudson, T., Ed.; Clark, M., Ed. – National Foreign Language Resource Center at University of Hawaii, 2008
Although most language programs make placement decisions on the basis of placement tests, there is surprisingly little published about different contexts and systems of placement testing. The present volume contains case studies of placement programs in foreign language programs at the tertiary level across the United States. The different…
Descriptors: Student Placement, College Second Language Programs, Testing Programs, Case Studies
Lissitz, Robert W.; Wei, Hua – Educational Measurement: Issues and Practice, 2008
In this article we address the issue of consistency in standard setting in the context of an augmented state testing program. Information gained from the external NRT scores is used to help make an informed decision on the determination of cut scores on the state test. The consistency of cut scores on the CRT across grades is maintained by forcing…
Descriptors: Testing Programs, State Programs, Standard Setting, Reliability
Kang, Taehoon; Petersen, Nancy S. – ACT, Inc., 2009
This paper compares three methods of item calibration--concurrent calibration, separate calibration with linking, and fixed item parameter calibration--that are frequently used for linking item parameters to a base scale. Concurrent and separate calibrations were implemented using BILOG-MG. The Stocking and Lord (1983) characteristic curve method…
Descriptors: Standards, Testing Programs, Test Items, Statistical Distributions
Buckendahl, Chad W.; Plake, Barbara S.; Davis, Susan L. – Applied Measurement in Education, 2009
The National Assessment of Educational Progress (NAEP) program is a series of periodic assessments administered nationally to samples of students and designed to measure different content areas. This article describes a multi-year study that focused on the breadth of the development, administration, maintenance, and renewal of the assessments in…
Descriptors: National Competency Tests, Audits (Verification), Testing Programs, Program Evaluation
Anderson, Daniel; Alonzo, Julie; Tindal, Gerald – Behavioral Research and Teaching, 2011
In this technical report, we document the results of a cross-validation study designed to identify optimal cut-scores for the use of the easyCBM[R] mathematics test in the state of Washington. A large sample, randomly split into two groups of roughly equal size, was used for this study. Students' performance classification on the Washington state…
Descriptors: Testing Programs, Mathematics Tests, Prediction, Measurement Techniques
Doorey, Nancy A. – Council of Chief State School Officers, 2011
The work reported in this paper reflects a collaborative effort of many individuals representing multiple organizations. It began during a session at the October 2008 meeting of TILSA when a representative of a member state asked the group if any of their programs had experienced unexpected fluctuations in the annual state assessment scores, and…
Descriptors: Testing, Sampling, Expertise, Testing Programs