Publication Date
In 2025 | 1 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 3 |
Since 2006 (last 20 years) | 6 |
Descriptor
Evaluation Methods | 24 |
Test Items | 24 |
Test Use | 24 |
Test Construction | 16 |
Student Evaluation | 9 |
Test Validity | 8 |
Educational Assessment | 6 |
Elementary Secondary Education | 6 |
Test Reliability | 6 |
Foreign Countries | 5 |
Test Bias | 5 |
More ▼ |
Source
Author
Eignor, Daniel R. | 2 |
Ackerman, Terry A. | 1 |
Bauer, Scott C. | 1 |
Bishop, Laurence A. | 1 |
Buser, Karen | 1 |
Cabrera, George A. | 1 |
Cabrera, Nolan L. | 1 |
DeBlassie, Richard R. | 1 |
Dietel, Ron | 1 |
Dietel, Ronald | 1 |
Donovan, Jenny | 1 |
More ▼ |
Publication Type
Education Level
Elementary Secondary Education | 3 |
Elementary Education | 1 |
Grade 6 | 1 |
Audience
Practitioners | 4 |
Teachers | 2 |
Community | 1 |
Parents | 1 |
Students | 1 |
Location
Australia | 1 |
South Korea | 1 |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
National Assessment of… | 1 |
Pennsylvania Educational… | 1 |
Program for International… | 1 |
What Works Clearinghouse Rating
Kylie Gorney; Mark D. Reckase – Journal of Educational Measurement, 2025
In computerized adaptive testing, item exposure control methods are often used to provide a more balanced usage of the item pool. Many of the most popular methods, including the restricted method (Revuelta and Ponsoda), use a single maximum exposure rate to limit the proportion of times that each item is administered. However, Barrada et al.…
Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Item Banks
Gu, Lixiong; Ling, Guangming; Qu, Yanxuan – ETS Research Report Series, 2019
Research has found that the "a"-stratified item selection strategy (STR) for computerized adaptive tests (CATs) may lead to insufficient use of high a items at later stages of the tests and thus to reduced measurement precision. A refined approach, unequal item selection across strata (USTR), effectively improves test precision over the…
Descriptors: Computer Assisted Testing, Adaptive Testing, Test Use, Test Items
International Journal of Testing, 2019
These guidelines describe considerations relevant to the assessment of test takers in or across countries or regions that are linguistically or culturally diverse. The guidelines were developed by a committee of experts to help inform test developers, psychometricians, test users, and test administrators about fairness issues in support of the…
Descriptors: Test Bias, Student Diversity, Cultural Differences, Language Usage
Cabrera, Nolan L.; Cabrera, George A. – Educational Horizons, 2011
Just like all the high-stakes tests that determine students' futures nowadays, The Chorizo Test is a standardized test rooted in the culture of the test makers. It was originally created to be used with students in teacher training programs to sensitize them to the pitfalls inherent in standardized pencil-and-paper tests, such as linguistic bias…
Descriptors: Test Use, Standardized Tests, Social Sciences, High Stakes Tests
Herman, Joan L.; Osmundson, Ellen; Dietel, Ronald – Assessment and Accountability Comprehensive Center, 2010
This report describes the purposes of benchmark assessments and provides recommendations for selecting and using benchmark assessments--addressing validity, alignment, reliability, fairness and bias and accessibility, instructional sensitivity, utility, and reporting issues. We also present recommendations on building capacity to support schools'…
Descriptors: Multiple Choice Tests, Test Items, Benchmarking, Educational Assessment

Eignor, Daniel R. – Journal of Educational Measurement, 1997
The authors of the "Guidelines," a task force of eight, intend to present an organized list of features to be considered in reporting or evaluating computerized-adaptive assessments. Apart from a few weaknesses, the book is a useful and complete document that will be very helpful to test developers. (SLD)
Descriptors: Adaptive Testing, Computer Assisted Testing, Evaluation Methods, Guidelines

Gamble, Charles W.; Hamblin, Arthur G. – Psychology in the Schools, 1986
Discusses the use of a sentence completion instrument predicated on Lazarus' multimodal system. The instrument, entitled The Multimodal Sentence Completion Form for Children (MSCF-C), is designed to systematically assess client needs and assist in identifying intervention strategies. Presents a case study of a 12-year-old, sixth-grade student.…
Descriptors: Case Studies, Counseling, Elementary Education, Elementary School Students
Parsons, Jim; Fenwick, Tara – 1999
This "toolbox" offers suggestions about how and when to create objective tests. Such tests are sometimes a quick way to find out how students are doing, and sometimes they help students focus on what they are doing in class or help teachers define the content that is worth knowing. The following suggestions are offered for developing objective…
Descriptors: Elementary Secondary Education, Evaluation Methods, Foreign Countries, Objective Tests

Ackerman, Terry A. – Applied Measurement in Education, 1994
When item response data do not satisfy the unidimensionality assumption, multidimensional item response theory (MIRT) should be used to model the item-examinee interaction. This article presents and discusses MIRT analyses designed to give better insight into what individual items are measuring. (SLD)
Descriptors: Evaluation Methods, Item Response Theory, Measurement Techniques, Models
Merz, William R. – 1980
Several methods of assessing test item bias are described, and the concept of fair use of tests is examined. A test item is biased if individuals of equal ability have different probabilities of attaining the item correct. The following seven general procedures used to examine test items for bias are summarized and discussed: (1) analysis of…
Descriptors: Comparative Analysis, Evaluation Methods, Factor Analysis, Mathematical Models

Roberts, Eileen; DeBlassie, Richard R. – Adolescence, 1983
Defines test bias as a phenomenon in which test scores result in negative outcomes for certain groups, often lower socioeconomic groups and minorities. Discusses three manifestations of test bias including content, atmosphere, and use bias and presents recommendations for remedying bias problems in testing the culturally different. (JAC)
Descriptors: Adolescents, Cultural Differences, Evaluation Methods, Intelligence Tests
Swaak, Janine; And Others – 1997
A study was conducted to develop a test that is able to capture knowledge of an intuitive nature, such as that acquired through discovery learning. The proposed test format is called the "what-if test." Test items in this format consist of the presentation of a situation. A change in the situation is introduced, and learners have to…
Descriptors: College Students, Discovery Learning, Educational Assessment, Evaluation Methods
Dietel, Ron – Center for Assessment and Evaluation of Student Learning (CAESL) at WestEd, 2004
The accuracy and fairness of standardized testing is taken very seriously in the education world. These issues are a major focus of both the testing experts who develop standardized tests and the researchers who endeavor to ensure a test's fairness, reliability, validity, and accuracy. But many issues remain both controversial and complex. The…
Descriptors: Testing, Standardized Tests, Test Items, Test Bias

Bauer, Scott C. – Education Policy Analysis Archives, 2000
Studied whether student scores on standardized tests represent reasonable measures of instructional quality using ratings by 10 parents and 11 educators (school principals) of the degree to which test items from a nationally marketed standardized achievement test represent the content actually taught. On average, raters felt that test items…
Descriptors: Achievement Tests, Educational Quality, Elementary Secondary Education, Evaluation Methods
Buser, Karen – 1996
Most seasoned test developers recognize the importance of thoughtful decision making when constructing a test. Unfortunately, many classroom achievement tests are created by novice test developed who have not received sufficient instruction in item writing (G. Gulliksen, 1986; R. J. Stiggins, 1991). The result is often a test that is poorly…
Descriptors: Achievement Tests, Decision Making, Educational Planning, Evaluation Methods
Previous Page | Next Page »
Pages: 1 | 2