Batty, Aaron Olaf – Language Testing, 2015
The rise in the affordability of quality video production equipment has resulted in increased interest in video-mediated tests of foreign language listening comprehension. Although research on such tests has continued fairly steadily since the early 1980s, studies have relied on analyses of raw scores, despite the growing prevalence of item…
Descriptors: Listening Comprehension Tests, Comparative Analysis, Video Technology, Audio Equipment
Sato, Takanori; Ikeda, Naoki – Language Testing in Asia, 2015
Background: High-stakes tests have an immense washback effect on what students learn and affect the content of student learning. However, if students fail to recognize the abilities that the test developers intend to measure, they are less likely to learn what the test developers wish them to learn. This study aims to investigate test-taker…
Descriptors: High Stakes Tests, Testing Problems, Test Items, College Students
New Meridian Corporation, 2020
The purpose of this report is to describe the technical qualities of the 2018-2019 operational administration of the English language arts/literacy (ELA/L) and mathematics summative assessments in grades 3 through 8 and high school. The ELA/L assessments focus on reading and comprehending a range of sufficiently complex texts independently and…
Descriptors: Language Arts, Literacy Education, Mathematics Education, Summative Evaluation
New Meridian Corporation, 2020
The purpose of this report is to describe the technical qualities of the 2018-2019 operational administration of the English language arts/literacy (ELA/L) and mathematics assessments in grades 3 through 8 and high school. New Meridian, in coordination with multiple states and vendors, developed an alternate form of the summative assessment to…
Descriptors: Language Arts, Literacy Education, Mathematics Education, Summative Evaluation
Demars, Christine E. – Applied Measurement in Education, 2011
Three types of effect sizes for DIF are described in this exposition: log of the odds-ratio (differences in log-odds), differences in probability-correct, and proportion of variance accounted for. Using these indices involves conceptualizing the degree of DIF in different ways. This integrative review discusses how these measures are impacted in…
Descriptors: Effect Size, Test Bias, Probability, Difficulty Level
Facon, Bruno; Magis, David; Nuchadee, Marie-Laure; De Boeck, Paul – Intelligence, 2011
Standardized tests are used widely in comparative studies of clinical populations, either as dependent or control variables. Yet, one cannot always be sure that the test items measure the same constructs in the groups under study. In the present work, 460 participants with intellectual disability of undifferentiated etiology and 488 typical…
Descriptors: Intelligence Tests, Standardized Tests, Mental Retardation, Children
Ritter, Nicola; Kilinc, Emin; Navruz, Bilgin; Bae, Yunhee – Journal of Psychoeducational Assessment, 2011
This article reviews Test of Nonverbal Intelligence-Fourth Edition (TONI-4), an individually administered instrument created to assess intelligence. The distinguishing characteristic of the TONI-4 is the nonverbal, motor-reduced format that assesses common elements of intelligence without the confounding effects of motor or linguistic skills. The…
Descriptors: Nonverbal Tests, Intelligence Tests, Scoring, Test Items
Braeken, Johan – Psychometrika, 2011
Conditional independence is a fundamental principle in latent variable modeling and item response theory. Violations of this principle, commonly known as local item dependencies, are put in a test information perspective, and sharp bounds on these violations are defined. A modeling approach is proposed that makes use of a mixture representation of…
Descriptors: Test Construction, Item Response Theory, Models, Tests
O'Keefe, Robert D.; Hamer, Lawrence O.; Kemp, Philip R. – Journal of Learning in Higher Education, 2013
Assessment, or better stated, the assurance of student learning, has become a central issue in both the internal and the external evaluations of degree programs offered by colleges and universities. The continual importance of assurance of learning activities within institutions of higher education has generated a growing need to create empirical…
Descriptors: Alignment (Education), Program Evaluation, Course Evaluation, Learning Activities
Wang, Lin; Qian, Jiahe; Lee, Yi-Hsuan – ETS Research Report Series, 2013
The purpose of this study was to evaluate the combined effects of reduced equating sample size and shortened anchor test length on item response theory (IRT)-based linking and equating results. Data from two independent operational forms of a large-scale testing program were used to establish the baseline results for evaluating the results from…
Descriptors: Test Construction, Item Response Theory, Testing Programs, Simulation
Nevid, Jeffrey S.; McClelland, Nate – Journal of Education and Training Studies, 2013
We used a set of action verbs based on Bloom's taxonomy to assess learning outcomes in two college-level introductory psychology courses. The action verbs represented an acronym, IDEA, comprising skills relating to identifying, defining or describing, evaluating or explaining, and applying psychological knowledge. Exam performance demonstrated…
Descriptors: Verbs, Taxonomy, Introductory Courses, Psychology
Solano-Flores, Guillermo; Barnett-Clarke, Carne; Kachchaf, Rachel R. – Educational Assessment, 2013
We examined the performance of English language learners (ELLs) and non-ELLs on Grade 4 and Grade 5 mathematics content knowledge (CK) and academic language (AL) tests. CK and AL items had different semiotic loads (numbers of different types of semiotic features) and different semiotic structures (relative frequencies of different semiotic…
Descriptors: English Language Learners, Performance, Mathematics Tests, Semiotics
Raker, Jeffrey R.; Holme, Thomas A. – Journal of Chemical Education, 2013
Standardized examinations, such as those developed and disseminated by the ACS Examinations Institute, are artifacts of the teaching of a course and over time may provide a historical perspective on how curricula have changed and evolved. This study investigated changes in organic chemistry curricula across a 60-year period by evaluating 18 ACS…
Descriptors: Organic Chemistry, Science Education History, Curriculum Research, Educational Development
Young, Arthur; Shawl, Stephen J. – Astronomy Education Review, 2013
Professors who teach introductory astronomy to students not majoring in science desire them to comprehend the concepts and theories that form the basis of the science. They are usually less concerned about the myriad of detailed facts and information that accompanies the science. As such, professors prefer to test the students for such…
Descriptors: Multiple Choice Tests, Classification, Astronomy, Introductory Courses
Li, Xueming; Sireci, Stephen G. – Educational and Psychological Measurement, 2013
Validity evidence based on test content is of essential importance in educational testing. One source for such evidence is an alignment study, which helps evaluate the congruence between tested objectives and those specified in the curriculum. However, the results of an alignment study do not always sufficiently capture the degree to which a test…
Descriptors: Content Validity, Multidimensional Scaling, Data Analysis, Educational Testing

