Baird, Jo-Anne; Gray, Lena – Oxford Review of Education, 2016
The ways in which examination standards are conceptualised and operationalised differently across nations have not been given sufficient attention. The international literature on standard-setting has been dominated by the psychometrics tradition. Broader conceptualisations of examination standards have been discussed in the literature in England,…
Descriptors: Foreign Countries, Academic Standards, Position Papers, Educational Policy
LaFlair, Geoffrey T.; Isbell, Daniel; May, L. D. Nicolas; Gutierrez Arvizu, Maria Nelly; Jamieson, Joan – Language Testing, 2017
Language programs need multiple test forms for secure administrations and effective placement decisions, but can they have confidence that scores on alternate test forms have the same meaning? In large-scale testing programs, various equating methods are available to ensure the comparability of forms. The choice of equating method is informed by…
Descriptors: Language Tests, Equated Scores, Testing Programs, Comparative Analysis
Ayeni, Abiodun Olumide – Journal of Education and Practice, 2015
This paper compared technical/vocational education in Germany, Australia, Finland, Hong Kong, Hungary, India, Japan, South Korea, Mexico, and Nigeria, and found that technical/vocational education was given proper attention in the countries considered except Nigeria, where it was handled with a laissez-faire attitude. Set-Up of Technical/Vocational…
Descriptors: Foreign Countries, Vocational Education, Comparative Analysis, Comparative Education
Linking Errors between Two Populations and Tests: A Case Study in International Surveys in Education
Hastedt, Dirk; Desa, Deana – Practical Assessment, Research & Evaluation, 2015
This simulation study was prompted by the current increased interest in linking national studies to international large-scale assessments (ILSAs) such as IEA's TIMSS, IEA's PIRLS, and OECD's PISA. Linkage in this scenario is achieved by including items from the international assessments in the national assessments on the premise that the average…
Descriptors: Case Studies, Simulation, International Programs, Testing Programs
Yang, Ji Seung; Cai, Li – Journal of Educational and Behavioral Statistics, 2014
The main purpose of this study is to improve estimation efficiency in obtaining maximum marginal likelihood estimates of contextual effects in the framework of nonlinear multilevel latent variable model by adopting the Metropolis-Hastings Robbins-Monro algorithm (MH-RM). Results indicate that the MH-RM algorithm can produce estimates and standard…
Descriptors: Computation, Hierarchical Linear Modeling, Mathematics, Context Effect
Ghaderi, Marzieh; Mogholi, Marzieh; Soori, Afshin – International Journal of Education and Literacy Studies, 2014
Testing is a subject with many subsets and connections. One important issue is how to assess or measure students or learners: what our tools, our style, and our goals should be. This paper therefore addresses the style of testing in schools and other educational settings. Since the purposes of educational system…
Descriptors: Testing, Testing Programs, Intermode Differences, Computer Assisted Testing
Debeer, Dries; Buchholz, Janine; Hartig, Johannes; Janssen, Rianne – Journal of Educational and Behavioral Statistics, 2014
In this article, the change in examinee effort during an assessment, which we will refer to as persistence, is modeled as an effect of item position. A multilevel extension is proposed to analyze hierarchically structured data and decompose the individual differences in persistence. Data from the 2009 Program of International Student Achievement…
Descriptors: Reading Tests, International Programs, Testing Programs, Individual Differences
Holme, Thomas – Journal of Chemical Education, 2014
Two different versions of "big ideas" rooted content maps have recently been published for general chemistry. As embodied in the content outline from the College Board, one of these maps is designed to guide curriculum development and testing for advanced placement (AP) chemistry. The Anchoring Concepts Content Map for general chemistry…
Descriptors: Chemistry, Advanced Placement, Curriculum Development, Curriculum Evaluation
Assouline, Susan G.; Lupkowski-Shoplik, Ann – Journal of Psychoeducational Assessment, 2012
The Talent Search model, founded at Johns Hopkins University by Dr. Julian C. Stanley, is fundamentally an above-level testing program. This simplistic description belies the enduring impact that the Talent Search model has had on the lives of hundreds of thousands of gifted students as well as their parents and teachers. In this article, we…
Descriptors: Testing Programs, Academically Gifted, Elementary Secondary Education, Talent
Hardy, Ian – Journal of Education Policy, 2014
This paper explores how the strong policy push to improve students' results on national literacy and numeracy tests -- the National Assessment Program, Literacy and Numeracy (NAPLAN) -- in the Australian state of Queensland influenced schooling practices, including teachers' learning. The paper argues the focus upon improved test scores on NAPLAN…
Descriptors: Literacy, Numeracy, Foreign Countries, Standardized Tests
Wang, Lin; Qian, Jiahe; Lee, Yi-Hsuan – ETS Research Report Series, 2013
The purpose of this study was to evaluate the combined effects of reduced equating sample size and shortened anchor test length on item response theory (IRT)-based linking and equating results. Data from two independent operational forms of a large-scale testing program were used to establish the baseline results for evaluating the results from…
Descriptors: Test Construction, Item Response Theory, Testing Programs, Simulation
DeBoer, George E. – Journal of Research in Science Teaching, 2011
Standards-based science education, with its emphasis on monitoring and accountability, is rapidly becoming a key part of the globalization of science education. Standards-based testing within countries is increasingly being used to determine the effectiveness of a country's educational system, and international testing programs such as Programme…
Descriptors: Testing Programs, Testing, Global Approach, Educational Change
Guo, Hongwen; Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2011
Nonparametric or kernel regression estimation of item response curves (IRCs) is often used in item analysis in testing programs. These estimates are biased when the observed scores are used as the regressor because the observed scores are contaminated by measurement error. Accuracy of this estimation is a concern theoretically and operationally.…
Descriptors: Testing Programs, Measurement, Item Analysis, Error of Measurement
von Davier, Alina A. – ETS Research Report Series, 2012
Maintaining comparability of test scores is a major challenge faced by testing programs that have almost continuous administrations. Among the potential problems are scale drift and rapid accumulation of errors. Many standard quality control techniques for testing programs, which can effectively detect and address scale drift for small numbers of…
Descriptors: Quality Control, Data Analysis, Trend Analysis, Scaling
Somerset, Anthony – Compare: A Journal of Comparative and International Education, 2011
Educational practitioners rely predominantly on measures of outcome, rather than of inputs or process, in making judgements as to quality. Outcome measures are available from two main sources: (1) the relatively new international assessment systems; and (2) the traditional national examinations systems. The two types of system differ in their…
Descriptors: Testing Programs, Educational Quality, National Competency Tests, Educational Improvement