Publication Date
| In 2026 | 1 |
| Since 2025 | 599 |
| Since 2022 (last 5 years) | 2536 |
| Since 2017 (last 10 years) | 5571 |
| Since 2007 (last 20 years) | 9167 |
Descriptor
| Test Validity | 21743 |
| Test Reliability | 9997 |
| Test Construction | 5880 |
| Foreign Countries | 4941 |
| Psychometrics | 2956 |
| Factor Analysis | 2938 |
| Measures (Individuals) | 2370 |
| Higher Education | 2248 |
| Evaluation Methods | 2084 |
| College Students | 1810 |
| Correlation | 1722 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 728 |
| Practitioners | 429 |
| Teachers | 142 |
| Administrators | 96 |
| Policymakers | 57 |
| Counselors | 36 |
| Students | 20 |
| Parents | 13 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 2 |
| More ▼ | |
Location
| Turkey | 805 |
| Australia | 347 |
| Canada | 324 |
| China | 300 |
| United States | 188 |
| Indonesia | 170 |
| Spain | 168 |
| United Kingdom | 160 |
| Netherlands | 158 |
| California | 155 |
| Germany | 153 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 1 |
| Meets WWC Standards with or without Reservations | 3 |
| Does not meet standards | 1 |
Peer reviewedMoy, Raymond H. – System, 1984
Discusses the problems associated with "grading on a curve," the approach often used for standard setting on language proficiency tests. Proposes four main steps presented in the setting of a non-arbitrary cut-score. These steps not only establish a proficiency standard checked by external criteria, but also check to see that the test covers the…
Descriptors: Cloze Procedure, Correlation, Dictation, English (Second Language)
Peer reviewedNelson, Larry R. – Educational Measurement: Issues and Practice, 1984
The author argues that scoring, reporting, and deriving final grades can be considerably assisted by using a computer. He also contends that the savings in time and the computer database formed will allow instructors to determine test quality and reflect on the quality of instruction. (BW)
Descriptors: Achievement Tests, Affective Objectives, Computer Assisted Testing, Educational Testing
Peer reviewedLinn, Robert L.; Hastings, C. Nicholas – Journal of Educational Measurement, 1984
Using predictive validity studies of the Law School Admissions Test (LSAT) and the undergraduate grade-point average (UGPA), this study examined the large variation in the magnitude of the validity coefficients across schools. LSAT standard deviation and correlation between LSAT and UGPA accounted for 58.5 percent of the variability. (Author/EGS)
Descriptors: Academic Achievement, College Applicants, College Entrance Examinations, Grade Point Average
Peer reviewedHenning, Grant; And Others – System, 1983
"Listening Recall" is a listening comprehension test which discriminates over a wide range of proficiency. Unlike traditional tests, requiring multiple choice responses, it is a listening cloze procedure; a narrative passage accompanied by a written version with deletions. Particularly suited to low proficiency learners, test has high…
Descriptors: Cloze Procedure, English (Second Language), Language Aptitude, Language Proficiency
Peer reviewedThornburg, Kathy R.; And Others – Journal of Experimental Education, 1983
The Parent as a Teacher Inventory, a measure of parents' feelings and beliefs regarding interaction with their young children, was administered to 615 parents of preschool children. The failure to validate the subsets illustrates the complexity of parenting behavior and the difficulties with content vs. construct validity. (Author/PN)
Descriptors: Adults, Attitude Measures, Factor Analysis, Factor Structure
Peer reviewedHale, Gordon A.; And Others – Modern Language Journal, 1984
Provides a bibliography of published research papers that either describe the history of the TOEFL, offer a critical review of the test, or interpret TOEFL research findings. Some topics include: the correlation of TOEFL with other standardized tests of English language proficiency, TOEFL's role as a predictor of academic performance, the…
Descriptors: Cultural Awareness, English (Second Language), Ethnic Groups, Language Proficiency
Allen, Jon G. – Journal of Counsulting and Clinical Psychology, 1976
The Test of Emotional Styles measures three broad dimensions of emotionality: responsiveness, expressiveness, and orientation. This study examined the relationships between the forced-choice Test of Emotional Styles dimensions and measures of related constructs. The patterns of correlations generally support the construct validity of the test.…
Descriptors: Adjustment (to Environment), Affective Behavior, College Students, Emotional Experience
Halliburton, Warren J. – Freedomways, 1976
Reviews the test titled Psychological Testing of Minorities and notes that this book brings to the forum by which education may be made accessible to the underprivileged people of society without penalizing them for not belonging to the middle class mainstream culture. (Author/AM)
Descriptors: Book Reviews, Cultural Differences, Educational Testing, Middle Class Standards
Peer reviewedHirvonen, P. A. – System, 1977
Defends the use of multiple-choice language tests against Pickering's criticism in a previous issue of this journal. (CHK)
Descriptors: Aptitude Tests, Language Instruction, Language Tests, Multiple Choice Tests
Peer reviewedBudoff, Milton; Hamilton, James L. – American Journal of Mental Deficiency, 1976
The validity of a learning potential assessment procedure with institutionalized moderately and severely retarded adolescents and adults was examined with 38 Ss. (Author)
Descriptors: Adolescents, Aptitude Tests, Educational Assessment, Exceptional Child Research
DeMars, Christine E. – Online Submission, 2005
Several methods for estimating item response theory scores for multiple subtests were compared. These methods included two multidimensional item response theory models: a bi-factor model where each subtest was a composite score based on the primary trait measured by the set of tests and a secondary trait measured by the individual subtest, and a…
Descriptors: Item Response Theory, Multidimensional Scaling, Correlation, Scoring Rubrics
Cowley, Kimberly S.; Voelkel, Susan; Finch, Nicole L.; Meehan, Merrill L. – Appalachia Educational Laboratory at Edvantia (NJ1), 2005
The Perceptions Of School Culture (POSC) instrument was designed to measure the perceptions of a school staff regarding various dimensions of school culture contained in a hypothesized model of school cultural change. Specifically, this model posits that the development of a high-performance learning culture is influenced by school vision and…
Descriptors: Academic Ability, Teacher Effectiveness, School Culture, Educational Environment
Ceperley, Patricia E.; Hughes, Georgia K.; Mittapalli, Kavita – Appalachia Educational Laboratory at Edvantia (NJ1), 2005
The purpose of this report is to document the pilot test of the Instruction and Learning Appraisal (ILA) and describe the quality of the ILA process. The ILA process was developed by Edvantia staff who serve as technical assistance providers to schools and districts in the region. Low-performing schools and districts (i.e., those that fail to…
Descriptors: Educational Improvement, Technical Assistance, Case Studies, Low Achievement
Sedere, Upali M. – Online Submission, 2001
This paper offers a broader framework for assessment and evaluation of teachers in developing countries. Teacher is seen not only as some one who imparts knowledge and skills but in the developing world, teacher has a wider role to play. (Contains 2 tables, 3 figures, and 13 footnotes.)
Descriptors: Teacher Evaluation, Developing Nations, Teacher Education, Teacher Competency Testing
McCowan, Richard J.; McCowan, Sheila C. – Online Submission, 1999
This paper describes major concepts related to item analysis for criterion-referenced tests including validity, reliability, item difficulty, and item discrimination, particularly in relation to criterion-referenced tests. The paper discussed how these concepts can be used to revise and improve items and listed suggestions regarding general…
Descriptors: Criterion Referenced Tests, Standard Setting, Item Analysis, Item Response Theory


