NotesFAQContact Us
Collection
Advanced
Search Tips
Audience
Researchers1
Laws, Policies, & Programs
Debra P v Turlington2
What Works Clearinghouse Rating
Showing 1 to 15 of 130 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Attali, Yigal – Educational Measurement: Issues and Practice, 2019
Rater training is an important part of developing and conducting large-scale constructed-response assessments. As part of this process, candidate raters have to pass a certification test to confirm that they are able to score consistently and accurately before they begin scoring operationally. Moreover, many assessment programs require raters to…
Descriptors: Evaluators, Certification, High Stakes Tests, Scoring
Peer reviewed Peer reviewed
Direct linkDirect link
Sinharay, Sandip – Educational Measurement: Issues and Practice, 2018
The choice of anchor tests is crucial in applications of the nonequivalent groups with anchor test design of equating. Sinharay and Holland (2006, 2007) suggested "miditests," which are anchor tests that are content-representative and have the same mean item difficulty as the total test but have a smaller spread of item difficulties.…
Descriptors: Test Content, Difficulty Level, Test Items, Test Construction
Peer reviewed Peer reviewed
Direct linkDirect link
Camara, Wayne – Educational Measurement: Issues and Practice, 2014
This article reviews the intended uses of these college- and career-readiness assessments with the goal of articulating an appropriate validity argument to support such uses. These assessments differ fundamentally from today's state assessments employed for state accountability. Current assessments are used to determine if students have…
Descriptors: College Readiness, Career Readiness, Aptitude Tests, Test Use
Peer reviewed Peer reviewed
Direct linkDirect link
Koch, Martha J. – Educational Measurement: Issues and Practice, 2014
Implications of the multiple-use of accountability assessments for the process of validation are examined. Multiple-use refers to the simultaneous use of results from a single administration of an assessment for its intended use and for one or more additional uses. A theoretical discussion of the issues for validation which emerge from…
Descriptors: Foreign Countries, Test Use, Accountability, Validity
Peer reviewed Peer reviewed
Direct linkDirect link
Kolen, Michael J.; Lee, Won-Chan – Educational Measurement: Issues and Practice, 2011
This paper illustrates that the psychometric properties of scores and scales that are used with mixed-format educational tests can impact the use and interpretation of the scores that are reported to examinees. Psychometric properties that include reliability and conditional standard errors of measurement are considered in this paper. The focus is…
Descriptors: Test Use, Test Format, Error of Measurement, Raw Scores
Peer reviewed Peer reviewed
Direct linkDirect link
Nichols, Paul D.; Williams, Natasha – Educational Measurement: Issues and Practice, 2009
This article has three goals. The first goal is to clarify the role that the consequences of test score use play in validity judgments by reviewing the role that modern writers on validity have ascribed for consequences in supporting validity judgments. The second goal is to summarize current views on who is responsible for collecting evidence of…
Descriptors: Tests, Test Validity, Scores, Data Collection
Peer reviewed Peer reviewed
Direct linkDirect link
Nichols, Paul D.; Meyers, Jason L.; Burling, Kelly S. – Educational Measurement: Issues and Practice, 2009
Assessments labeled as formative have been offered as a means to improve student achievement. But labels can be a powerful way to miscommunicate. For an assessment use to be appropriately labeled "formative," both empirical evidence and reasoned arguments must be offered to support the claim that improvements in student achievement can be linked…
Descriptors: Academic Achievement, Tutoring, Student Evaluation, Evaluation Methods
Peer reviewed Peer reviewed
Direct linkDirect link
Shepard, Lorrie A. – Educational Measurement: Issues and Practice, 2009
In many school districts, the pressure to raise test scores has created overnight celebrity status for formative assessment. Its powers to raise student achievement have been touted, however, without attending to the research on which these claims were based. Sociocultural learning theory provides theoretical grounding for understanding how…
Descriptors: Learning Theories, Validity, Student Evaluation, Evaluation Methods
Peer reviewed Peer reviewed
Direct linkDirect link
Perie, Marianne; Marion, Scott; Gong, Brian – Educational Measurement: Issues and Practice, 2009
Local assessment systems are being marketed as formative, benchmark, predictive, and a host of other terms. Many so-called formative assessments are not at all similar to the types of assessments and strategies studied by Black and Wiliam (1998) but instead are interim assessments. In this article, we clarify the definition and uses of interim…
Descriptors: Student Evaluation, Evaluation Methods, Educational Assessment, Formative Evaluation
Peer reviewed Peer reviewed
Direct linkDirect link
Heritage, Margaret; Kim, Jinok; Vendlinski, Terry; Herman, Joan – Educational Measurement: Issues and Practice, 2009
Based on the results of a generalizability study of measures of teacher knowledge for teaching mathematics developed at the National Center for Research on Evaluation, Standards, and Student Testing at the University of California, Los Angeles, this article provides evidence that teachers are better at drawing reasonable inferences about student…
Descriptors: Formative Evaluation, Educational Testing, Inferences, Mathematics Instruction
Peer reviewed Peer reviewed
Direct linkDirect link
Educational Measurement: Issues and Practice, 2005
A note from the Working Group of the Joint Committee on Testing Practices: The "Code of Fair Testing Practices in Education (Code)" prepared by the Joint Committee on Testing Practices (JCTP) has just been revised for the first time since its initial introduction in 1988. The revision of the Code was inspired primarily by the revision of…
Descriptors: Measurement, Psychological Testing, Test Use, Student Evaluation
Peer reviewed Peer reviewed
Hoover, H. D. – Educational Measurement: Issues and Practice, 2003
Discusses three common misconceptions about achievement testing: (1) the idea that men generally outscore women; (2) the notion that the primary role of norms is to compare one student to another; and (3) the misconception that it is not possible to measure the achievement of kindergarten and primary grade students with group tests. (SLD)
Descriptors: Achievement Tests, Comparative Analysis, Primary Education, Student Evaluation
Peer reviewed Peer reviewed
Cizek, Gregory J. – Educational Measurement: Issues and Practice, 2003
Responds to critiques of an earlier article on the unintended consequences of high-stakes testing. Reiterates the need to look for positive as well as negative consequences of high stakes testing and calls for balance between the role of large-scale assessment in identifying achievement deficits and the responsibility educational systems have to…
Descriptors: Achievement Tests, Educational Assessment, Elementary Secondary Education, High Stakes Tests
Peer reviewed Peer reviewed
Popham, W. James – Educational Measurement: Issues and Practice, 2003
Suggests that members of the educational measurement community are at least in part responsible for the fact that many U.S. children are receiving a reduced quality education because of the increasing misuse of educational tests. Assessment personnel have failed to speak out against the use of tests to assess school quality. (SLD)
Descriptors: Educational Quality, Educational Testing, Elementary Secondary Education, Psychometrics
Peer reviewed Peer reviewed
Rich, William – Educational Measurement: Issues and Practice, 2003
Discusses current high-stakes standardized testing policy in response to G. Cizek through analogy with the policies of Chairman Mao in China and the decision to build the Maginot Line in France after World War I. Lessons from these events show that high-stakes policies skew outcomes and present high risk. (SLD)
Descriptors: Achievement Tests, Educational Assessment, Elementary Secondary Education, High Stakes Tests
Previous Page | Next Page ยป
Pages: 1  |  2  |  3  |  4  |  5  |  6  |  7  |  8  |  9