The state and expectations of interrater reliability in nonhuman personality research