Capsule-enhanced hierarchical vision transformers for rare disease classification from medical images