Preparing For The Next Pandemic: Transfer Learning From Existing Diseases Via Hierarchical Multi-Modal BERT Models to Predict COVID-19 Outcomes