HMT: Hierarchical Memory Transformer for Long Context Language Processing