Large Transformer Model Inference Optimization — Blankdot