What's the difference between vllm and triton-inference-server?

May vllm can achieve the performance like fastertransformer on inference side? Just curious about the detailed optimization you're done and the goal you want to achieve.
BTW, vllm really accelerate our deploy work, thx.