Generate Vietnamese speech from text
Generate high-quality images from text prompts
OmniParser, turn your LLM into GUI agent