Prismer is an example of my team's work on building foundations for multimodal LLMs.
GPT-4's vision API is not publicly available yet, and it will take much longer to become customizable for your enterprise's proprietary data and unique use cases.
VIMA ("VIsual Motor Attention") is another example of my team's effort to build foundations for multimodal-prompted, robot LLMs.
Folks, multimodal is the future, both for AI research and enterprise-grade applications. Time to go way beyond strings!
To learn more, watch Jensen's GTC Keynote recording here: nvidia.com
If your bandwidth allows, watch in 4K for the stunning graphics: nvidia.com
Attend GTC with us! I will be speaking too.