chenxwh/cogvlm2-video

CogVLM2: Visual Language Models for Image and Video Understanding

Input
Configure the inputs for the AI model.
Range: 0 to 1

When decoding text, samples from the top p percentage of most likely tokens; lower to ignore less likely tokens
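The top-p description above can be illustrated with a minimal sketch. It assumes the usual nucleus-sampling definition (keep the most likely tokens until their cumulative probability reaches p, then renormalise); the function name `top_p_filter` and the toy probabilities are hypothetical and are not taken from the model's code.

```python
def top_p_filter(probs, top_p):
    """Keep the most likely tokens whose cumulative probability reaches top_p.

    Illustrative sketch only: `probs` is a list of token probabilities
    summing to 1. Returns a dict mapping kept token index to its
    renormalised probability.
    """
    if not 0 < top_p <= 1:
        raise ValueError("top_p must be in (0, 1]")
    # Rank token indices from most to least likely.
    ranked = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    kept, cumulative = [], 0.0
    for i in ranked:
        kept.append(i)
        cumulative += probs[i]
        if cumulative >= top_p:
            break  # less likely tokens beyond this point are ignored
    # Renormalise so the kept probabilities sum to 1.
    total = sum(probs[i] for i in kept)
    return {i: probs[i] / total for i in kept}
```

With toy probabilities `[0.5, 0.3, 0.15, 0.05]`, a value of 0.7 keeps only the first two tokens, while a value of 1 keeps all four. Lowering the value shrinks the candidate set.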

Input prompt

Input video

Range: 0 to 100

Adjusts the randomness of outputs: values greater than 1 produce more random text, and 0 is deterministic
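As a rough illustration of how temperature changes randomness, the sketch below divides the logits by the temperature before applying softmax, which is the common convention. Whether this model applies it in exactly this way is an assumption, and `softmax_with_temperature` is a hypothetical helper name.

```python
import math


def softmax_with_temperature(logits, temperature):
    """Turn logits into probabilities, scaled by temperature (sketch)."""
    if temperature < 0:
        raise ValueError("temperature must be non-negative")
    if temperature == 0:
        # Deterministic case: all probability goes to the highest logit.
        best = max(range(len(logits)), key=lambda i: logits[i])
        return [1.0 if i == best else 0.0 for i in range(len(logits))]
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the maximum for numerical stability
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]
```

For the same logits, a low temperature concentrates probability on the top token, and a temperature above 1 flattens the distribution so that unlikely tokens are sampled more often.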

Range: 0 to 100

Maximum number of tokens to generate. A word is generally 2-3 tokens
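Taking the page's rule of thumb of 2-3 tokens per word at face value, a token limit can be converted into an approximate word count. The helper below is a hypothetical back-of-the-envelope sketch; real tokenizers vary by language and text.

```python
def words_for_token_budget(max_tokens, tokens_per_word=(2, 3)):
    """Estimate how many words fit in a token budget (rough sketch).

    Returns (fewest_words, most_words), using the page's assumed ratio
    of 2-3 tokens per word.
    """
    low, high = tokens_per_word
    # More tokens per word means fewer words fit in the same budget.
    return max_tokens // high, max_tokens // low
```

Under that assumption, a limit of 100 tokens corresponds to roughly 33 to 50 words of generated text.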

Output
The generated output will appear here.


cogvlm2-video - ikalos.ai