The world of artificial intelligence

An artificial intelligence (AI) agent refers to a system or program that is capable of autonomously performing tasks on behalf of a user or another system by designing its workflow and utilizing available tools.

Manus AI a great AI agent the best to colorize black and white photos from China.
An AI tool to support the VS Code (generate codes), workflow customization and multi-agent.

ChatGPT is a large language model that generates text, translates languages, and answers questions.
offizielle App from OpenAI for ChatGPT.

Text to image
Ideogram, a free tool, specializes in creating AI-generated images that include readable text.
Midjourney is a paid AI tool used for generating stunning images from text prompts
Free AI alternative to Midjourney

Elevenlabs, a paid tool, excels in generating life like audio for various applications
Fish Audio is an open-source platform that offers text-to-speech solutions, enabling voice generation for free.

Dupdub is a free, all-in-one platform offering audio, translations, and avatar generation.

 

PAID
——–
Napkin Al
Invideo Al
Quillbot Al
Tool for
———-

Presentation
Video Generator
Writing
FREE
——–
Gamma Al
Pixverse Al
Grammerly

 

 

This model has been able to work well in various fields.


Suno Song Generator Music production
with artificial intelligence
You simply type in lyrics or any text prompt, and the AI will generate an original song composition around it

Perplexity AI is an AI-powered search engine and chatbot that utilizes advanced technologies such as natural language processing (NLP) and machine learning
Ideogram
A KI-Tool that creates images from text descriptions using Generative Adversarial Network (GAN) technology, Specifically focuses on high-quality text in images.

 

Video production with artificial intelligence an AI to make a video on travel, finance, health or any …
Convert Text to Video with AIPictory‘s powerful AI enables you to create and edit professional quality videos using text
ImageBind by Meta AI
Using an image to retrieve audio
ImageBind can instantly suggest audio by using an image or video as an input.
Using audio to retrieve images
ImageBind can instantly suggest images by using an audio clip as an input.
Llama 2 – Meta AI (open source large language model)
Code Llama is a code generation model built on Llama 2. Its available free for research and commercial use.
CodeGPT(GPT-3): The VSCode Extension  inside VSCode through the official OpenAI API
Install CodeGPT in Visual Studio
Open Visual Studio Code and move to the “View/Extensions” menu in the left panel.

API key from OpenAI

Go to the platform.openai.com/account/api-keys and click on “Create new secret key”.
Search for “codegpt” and select “Code GPT” from the search results  and select “Code GPT” from the search results.
Open the command bar by using the “Ctrl + Shift + P” shortcut, and it will open the command bar and type “codegpt” and then open “Set API KEY”.
To generate code, add a comment for the task you wish and press the “Ctrl + Shift + I” keys. or select a code, right-click on it, and ask CodeGPT to explain the code, refactor it, find problems, debug, and more.
Typical progress:
    Conventional AI -> Deep Learning -> Generative AI
Clips AI is an open-source Python library that automatically converts long form video into clips.
DALL-E 2 is a text-to-image generation model that can create realistic images from text descriptions.

Microsoft:
  • Azure Bot Service
  • Azure Cognitive Services
  • Copilot

Google:

  • Imagen: create high-quality images from text descriptions.
  • LaMDA: Can generate text, translate languages, write creative content, and answer your questions informally.
  • PaLM: writing code

Amazon:

  • Whisper : Transform human-computer interaction for speech-to-text
  • Amazon Transcribe: Service for transcribe audio and video files.
  • Amazon Polly: text-to-speech audio.
  • Amazon Lex: create chatbots.

 

LaMDA, GPT-3/chatGPT(Text)
BLOOM, Gopher(Text, Voice)
LLaMA (Text, Voice, Video)
DALL-E2, Stable Ciffusion (Image)
Whisper(Voice)
PaLM, StableLM, OpenAI(Code)
MS X-Clip(Video)

Sora is a paid model that generates videos from text prompts,
Luma AI, another paid option, focuses on creating accurate and visually compelling videos from text.

Synthesia, a paid tool, is used for generating digital avatars for videos,

 

 

Hugging Face's logo
Hugging Face is a great platform. It provides a set of tools and resources for teaching and using models.