Home

Awesome

<p align="center"> <img src="./logo.png" alt="Logo" style="width:380px;"> </p> <p align="center"> A node.js project that generates short videos using popular AI LLM. </p>

FFAIVideo

A lightweight node.js project that utilizes the currently popular AI LLM in the industry to intelligently generate short videos. Without the need for complex configurations, simply input a short piece of text, and it can automatically synthesize an exciting video content.

Features

Installation

npm install ffaivideo

Note: To run the preceding commands, Node.js and npm must be installed.

Example usage

const { generateVideo } = require('ffaivideo');

generateVideo(
  {
    provider: 'gpt4js',
    // Use the free gpt4js, g4f, or OpenAI, or Moonshot account
    // Or use openai gpt
    // provider: 'openai',
    // openai: {
    //   apiKey: '*',
    //   modelName: 'gpt-4-turbo-preview',
    //   baseUrl: 'https://api.openai.com/v1',
    // },
    termsNum: 8,
    subtitleMaxWidth: 9,
    videoClipDuration: 12,
    voiceName: 'zh-CN-YunjianNeural',
    bgMusic: path.join(__dirname, './assets/songs/m1.mp3'),
    output: path.join(__dirname, './output'),
    pexels: {
      apiKey: 'xxx',
    },
    videoScript: `
    ...Enter your text here
  `,
  },
  progress => {
    console.log(progress);
  },
).then(videoPath => {
  console.log(videoPath);
});

Installation preparation

About the config of LLM

The current project already supports multiple AI LLM models such as OpenAI, Moonshot, Azure, g4f, Google Gemini, etc. to meet your different needs. If you want to introduce other AI LLM models, please fork this project and submit a Pull Request (PR) for us to evaluate and merge.

Before using this project, please make sure that you have applied for an API Key from the corresponding service provider. For example, if you plan to use GPT-4.0 or GPT-3.5, you need to make sure that you already have an API Key from OpenAI. In addition, you can also choose to use g4f, which is an open source library that provides free GPT usage services. Please note that although g4f is free, its service stability may fluctuate, and the usage experience may be good and bad from time to time. You can find its repository link on GitHub: https://github.com/xtekky/gpt4free.

In addition, as another option, you can apply for API services by visiting the Moonshot ai platform. After registration, you will immediately receive 15 of experience money, which is enough to support about 1,500 conversations. After successfully applying, you need to set the provider to moonshot and configure the corresponding apiKey to complete the project setup.

You need to configure apiKey, modelName and baseUrl. For azure ai, you also need to configure apiVersion.

openai: {
  apiKey: 'xxxx',
  modelName: 'gpt-4-turbo-preview',
  baseUrl: 'https://api.openai.com/v1',
},

About video material site

The video resources of this project use the Pexels website. Please visit https://www.pexels.com/api/new/ and follow the instructions to apply for a new API key so that you can use the rich materials provided by Pexels in your project.

About voice tts

FFAIVideo by default integrates Microsoft Edge's online text-to-speech service. This service is not only powerful but also allows users to customize and set up their own application tokens, offering more flexible configuration and usage options. However, users in China may encounter access restrictions. For more details, please refer to this GitHub issue: https://github.com/rany2/edge-tts/issues/290. To address this issue, we provide an alternative: you can use the Azure TTS service. This requires purchasing an Azure AI Speech service account. For more information, please visit: https://azure.microsoft.com/en-us/products/ai-services/ai-speech. When using the Azure TTS service, a configuration example is as follows:

azureTTSSettings: {
  subscriptionKey: '*',
  serviceRegion: '*',
},

About installing ffmpeg

Since FFAIVideo relies on FFmpeg for its functionality, it is essential that you install a standard, well-maintained version of FFmpeg. This will ensure that FFAIVideo operates smoothly and without any compatibility issues.

API Configuration

Parameter nameTypeDefault valueDescription
providerstringgpt4jsLLM Provider
moonshotLLMConfig-Moonshot configuration
openaiLLMConfig-OpenAI configuration
azureLLMConfig-Azure configuration
geminiLLMConfig-Gemini configuration
g4fLLMConfig-G4F configuration
gpt4jsLLMConfig-GPT4js simplifies AI model interaction
customoAILLMConfig-custom ai configuration
pexelsMaterialSite-Pexels material site
videoScriptstring-Script for generating videos
videoTermsstring | string[]-Keywords for generating videos
videoAspectVideoAspectundefinedVideo aspect ratio, can be undefined by default
videoClipDurationnumber5Video clip duration, default is 5 seconds
lineBreakForcebooleantrueUse line breaks to split long subtitles
termsNumnumber5Number of keywords
outputstring-Output path
cacheDirstring-Cache directory
voiceNamestring-Voice name
voiceVolumenumber1.0Voice volume, default is 1.0
bgMusicstring-Background music
bgMusicVolumenumber0.5Background music volume, default is 0.2
fontsDirstring-Font directory
fontSizenumber24Font size
fontNamestring-Font name
textColorstring"#FFFFFF"Text color, default is "#FFFFFF"
strokeColorstring"#000000"Stroke color, default is "#000000"
strokeWidthnumber-Stroke width
textBottomnumber20Text bottom position
subtitleMaxWidthnumber-Maximum subtitle width
debugbooleanfalseDebug mode
lastTimenumber5Last time
azureTTSSettingsobjectnullAzure TTS settings
getMaterialfunctionnullA custom material synthesis
removeCachebooleantrueWhether to remove cache

Reference Project

This project is inspired by and builds upon the open-source contributions from several notable repositories, including MoneyPrinterTurbo, MoneyPrinter, and MsEdgeTTS. We express our sincere gratitude to the original authors for their dedication to the open-source community and their innovative spirit.

License

MIT LICENSE