Please describe your proposed solution.
<u>Starting point:</u>
This project started in Fund 7 as a simple concept: To offer any user of Web3 Apps / Metaverse, or even web2 Apps (twitter, discord…) a simple way to obtain a fun picture profile (PP) of themselves matching the context. If I play a game of zombies, I d like a PP of me as a zombie, instead of a generic zombie. As the digital world grows and end-users have dozens (maybe even hundreds) of online identities, they will want dozens (if not hundreds) of PP to match moods, stories, and situations.
A first iteration was developed in the past 5 months (see TaChiKu.com), and acts as a very good proof of concept. It works, it's simple to use, it's fun, and it is also deeply addictive (just try it out, you will see what we mean). At time of writing (end June 2022), the minting is not yet integrated, but there is still 6 weeks until final delivery of this "first iteration".
The promise made in November 2021 was
> "A web3 API for generating an ADA compatible NFT where the picture is generated by an AI. Input an image of you, a few words for context (zombie, punk, etc..), your ADA public address, accept to pay a few ADA for minting, and your PP appears at the other end of the API call."
A morphing of an uploaded image
https://tachiku.com/app/view/?uid_image=84469401340
TaChiKu.com does this (*minting coming by mid August latest) although we went way beyond "just an API" and instead provided a full web app for an easy end-user experience. For devs, the API integration here: https://tachiku.com/doc/
However, the current solution has both some "limitations", and some higher value "potential". This is what we would like to address in this proposal.
<u>The current limitations:</u>
1/ Difficult to setup
Setting up an AI (from an open source project) is time consuming and can be very difficult, as it will depend on OS (Windows, Mac, Linux), GPU (Nvidia, others, drivers), and compatibility of open libraries in the various combinations of OSes/GPUs.
In the past 5 months, we have delivered a morphing AI which works well for either Windows 10 + Nvidia RTX3090 or Linux Ubuntu 20.04 + Nvidia RTX3090. We would need to deliver more AIs (including the delivery of the Deepfake AI, but also potentially other image manipulation AIs) for more GPUs, and for most latest OS releases.
Delivering this part is a necessary pre-condition for the next one (2/).
2/ Slow to warm-up
Most AIs run from Python programs, and they take a very long time to warm up, therefore this is not a workable solution at scale. Recently, we have transformed the Morphing AI from a Python program to a local server. There needs to be a systematic approach with open APIs for moving Python programs to server based AIs, so that the integration is smooth.
Delivering this part is a necessary pre-condition for the next one (3/).
3/ In need of virtualization
The robustness, redundancy, and capacity under load of the entire TaChiKu solution requires several concurrent AIs running in parallel. The only good way to do this is via virtualization of those AIs.
Anyone wanting to lend GPU power should not have to worry about the 2 problems above. They should basically just drag and drop a docker image, start it, and be accepted as part of the TaChiKu AI network within seconds.
Unfortunately, the containerization of AIs is still machine dependent (in fact it is GPU dependent + python libraries may require adaptation / recompile on yet unsupported combinations of GPUs, and also Docker can make use of the physical machine GPU power only on linux…). Although some attempts at dockerization were made in the past 3 months, none were fully successful yet, and more times needs to be spent here.
This part is critical to move to the more interesting "value potential" section.
To understand more, watch this video explaining the architecture journey from initial Proof of Concept to where we need to go for a full virtualization of AIs:
<https://www.youtube.com/embed/4T7wFS0xONY>Note that because of the lack of an easy way to share and setup a working AI (via container), the current project is limited in the sense that the AI only runs on our physical machine. The TaChiKu app itself is hosted on AWS and available 24/7, but the morphing AI program is physically hosted on our machine. When the machine is Off… the Morphing AI is Off, which of course is not a very acceptable scaling proposal. It is only good for the current stage of a final Proof of Concept.
<u>The value potential:</u>
When the AIs are containerized and run on their own server, able to communicate back and forth with the main TaChiKu AI server, it then allows for an ecosystem of fully decentralized AIs. With this in mind, the potential of TaChiKu could go way beyond its initial scope.
The development effort submitted in this proposal would deliver a list of Docker images ready to be instantiated in pre-defined environments (AI + GPU), for example:
- "Morphing - RTX3090 GPU - tested on Linux Ubuntu"
- "Morphing & Deepfake - RTX3060 GPU - tested on Linux Ubuntu "
Those would be freely accessible to anyone. You could download the image, run it on your machine (one command line to execute) and it would be already pre-configured to automatically connect to the main TaChiKu AI server, which would automatically integrate your machine's GPU power into the whole decentralized TaChiKu AI ecosystem. Morphing and deepfakes executed on the TaChiKu app would then be sent to an available dockerized AI for processing, potentially yours if it is running and available at the time of the request.
You may ask: why would I want to participate into lending my GPU power? Well, we can imagine an eco-system where the participating user (lending his GPU) could be rewarded in coins taken from the end-user's fee for morphing and minting NFTs.
Note: this last part of rewarding GPU owners is not part of this proposal, but this current proposal would put in place all the bricks for this to happen thereafter.
With this proposal, TaChiKu can become a fully decentralised AI ecosystem for minting AI-generated NFTs on Cardano. Think of all the possibilities: morphing, deepfakes, generative art, a long list of whatever the combination of end-user's ideas, GPU power, and AI capabilities can deliver.
> In short, we can give to anyone on the internet, within a few clicks (and a few ADAs), unlimited access to complex pre-built AIs alongside with the GPU power to run them, and the instant minting of the result on the Cardano Blockchain, as well as automatic posting on social media.
And later, we will be able to give all lenders of their GPU power a reward / stake in the ecosystem.
<u>What we will deliver:</u>
1/ a robust virtualized network of curated AIs (morphing, deepfake) running on nodes hosted by participating community members.
2/ the ability for end-users to generate morphed images and deepfake videos from a simple image upload.
a 15sec example deepfake made post morphing (see picture above)
<https://twitter.com/TaChiKuAi/status/1541508014959726592>
3/ the ability for end-users to mint those morphed images and deepfake videos as NFTs on the Cardano blockchain, and automatically post them to relevant social media platforms.
4/ the ability for end-users to trade AI-generated artworks as native Cardano NFTs on a dedicated marketplace with like-minded users.
<u>Benefits</u>:
This project will benefits the wider Cardano community in two ways:
1/ As TaChiKu continue to deliver simplicity of use of AI capabilities and access to GPU power, it can become the global go-to platform for AI-generated NFTs. It would only take a few influencers to kick start a frenzy. The trend of avatar personalization, memes, and short but impactful videos is only going to grow, and TaChiKu has all the building blocks to be very well placed and catch this unstoppable wave. In this sense, an obvious benefit for the Cardano community is the exposure to gain large communities of end-users (and the associated fees), as well as being positioned in this growing niche market of AI-generated artwork.
2/ If the project attracts a large volume of end-users, it will then attract "lenders of GPU power" since they will have a financial incentive to be part of the network. Those participating lenders will add to the stickiness of the solution and to the overall value of the ecosystem (think of a future Utility coin) which will also greatly benefit the Cardano ecosystem.
Please describe how your proposed solution will address the Challenge that you have submitted it in.
From an end-user point of view, TaChiKu can become "The Marketplace for AI generated Art on Cardano". Although its DApp will ultimately be fairly simple and common-place (mint, buy, sell, and possibly auction), its appeal will be in its unique access to AI "magic" to create surprising images and videos within a few seconds. All mints and subsequent transactions (resale) made by end-users will benefit the Cardano ecosystem in fees.
TaChiKu will be positionned as a unique decentralized AI solution for the Cardano ecosystem, to benefit from the growing market of NFTs, avatars, AI driven artwork, generative Art, and more. The niche of AI driven art is still in its infancy, and the immediate attraction of deepfakes and personalizable video memes could be an instant hit, attracting a few influencers to mint themselves on the Cardano network, and opening viral snowball effect.
TaChiKu will directly plug into social media, and any take up will be massively amplified, thereby benefitting the Cardano community. Have a look right now to see how it works: try morphing your picture on TaChiku.com, make it publicly available, and you will see it on twitter within seconds - follow @tachikuai
Finally, TaChiKu could later on deliver its own Utility token on the Cardano blockchain, increasing TaChiKu's dependency on Cardano, and allowing Cardano to benefit greatly from any success made by TaChiKu.
What are the main risks that could prevent you from delivering the project successfully and please explain how you will mitigate each risk?
Here is the list of the identified risks for the next 6-12 months:
<u>Technology risks:</u>
The risk on the backend and web app are mostly contained, since the experience of delivering the first iteration of TaChiKu have been positive. The biggest risk here is therefore not so much of technology, but of time (Eric's time). At this stage, it is still planned that this part of the project will be delivered by Eric (to lower any risk of ramping up another person on this core part of the project). In order to mitigate the risk, the project timelines have been expanded to 9 months for the final delivery. There will be many intermediate milestones, but at least, the 9 months timeline will act as a buffer to mitigate any risk on availability of Eric's time.
The risk on the Plutus code dev is also contained in its own box. There is no question that such code can be delivered, since NFT minting is not a new feature on Cardano. It is just about finding the right person. In the unfortunate case that this part could not be delivered (which seems very unlikely), we would continue to work on the third party solution (nft-maker) for a while. So it does not break the whole project if not delivered.
The risk on delivering a fully decentralized AI network seems the most high profile technology risk of this project. It seems easy on paper (dockerise the AI and use it) but it comes with many constraints of integration, that it was not possible to deliver within a few days so far. This risk is best mitigated by finding the right person for this job. At the time of first submission of this proposal, this person is not yet identified, but it seems very reasonable that we will find a Python developer with relevant docker experience to help us for a month or two, and deliver the final dockerised versions of morphing and deepfake AIs.
<u>Commercial risks:</u>
Then, there is the risk of the project not taking up. So far, experience shows that end-users love it when they see the result of image morphing. By going one step further and delivering deepfake videos, we hope to go beyond the initial wow factor and attain long-term stickiness. By integrating with most social platforms (twitter, facebook, tiktok, linkedin), we indeed hope to get to a point of a virtuous cycle of free viral marketing. We have allocated a marketing budget to mitigate this risk and turn this project into a success with a large end-user adoption and hopefully lots of goodwill viral marketing. The recruit of influencers will likely be key (we are currently discussing with some artists with a very large fan base)
<u>Financial risks:</u>
Since this project has been initially self funded and helped by the Cardano community (fund 7), we do not depend of VC money. This project is not at risk financially, since the outgoings are so low that the grants and the self funding from Eric are enough to keep it going for as long as we can see. If this grant is not given, then the project will continue, but at a very slower pace, and at the cost of some "must have" being pushed back and/or discarded until funding is in place. Note also that beyond the funding from Catalyst, the access to the Cardano community also gives us a strong momentum and advantage.
<u>Single chain risk:</u>
It could be argued that there is also a risk associated with keeping the project only on the Cardano blockchain, and that opening it to other chains (mint on Solana, on Eth, on Cosmos, etc…) could be a good move, and make the project so much more visible. In the last few months, when the integration of Cardano NFT minting (plutus DApp) was becoming "much more painful" than initially anticipated, we could have reverted to other solutions and decide to mint on another chain with a potentially "easier" implementation. But we did NOT do this. Instead, we continued to search for a solution that would work on Cardano (and we found it).
We want to express openly and very clearly where our loyalty goes. With past and (hopefully) upcoming additional funding coming from Cardano and supported by its community, this project for delivering Morphed and Deepfake NFTs via AI will stay on Cardano and not go anywhere else.
> Loyalty is an asset that we will not exchange for a quick win elsewhere. We have embraced this community, and we are looking forward to a long term win-win relationship together.