Please describe your proposed solution.
Process leading to “Data-Driven Catalyst”:
- Stakeholder Feedback Process: Each milestone will involve presentation of objectives and finding to the Catalyst Team, Catalyst Voices Team, Proposal Assessor / Community Advisor Telegram community. There have been countless contributions to data analytics by various members of the community. The objectives have been manyfold: finding gaming of the system, duplicates and AI-generated content. Finding the most efficient categories to get funded "easily". Correlation between voting success and some other metric such as PA score, length of proposal, etc. We need to converge on the most meaningful past datasets and ensure that data collection and comparison over funds is possible.
- Dashboard and Executive Reports: Once the suitable Catalyst datasets and time series have been identified, we need to collect, clean and present the data in the most intuitive and unbiased way, give the largest possible number of community members access and the tools to tinker with the data, find flaws, be creative and converge on the most meaningful metrics to improve the Catalyst process going forward, find threats (like gaming the rules, Sybil attack vulnerability etc)
- Robust, Open Source Tech stack: We have relied too much on Big Tech tools like Google Drive, Excel spreadsheets and Google forms in the past. Catalyst Voices will be a huge improvement, but risks being accessible by far too few people with the sufficient technical expertise. We need to research, compare and test drive the most robust, open source and indestructible, free and lightweight tech stack possible. Open source data visualization and reports software using popular languages like python and/or JavaScript.
- Collaborative: There are a lot of people building similar tools in web3, in Cardano, for Catalyst - we need to move out of our silos and find the working groups creating gold standards for data-driven, open source community learning and governance around permissionless systems and data-driven self improvement.
- Flexible and reactive: We don't want to create white elephants with the Catalyst budget, but move from milestone to milestone with an open mind, pursue the datasets, tech solutions and collaboration tools that work best, and drop the ones that don't.
At the end of our 7 month project in close collaboration with the Catalyst team, Catalyst Voices and the community, we will present findings and have final working database for Funds 7 to 10, open to anyone collaboration platform and data storage and retrieval solution that allows seamless integration with the most exciting and cutting edge data science, machine learning and LLM tools out there, for example LangChain, Gemini, ChatGPT, TimeGPT, and do network graphs, identify clusters and discover insights that can drive Catalyst to get better and better over each iteration.