This section show my notes on business considerations while talking about generative AI.
[TOC]
Back in 2023 several AI specialist were highlighting 2023 as the year of POCs and 2024 the year of production. In this section I will try to identify some key data points to check what has happened in 2024.
As described by Menlo Ventures, the AI landscape is been rewritten in real time - AI spending surged 13.8Billion in 2024 - a signal that enterprises are shifting from POCs to production.
Key factors based on Menlo report on “2024”: The State of Generative AI in the Enterprise” are:
foundation model investment dominates but application layer is growing fast.
Few application layer solution in production - 1/3 been prototyped and evaluated
Most valuable use cases inside enterprise still remain:
Enterprise prioritize value over quick wins: “Return on investment and industry-specific customization matter most when selecting new tools.”
Vertical application are rising for domains such as healthcare, legal, financial services and media&entertainment.
multi-model strategy over single model: 3+ foundation models in the AI stack and using different models depending on the usecase or results. Close source model is still majority of usage with 81% market share.
Picture by Menlo Ventures,
Picture by Menlo Ventures
Picture by Menlo Ventures
Here the new word is “Service-as-Software”: where AI-driven solution offer the capability of traditional service providers but operate entirely through software.
“Service-as-Software”: where AI-driven solution offer the capability of traditional service providers but operate entirely through software.
Build versus buy is a decision factor, where Menlo has seen a near-even split of 47% solution been developed in-house while 53% sourced from vendors based on their survey of 600 US IT decision makers at enterprises with 50+ employees.
Picture by Menlo Ventures
Retrieval Augmented Generation (RAG) still dominates the adoption by 51% but agentic architectures are already powering 12% of implementations.
AI Agent automation is expected to drive the next wave of AI transformation from complex and multi-step tasks going beyond content generation and knowledge retrieval driving the need for new infrastructure such as agent authentication, tool integration platforms and specialized runtimes for AI-generated code.
Picture by Menlo Ventures
The tech industry is encountering a talent drought - not only shortage of data scientist but also expert that bridge the gap between AI capabilities and domain-specific expertise. According to report “AI Compensation Trends: The Real Cost of Top 1% AI Technical Talent”, AI scientists are at the forefront.
For example, the median base salary for technical staff at top AI startups is currently $310,000 according to H1B Salary Database.
Picture by Menlo Ventures
To better understand the cost of generative AI usecases, we describe in this section the main elements to consider in order to calculate the cost of generative AI applications.AI
Picture below helps in understanding the multifaceted nature of expenses involved in deploying and maintaining generative AI solutions.
Here are some of the cost related to generative AI, which are composed by:
The amount of investiment in generative AI lately has skyrocket. It jumpt from 2.85 billion US dollars in 2022 to 25.23 billion US dollars in 2023. (ref. IEEE Spectrum).
This has also been reflected on the race on the foundation models lately, that is clearly seen by number of foundation models been released by big industry players like Amazon, Google, Meta, Microsoft and OpenAI for example.
Generative AI is impacting every organization nowadays and an important aspect is how those organization can access the ROI (return of investiment) for generative AI. As described by Gartner, generative AI usecases can be divided in the following mix when considering value versus risk:
Quick wins: task specific usecases with productivity assistants such as Amazon Q for Developer and Microsoft 365 Copilot. Integrating these capabilities into other business process can help you maintain a competitive edge. Examples are: assistants/copilots, marketing generation and code assistants.
Differentiating use cases: industry specific or custom application that leverage your enterprise data in new ways extending your current process. Application such as Retrieval Augumented Generation (RAG) enables you to leaverage your enterprise data through for example the use of Amazon Q for Business or Amazon Bedrock. This might lead to potential benefits such as revenue generation. Additional examples are: custom support apps, sales apps, and enterprise document search and summarization.
Transformational initiatives: has the potential to turn business and model upside down. Those transformative usecases comes with complexity, risk and cost and high potential for technical depth. Examples are the creation of fine tuning foundation models or even custom pre-trained foundation models such as BoombergGPT - a foundation model specialized in financial domain.
Implementing and closely monitoring Key Performance Indicators (KPIs) is crucial to ensure that the costs associated with generative AI projects are managed effectively and remain within budgetary constraints.
Example of KPIs per usecase mix that might be relevant to keep track on are:
Driver uses ChatGPT hack to get dealer to agree to sell new car for $1 in ‘legally binding deal’ in blow for AI rollout - https://www.thesun.co.uk/motors/25091054/driver-uses-ai-loophole-buy-new-car-1/
Source: https://abovethelaw.com/2024/01/keep-your-firm-far-away-from-whatever-ai-chevy-was-using/