Open Tech Talks newsletter!

Large Language Models

Generative AI has brought Artificial intelligence to the masses, and everyone has started talking, experiencing, and utilizing it in day-to-day life. When it comes to enterprise space, earlier this month, we covered the most critical area of implementing Large Language models by organizations through opting for use cases, as everyone is trying to find out different use cases that can be incorporated or targeted for the implementation. I have covered a few use cases in the retail industry and a few already implemented examples.

This week, I thought of going over a few of the open-source and commercially available large language models available in the market.

Large Language Models

Llama 2: trained on 2 trillion tokens, has double the context length than Llama 1. Its fine-tuned models have been trained on over 1 million human annotations. Llama Chat: Llama 2 was pre-trained on publicly available online data sources. Code Llama: Code Llama is a code generation model built on Llama 2, trained on 500B tokens of code. It supports common programming languages, including Python, C++, Java, PHP, Typescript (Javascript), C#, and Bash.
Dolly: It is trained for less than $30 to display ChatGPT-like human interactivity. Dolly 2.0 is a 12B parameter language model based on the EleutherAI Pythia model family and fine-tuned exclusively on a new, high-quality human-generated instruction following dataset, crowdsourced by Databricks employees. The dataset is the first open-source, human-generated instruction dataset specifically designed to make large language models exhibit the magical interactivity of ChatGPT. More than 5,000 Databricks employees authored Databricks-dolly-15k during March and April of 2023. These training records are natural, expressive, and designed to represent various behaviors, from brainstorming and content generation to information extraction and summarization.
Guanaco is an advanced instruction-following language model based on Meta's LLaMA 7B model. Building upon the initial dataset of 52K from the Alpaca model, it has incorporated an additional 534,530 entries. These cover languages such as English, Simplified Chinese, Traditional Chinese (Taiwan), Traditional Chinese (Hong Kong), Japanese, and Deutsch, as well as various linguistic and grammatical tasks.
BLOOM: The World’s Largest Open Multilingual Language Model 176 billion parameters, BLOOM can generate text in 46 natural languages and 13 programming languages.
LLaVA Large Language and Vision Assistant is a model that blends vision and language understanding. It's like a visual version of GPT-4 and sets high standards in Science QA accuracy.
Vicuna-13B is an open-source chatbot based on LLaMA and fine-tuned with ShareGPT conversations. Vicuna LLM creates text that feels natural and is both engaging and informative.
FLAN-T5 is a model created by Google Research. It's trained on various tasks, both supervised and unsupervised, and turns them into a text-to-text format. It's a version of the google’s T5 model.
Falcon: Created by Abu Dhabi's Technology Innovation Institute (TII), it has two models: Falcon-40B and Falcon-7B. These models process web data uniquely by removing duplicates and using a special filtering system. With multi-query attention, these models work faster and better. Falcon can write like humans, translate languages, and respond to questions.

I had a chance to visit the Flacon showcase during the GITEX exhibition.

Falcon showcase at GITEX, Dubai Exhibition

Proprietary/commercial Large language Foundational models:

AI21:
- J2 Ultra Instruct
- J2 Mid Instruct
- AI21 Summarize
Anthropic:
- Claude
Cohere:
- Generate Model Command
- Generate Model Command-Light
LightOn:
- Lyra-Fr 10B
Stability AI:
- SDXL
Amazon:
- Titan Text Large

News & Updates...

This week has seen a storm of new AI features and products announced, fueling the technology revolution.

A blog post on Multi-GPU multinode fine-tuning Llama2 on OCI Data Science
Revolutionizing AI-driven research with Cleveland Clinic and OCI
Gen AI Navigator from Google Cloud, is a guide for you to adopt Gen AI.
Frontier risk and preparedness is an initiative from OpenAI.
Prompts are key in 2023: Twenty-five tips to help you unlock the potential of generative AI.

Potential of AI

Generative AI use cases across six industries by Deloitte AI Institute

Things to Know

Anthropic, Google, Microsoft, and OpenAI have announced the Frontier Model Forum and the creation of a new AI Safety Fund, with more than $10 million initiative to promote research in the field of AI safety
Policy paper Emerging Processes for Frontier AI Safety by the UK Government
NEOM and Pony.ai established a joint venture to develop, manufacture, and deliver autonomous vehicles, an autonomous driving service, and smart vehicle infrastructure.

The Opportunity...

Podcast:

This week's Open Tech Talks episode 119 on AI, Authenticity, and Marketing: A Deep Dive with Emanuel Rose

Courses to attend:

New course: Functions, Tools, and Agents with LangChain by Deep Learning
Awesome Generative Artificial Intelligence curated list of resources; I found it a good resource documented here.

Events:

UK's AI Safety Summit organized by the UK Government on Nov 1st & 2nd.
Codershq to attend the coder-specific events in Dubai.

Tech and Tools...

Llama 2 Fine-tuning / Inference Recipes, Examples and Demo Apps
The Open Source Next.js SaaS boilerplate for Enterprise SaaS app development
Generative Models by Stability AI
Jupyter AI connects generative AI with Jupyter notebooks

Until next week,

Kashif Manzoor

You have registered on OTechTalks.tv over the last five years. If you don’t want to receive it, please unsubscribe; you will not get it next time.

The opinions expressed here represent solely my own personal conjecture based upon experience, practice, and observation and do not represent the thoughts, intentions, plans, or strategies of my current or previous employers or their clients/customers. The objective of this newsletter is to share and learn with the community.