<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0"><channel><title><![CDATA[Building Custom NLP Solutions using state of the art NLP models | FutureSmart AI]]></title><description><![CDATA[Learn how to build custom Natural Language Processing (NLP) solutions using state-of-the-art models. Use Pytorch, Amazon Lex, FastAPI, and Hugging Face Transformers to create powerful applications.]]></description><link>https://blog.futuresmart.ai</link><generator>RSS for Node</generator><lastBuildDate>Mon, 13 Apr 2026 07:20:53 GMT</lastBuildDate><atom:link href="https://blog.futuresmart.ai/rss.xml" rel="self" type="application/rss+xml"/><language><![CDATA[en]]></language><ttl>60</ttl><item><title><![CDATA[Building AI Knowledge Graph Using Graphiti & Neo4j]]></title><description><![CDATA[Graphiti is a compact yet powerful Python library that converts raw text or JSON into an AI Knowledge Graph—a structured store of facts that acts as AI Agent Memory. Below is the exact workflow I used to load FutureSmart AI data into Neo4j, explore t...]]></description><link>https://blog.futuresmart.ai/building-ai-knowledge-graph-using-graphiti-and-neo4j</link><guid isPermaLink="true">https://blog.futuresmart.ai/building-ai-knowledge-graph-using-graphiti-and-neo4j</guid><category><![CDATA[graphiti]]></category><category><![CDATA[ai agent memory]]></category><category><![CDATA[ai-knowledge-graphs]]></category><dc:creator><![CDATA[Pradip Nichite]]></dc:creator><pubDate>Fri, 01 Aug 2025 08:32:34 GMT</pubDate><content:encoded><![CDATA[<p>Graphiti is a compact yet powerful Python library that converts raw text or JSON into an <strong>AI Knowledge Graph</strong>—a structured store of facts that acts as <strong>AI Agent Memory</strong>. 
Below is the exact workflow I used to load <strong>FutureSmart AI</strong> data into Neo4j, explore the resulting <strong>AI Graph Memory</strong>, and run hybrid (semantic + keyword) searches.</p>
<hr />
<h2 id="heading-1-why-graphiti-for-ai-agent-memory">1. Why Graphiti for AI Agent Memory?</h2>
<ul>
<li><p><strong>Real‑time inserts</strong> – Add new facts without bulk re‑processing.</p>
</li>
<li><p><strong>LLM‑powered parsing</strong> – Entity extraction, relationship mapping, summaries, and embeddings handled automatically.</p>
</li>
<li><p><strong>Neo4j under the hood</strong> – First‑class graph database with Cypher, indexes, and visualization tools.</p>
</li>
<li><p><strong>Perfect for AI Agent Memory</strong> – Your agent can recall structured facts on demand.</p>
</li>
</ul>
<hr />
<h2 id="heading-2-quick-refresher-knowledgegraph-basics">2. Quick Refresher: Knowledge‑Graph Basics</h2>
<pre><code class="lang-plaintext">“Pradip Nichite founded FutureSmart AI.”
</code></pre>
<div class="hn-table">
<table>
<thead>
<tr>
<td>Component</td><td>Example</td></tr>
</thead>
<tbody>
<tr>
<td><strong>Entity</strong></td><td><code>Pradip Nichite</code></td></tr>
<tr>
<td><strong>Entity</strong></td><td><code>FutureSmart AI</code></td></tr>
<tr>
<td><strong>Edge</strong></td><td><code>founded</code></td></tr>
<tr>
<td><strong>Triple</strong></td><td>(Pradip Nichite, founded, FutureSmart AI)</td></tr>
</tbody>
</table>
</div><p>Graphiti stores each triple as Neo4j nodes (<strong>Entity</strong>) and edges (<strong>RELATES_TO</strong>), plus an <strong>Episode</strong> node that records the original text—together forming your <strong>AI Graph Memory</strong>.</p>
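<p>Before Neo4j enters the picture, the triple structure can be sketched in plain Python. This is a conceptual illustration only (the <code>triples</code> list and <code>facts_about</code> helper are hypothetical, not Graphiti APIs); Graphiti builds and stores these structures for you:</p>

```python
# A triple is just (subject, predicate, object).
triples = [
    ("Pradip Nichite", "founded", "FutureSmart AI"),
    ("FutureSmart AI", "builds", "custom AI solutions"),
]

def facts_about(entity: str, triples: list[tuple[str, str, str]]) -> list[tuple[str, str, str]]:
    """Return every triple in which the entity appears as subject or object."""
    return [t for t in triples if entity in (t[0], t[2])]

# "FutureSmart AI" appears in both triples, once as object and once as subject.
print(facts_about("FutureSmart AI", triples))
```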
<hr />
<h2 id="heading-3-setup">3. Setup</h2>
<pre><code class="lang-bash">pip install graphiti-core  <span class="hljs-comment"># build your AI Knowledge Graph in minutes</span>
</code></pre>
<pre><code class="lang-python"><span class="hljs-keyword">import</span> os
<span class="hljs-keyword">from</span> google.colab <span class="hljs-keyword">import</span> userdata

os.environ[<span class="hljs-string">"OPENAI_API_KEY"</span>] = userdata.get(<span class="hljs-string">"OPENAI_API_KEY"</span>)
neo4j_uri      = <span class="hljs-string">"neo4j+s://&lt;your‑instance&gt;.databases.neo4j.io"</span>
neo4j_user     = <span class="hljs-string">"neo4j"</span>
neo4j_password = userdata.get(<span class="hljs-string">"NEO4J_PASSWORD"</span>)
</code></pre>
<hr />
<h2 id="heading-4-initialise-graphiti-clean-slate">4. Initialise Graphiti (clean slate)</h2>
<pre><code class="lang-python"><span class="hljs-keyword">from</span> graphiti_core <span class="hljs-keyword">import</span> Graphiti
<span class="hljs-keyword">from</span> graphiti_core.utils.maintenance.graph_data_operations <span class="hljs-keyword">import</span> clear_data

graphiti = Graphiti(neo4j_uri, neo4j_user, neo4j_password)
<span class="hljs-keyword">await</span> clear_data(graphiti.driver)              <span class="hljs-comment"># optional for a fresh start</span>
<span class="hljs-keyword">await</span> graphiti.build_indices_and_constraints() <span class="hljs-comment"># creates indexes once</span>
</code></pre>
<hr />
<h2 id="heading-5-insert-episodes-into-your-ai-knowledge-graph">5. Insert Episodes into Your AI Knowledge Graph</h2>
<pre><code class="lang-python"><span class="hljs-keyword">from</span> datetime <span class="hljs-keyword">import</span> datetime, timezone
<span class="hljs-keyword">from</span> graphiti_core.nodes <span class="hljs-keyword">import</span> EpisodeType

episodes = [
    {
        <span class="hljs-string">"name"</span>: <span class="hljs-string">"About Me"</span>,
        <span class="hljs-string">"content"</span>: <span class="hljs-string">"Hi, I'm Pradip Nichite. I am the founder and CEO of FutureSmart AI."</span>,
        <span class="hljs-string">"type"</span>: EpisodeType.text,
        <span class="hljs-string">"description"</span>: <span class="hljs-string">"intro"</span>
    },
    {
        <span class="hljs-string">"name"</span>: <span class="hljs-string">"About FutureSmart AI"</span>,
        <span class="hljs-string">"content"</span>: <span class="hljs-string">"FutureSmart AI builds custom AI solutions for clients."</span>,
        <span class="hljs-string">"type"</span>: EpisodeType.text,
        <span class="hljs-string">"description"</span>: <span class="hljs-string">"company overview"</span>
    }
]

<span class="hljs-keyword">async</span> <span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">add_data</span>(<span class="hljs-params">episodes</span>):</span>
    <span class="hljs-comment"># helper: loop over episodes and store each one in the graph</span>
    <span class="hljs-keyword">for</span> ep <span class="hljs-keyword">in</span> episodes:
        <span class="hljs-keyword">await</span> graphiti.add_episode(
            name=ep[<span class="hljs-string">"name"</span>],
            episode_body=ep[<span class="hljs-string">"content"</span>],
            source=ep[<span class="hljs-string">"type"</span>],
            source_description=ep[<span class="hljs-string">"description"</span>],
            reference_time=datetime.now(timezone.utc),
        )

<span class="hljs-keyword">await</span> add_data(episodes)
</code></pre>
<h3 id="heading-underthehood-workflow">Under‑the‑Hood Workflow</h3>
<ol>
<li><p><strong>Entity extraction</strong> (people, orgs, products)</p>
</li>
<li><p><strong>Relationship extraction</strong> (<code>founded</code>, <code>associated_with</code>, …)</p>
</li>
<li><p><strong>Deduplication</strong> of entities</p>
</li>
<li><p><strong>Summary</strong> + <strong>embedding</strong> generation</p>
</li>
<li><p><strong>Graph write</strong> → Neo4j (your <strong>AI Graph Memory</strong>)</p>
</li>
</ol>
<p>All steps run via LLM calls you can inspect in your OpenAI usage logs.</p>
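<p>Step 3 (deduplication) is worth a closer look. Graphiti resolves duplicate entity mentions with LLM assistance; the toy sketch below uses simple string normalisation to show the idea (the <code>dedupe_entities</code> function is illustrative, not part of Graphiti):</p>

```python
# Toy illustration of entity deduplication: mentions that normalise to the
# same name are merged, keeping the first surface form seen.
def dedupe_entities(mentions: list[str]) -> list[str]:
    seen: dict[str, str] = {}
    for m in mentions:
        key = m.strip().lower()          # normalise for comparison
        seen.setdefault(key, m.strip())  # keep first surface form
    return list(seen.values())

# "FutureSmart AI" and "futuresmart ai " collapse into one entity.
print(dedupe_entities(["FutureSmart AI", "futuresmart ai ", "Pradip Nichite"]))
```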
<hr />
<h2 id="heading-6-visual-exploration-of-ai-graph-memory">6. Visual Exploration of AI Graph Memory</h2>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1754036009862/076718dd-f8c0-42a6-8d7a-8e2d8fcb491b.png" alt class="image--center mx-auto" /></p>
<ul>
<li><p><strong>Blue</strong> = Episode nodes (source text)</p>
</li>
<li><p><strong>Brown</strong> = Entity nodes (deduplicated concepts)</p>
</li>
<li><p>Edges labelled <strong>MENTIONS</strong> &amp; <strong>RELATES_TO</strong> connect everything</p>
</li>
</ul>
<p>Run <code>MATCH (n) RETURN n</code> in Neo4j Browser to explore your <strong>AI Knowledge Graph</strong>. Click any node to view its <code>summary</code> and vector <code>embedding</code>.</p>
<hr />
<h2 id="heading-7-csv-export-at-a-glance">7. CSV Export at a Glance</h2>
<div class="hn-table">
<table>
<thead>
<tr>
<td>Metric</td><td>Count</td></tr>
</thead>
<tbody>
<tr>
<td>Nodes total</td><td>9</td></tr>
<tr>
<td>• Entity</td><td>5</td></tr>
<tr>
<td>• Episode</td><td>4</td></tr>
<tr>
<td>Relationships total</td><td>13</td></tr>
<tr>
<td>• <code>MENTIONS</code></td><td>8</td></tr>
<tr>
<td>• <code>RELATES_TO</code></td><td>5</td></tr>
</tbody>
</table>
</div><p>Sample from <code>node‑export.csv</code>:</p>
<div class="hn-table">
<table>
<thead>
<tr>
<td>name</td><td>label</td><td>summary</td></tr>
</thead>
<tbody>
<tr>
<td>Pradip Nichite</td><td>Entity</td><td>Pradip Nichite is the founder and CEO of FutureSmart AI.</td></tr>
<tr>
<td>FutureSmart AI</td><td>Entity</td><td>FutureSmart AI is a company that develops AI solutions for clients and is known for creating AI demos.</td></tr>
<tr>
<td>About Me</td><td>Episodic</td><td></td></tr>
<tr>
<td>AI Demos</td><td>Entity</td><td>AI Demos is associated with FutureSmart AI and was founded by Pradip Nichite.</td></tr>
<tr>
<td>FutureSmart Agent</td><td>Entity</td><td>FutureSmart Agent is a product developed by FutureSmart AI.</td></tr>
<tr>
<td>About FutureSmart AI</td><td>Episodic</td><td></td></tr>
<tr>
<td>About AI Demos</td><td>Episodic</td><td></td></tr>
<tr>
<td>FutureSmart Agent</td><td>Episodic</td><td></td></tr>
<tr>
<td><a target="_blank" href="https://agent.futuresmart.ai/">https://agent.futuresmart.ai/</a></td><td>Entity</td><td>FutureSmart AI is the company behind the product FutureSmart Agent. Their website is <a target="_blank" href="https://agent.futuresmart.ai/">https://agent.futuresmart.ai/</a>.</td></tr>
</tbody>
</table>
</div><hr />
<h2 id="heading-8-query-your-ai-agent-memory-hybrid-search">8. Query Your AI Agent Memory (Hybrid Search)</h2>
<pre><code class="lang-python"><span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">print_result</span>(<span class="hljs-params">results</span>):</span>
    print(<span class="hljs-string">"Search Results:"</span>)
    <span class="hljs-keyword">for</span> r <span class="hljs-keyword">in</span> results:  <span class="hljs-comment"># each result edge exposes uuid, fact, valid_at</span>
        print(<span class="hljs-string">f"UUID: <span class="hljs-subst">{r.uuid}</span>\nFact: <span class="hljs-subst">{r.fact}</span>\nValid from: <span class="hljs-subst">{r.valid_at}</span>\n---"</span>)

query   = <span class="hljs-string">"What products FutureSmart has"</span>
results = <span class="hljs-keyword">await</span> graphiti.search(query=query, num_results=<span class="hljs-number">3</span>)
print_result(results)
</code></pre>
<pre><code class="lang-plaintext">Search Results:
UUID: dae5cb4d-adb2-401f-991f-aa5259127245
Fact: FutureSmart Agent is a product of FutureSmart AI
Valid from: 2025-08-01 07:54:17+00:00
---
UUID: afc8af04-0af2-4fd8-940a-91cf93da6943
Fact: https://agent.futuresmart.ai/ is the website of FutureSmart Agent
Valid from: 2025-08-01 07:54:17+00:00
---
UUID: c6d353e8-6fb1-4263-a0e9-4df21ea5805f
Fact: FutureSmart AI is the company behind AI Demos
Valid from: 2025-08-01 07:54:08+00:00
</code></pre>
<p>Graphiti blends embedding similarity with BM25 keywords to surface relevant facts for downstream agents.</p>
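<p>As a rough mental model of that blending (the helper names and the 50/50 weighting below are made up for illustration; this is not Graphiti's actual scoring code):</p>

```python
# Toy hybrid ranking: blend a semantic (cosine) score with a simple
# keyword-overlap score. Real systems use embeddings plus BM25.
def keyword_score(query: str, fact: str) -> float:
    q, f = set(query.lower().split()), set(fact.lower().split())
    return len(q & f) / len(q) if q else 0.0

def hybrid_score(semantic: float, query: str, fact: str, alpha: float = 0.5) -> float:
    # alpha weights the semantic score against keyword overlap
    return alpha * semantic + (1 - alpha) * keyword_score(query, fact)

print(hybrid_score(0.8, "FutureSmart products",
                   "FutureSmart Agent is a product of FutureSmart AI"))
```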
<hr />
<h2 id="heading-9-practical-uses-of-an-ai-graph-memory">9. Practical Uses of an AI Graph Memory</h2>
<ul>
<li><p><strong>AI Agent Memory</strong> for chatbots and assistants</p>
</li>
<li><p>Enterprise <strong>AI Knowledge Graph</strong> for internal search</p>
</li>
<li><p>Neo4j Bloom dashboards for leadership teams</p>
</li>
<li><p>Light‑weight RAG pipelines powered by graph facts</p>
</li>
</ul>
<hr />
<h2 id="heading-10-whats-next">10. What’s Next?</h2>
<ul>
<li><p>Integrate Graphiti memory into LangGraph / LangChain agents</p>
</li>
<li><p>Combine vector store retrieval with graph queries</p>
</li>
<li><p>Package everything in FastAPI + Streamlit for a live demo</p>
</li>
</ul>
<hr />
<h3 id="heading-prefer-video-over-text">Prefer video over text?</h3>
<p>Watch the full walkthrough here  </p>
<div class="embed-wrapper"><div class="embed-loading"><div class="loadingRow"></div><div class="loadingRow"></div></div><a class="embed-card" href="https://youtu.be/H2Cb5wbcRzo">https://youtu.be/H2Cb5wbcRzo</a></div>
<p> </p>
<hr />
<h3 id="heading-need-a-custom-ai-agent-or-graph-solution">Need a Custom AI Agent or Graph Solution?</h3>
<p>FutureSmart AI helps companies turn raw data into production-ready <strong>AI Knowledge Graphs</strong> and <strong>AI Agent Memory</strong> systems.</p>
<ul>
<li><p>End-to-end Graphiti + Neo4j pipelines</p>
</li>
<li><p>LangChain / LangGraph agent integrations</p>
</li>
<li><p>Scalable FastAPI &amp; Streamlit front-ends</p>
</li>
<li><p>Secure, cloud-native deployment</p>
</li>
</ul>
<p>Want proof we deliver? 👉 <strong>See our real-world results:</strong> <a target="_blank" href="https://futuresmart.ai/case-studies">https://futuresmart.ai/case-studies</a></p>
<blockquote>
<p><strong>Book a free consult:</strong> <a target="_blank" href="mailto:contact@futuresmart.ai">contact@futuresmart.ai</a></p>
</blockquote>
<hr />
<h3 id="heading-questions">Questions?</h3>
<p>Drop a comment on YouTube or email <a target="_blank" href="mailto:contact@futuresmart.ai"><strong>contact@futuresmart.ai</strong></a>.</p>
]]></content:encoded></item><item><title><![CDATA[AI Agents Memory: Mem0 + LangGraph Agent Integration]]></title><description><![CDATA[In this blog we’ll walk through practical steps to add long‑term memory to your AI agents using Mem0 and LangGraph. We’ll build incrementally, tackling one section at a time so you can follow along and run the code as you read.
Table of Contents

Mem...]]></description><link>https://blog.futuresmart.ai/ai-agents-memory-mem0-langgraph-agent-integration</link><guid isPermaLink="true">https://blog.futuresmart.ai/ai-agents-memory-mem0-langgraph-agent-integration</guid><category><![CDATA[ai agent memory]]></category><category><![CDATA[ai memory]]></category><category><![CDATA[langgraph]]></category><category><![CDATA[Mem0 ]]></category><dc:creator><![CDATA[Pradip Nichite]]></dc:creator><pubDate>Mon, 14 Jul 2025 11:24:29 GMT</pubDate><enclosure url="https://cdn.hashnode.com/res/hashnode/image/upload/v1752482471202/82431fa5-8b73-4d51-8811-f09e6e921957.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>In this blog we’ll walk through practical steps to add long‑term memory to your AI agents using <strong>Mem0</strong> and <strong>LangGraph</strong>. We’ll build incrementally, tackling one section at a time so you can follow along and run the code as you read.</p>
<h2 id="heading-table-of-contents">Table of Contents</h2>
<ol>
<li><p><strong>Mem0 Basics</strong> – Adding, updating, and searching memories</p>
</li>
<li><p><strong>LangGraph Integration</strong> – Wiring Mem0 into a LangGraph agent</p>
</li>
<li><p><strong>Vector DB Setup</strong> – Swapping the default SQLite store for Qdrant</p>
</li>
<li><p><strong>Cloud Usage</strong> – Using the Mem0 Cloud Platform for scalable memory management</p>
</li>
</ol>
<hr />
<h2 id="heading-why-do-ai-agents-need-memory">Why do AI agents need memory?</h2>
<p>When an LLM‑powered agent starts a brand‑new conversation it has no context about who it’s talking to or what happened in earlier sessions. Relying on the raw <em>chat history</em> works only inside a single session and quickly bloats your prompt window.<br />Long‑term <strong>memory</strong> lets the agent:</p>
<ul>
<li><p><strong>Remember user‑level facts</strong> (name, preferences, past actions) across sessions</p>
</li>
<li><p><strong>Personalise responses</strong> without re‑asking the same questions</p>
</li>
<li><p><strong>Stay efficient</strong> by storing distilled facts instead of the entire transcript</p>
</li>
</ul>
<h4 id="heading-chat-history-vs-memory">Chat history vs memory</h4>
<div class="hn-table">
<table>
<thead>
<tr>
<td>Aspect</td><td>Chat history (session)</td><td>Long‑term memory (Mem0)</td></tr>
</thead>
<tbody>
<tr>
<td>Lifespan</td><td>Only current session</td><td>Persists across sessions</td></tr>
<tr>
<td>Granularity</td><td>Full message text</td><td>Distilled facts &amp; metadata</td></tr>
<tr>
<td>Storage</td><td>In‑prompt list of messages</td><td>External DB / vector store</td></tr>
<tr>
<td>Cost impact</td><td>Grows token count quickly</td><td>Minimal extra tokens</td></tr>
</tbody>
</table>
</div><hr />
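<p>The "cost impact" row is easy to quantify with a toy word count (illustrative numbers only, not real token accounting):</p>

```python
# Replaying a full transcript grows the prompt linearly with conversation
# length, while distilled facts stay near-constant in size.
transcript = ["Hi, I'm Pradip Nichite. I run FutureSmart AI."] * 50   # 50 turns
facts = ["Name is Pradip Nichite", "Runs FutureSmart AI"]             # distilled once

transcript_words = sum(len(m.split()) for m in transcript)
fact_words = sum(len(f.split()) for f in facts)
print(transcript_words, fact_words)  # the transcript dwarfs the facts
```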
<h2 id="heading-1-mem0-basics-adding-updating-amp-searching-memories">1. Mem0 Basics – Adding, Updating &amp; Searching Memories</h2>
<h3 id="heading-quick-setup">Quick setup</h3>
<pre><code class="lang-python"><span class="hljs-keyword">from</span> mem0 <span class="hljs-keyword">import</span> Memory
memory = Memory.from_config({<span class="hljs-string">"history_db_path"</span>: <span class="hljs-string">"history.db"</span>})  <span class="hljs-comment"># local SQLite file</span>
</code></pre>
<p><strong>Why the explicit config?</strong> Mem0 defaults to a <em>read‑only</em> temp database, so writes will fail. Pointing it to <code>history.db</code> (or any path you prefer) gives the library a place to persist memories. You can extend the same <code>config</code> dict to</p>
<ul>
<li><p><strong>Override the LLM</strong> (provider, model, temperature, etc.)</p>
</li>
<li><p><strong>Plug in a vector store</strong> for semantic search (we’ll wire up Qdrant in Section 3).</p>
</li>
</ul>
<p>Example – switching to GPT‑4.1‑mini:</p>
<pre><code class="lang-python">config = {
    <span class="hljs-string">"history_db_path"</span>: <span class="hljs-string">"history.db"</span>,
    <span class="hljs-string">"llm"</span>: {
        <span class="hljs-string">"provider"</span>: <span class="hljs-string">"openai"</span>,
        <span class="hljs-string">"config"</span>: {
            <span class="hljs-string">"model"</span>: <span class="hljs-string">"gpt-4.1-mini"</span>,
            <span class="hljs-string">"temperature"</span>: <span class="hljs-number">0.2</span>,
            <span class="hljs-string">"max_tokens"</span>: <span class="hljs-number">2000</span>
        }
    }
}
memory = Memory.from_config(config)
</code></pre>
<h3 id="heading-add-your-first-memories">Add your first memories</h3>
<pre><code class="lang-python">memory.add([
    {<span class="hljs-string">"role"</span>: <span class="hljs-string">"user"</span>, <span class="hljs-string">"content"</span>: <span class="hljs-string">"Hi, I'm Pradip Nichite. I run FutureSmart AI, where we build custom AI solutions."</span>}
], user_id=<span class="hljs-string">"pradip"</span>)

memory.add([
    {<span class="hljs-string">"role"</span>: <span class="hljs-string">"user"</span>, <span class="hljs-string">"content"</span>: <span class="hljs-string">"I love building RAG and AI Agent solutions that actually work in production."</span>}
], user_id=<span class="hljs-string">"pradip"</span>, metadata={<span class="hljs-string">"category"</span>: <span class="hljs-string">"preferences"</span>})
</code></pre>
<p>Sample response:</p>
<pre><code class="lang-python">{<span class="hljs-string">'results'</span>: [{<span class="hljs-string">'id'</span>: <span class="hljs-string">'5408e326‑b26b‑4737‑a404‑299887b8d597'</span>,
  <span class="hljs-string">'memory'</span>: <span class="hljs-string">'Loves building RAG and AI Agent solutions that work in production'</span>,
  <span class="hljs-string">'event'</span>: <span class="hljs-string">'ADD'</span>}]}
</code></pre>
<p>Mem0 distills each raw chat message into a concise fact so retrieval stays lightweight.</p>
<hr />
<h3 id="heading-search">Search</h3>
<pre><code class="lang-python">related = memory.search(<span class="hljs-string">"who am i"</span>, user_id=<span class="hljs-string">"pradip"</span>)
related
</code></pre>
<p>Full output:</p>
<pre><code class="lang-python">{<span class="hljs-string">'results'</span>: [{<span class="hljs-string">'id'</span>: <span class="hljs-string">'647935d5-f913-496d-96e3-2233d7459f38'</span>,
   <span class="hljs-string">'memory'</span>: <span class="hljs-string">'Name is Pradip Nichite'</span>,
   <span class="hljs-string">'hash'</span>: <span class="hljs-string">'fa942a6331bb89da286d4a9e296d1008'</span>,
   <span class="hljs-string">'metadata'</span>: <span class="hljs-literal">None</span>,
   <span class="hljs-string">'score'</span>: <span class="hljs-number">0.2294486506181006</span>,
   <span class="hljs-string">'created_at'</span>: <span class="hljs-string">'2025-07-12T10:58:49.132915-07:00'</span>,
   <span class="hljs-string">'updated_at'</span>: <span class="hljs-literal">None</span>,
   <span class="hljs-string">'user_id'</span>: <span class="hljs-string">'pradip'</span>},
  {<span class="hljs-string">'id'</span>: <span class="hljs-string">'c763c19a-7e9f-4180-8c82-012f4da5f637'</span>,
   <span class="hljs-string">'memory'</span>: <span class="hljs-string">'Runs FutureSmart AI'</span>,
   <span class="hljs-string">'hash'</span>: <span class="hljs-string">'68a143a88a3e67ae9ebfb9575bcf49a7'</span>,
   <span class="hljs-string">'metadata'</span>: <span class="hljs-literal">None</span>,
   <span class="hljs-string">'score'</span>: <span class="hljs-number">0.1551292009096673</span>,
   <span class="hljs-string">'created_at'</span>: <span class="hljs-string">'2025-07-12T10:58:49.158843-07:00'</span>,
   <span class="hljs-string">'updated_at'</span>: <span class="hljs-literal">None</span>,
   <span class="hljs-string">'user_id'</span>: <span class="hljs-string">'pradip'</span>},
  {<span class="hljs-string">'id'</span>: <span class="hljs-string">'5408e326-b26b-4737-a404-299887b8d597'</span>,
   <span class="hljs-string">'memory'</span>: <span class="hljs-string">'Loves building RAG and AI Agent solutions that work in production'</span>,
......
   <span class="hljs-string">'user_id'</span>: <span class="hljs-string">'pradip'</span>},
  {<span class="hljs-string">'id'</span>: <span class="hljs-string">'1baa6793-507f-46b1-8e01-b90dfa1e73b6'</span>,
   <span class="hljs-string">'memory'</span>: <span class="hljs-string">'Builds custom AI solutions'</span>,
.......
   <span class="hljs-string">'user_id'</span>: <span class="hljs-string">'pradip'</span>}]}
</code></pre>
<p><code>score</code> is cosine similarity—higher means closer semantic match.</p>
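<p>For intuition, here is a minimal cosine-similarity sketch (the standard formula, not Mem0's internal code):</p>

```python
import math

def cosine(a: list[float], b: list[float]) -> float:
    """Cosine similarity: dot product divided by the product of norms."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb)

print(cosine([1.0, 0.0], [1.0, 0.0]))  # identical direction → 1.0
print(cosine([1.0, 0.0], [0.0, 1.0]))  # orthogonal → 0.0
```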
<h3 id="heading-get-all-memories-for-a-user">Get all memories for a user</h3>
<pre><code class="lang-python">all_memories = memory.get_all(user_id=<span class="hljs-string">"pradip"</span>)
</code></pre>
<p>Returns the full list (same schema as <code>search</code>, without scores).</p>
<h3 id="heading-retrieve-a-single-memory">Retrieve a single memory</h3>
<pre><code class="lang-python">mem_id = <span class="hljs-string">"1baa6793-507f-46b1-8e01-b90dfa1e73b6"</span>
memory.get(mem_id)
</code></pre>
<p>Full output:</p>
<pre><code class="lang-python">{<span class="hljs-string">'id'</span>: <span class="hljs-string">'1baa6793-507f-46b1-8e01-b90dfa1e73b6'</span>,
 <span class="hljs-string">'memory'</span>: <span class="hljs-string">'Builds custom Gen AI solutions'</span>,
 <span class="hljs-string">'hash'</span>: <span class="hljs-string">'502bdf5771e4ef9a812453b51870f0b2'</span>,
 <span class="hljs-string">'metadata'</span>: <span class="hljs-literal">None</span>,
 <span class="hljs-string">'score'</span>: <span class="hljs-literal">None</span>,
 <span class="hljs-string">'created_at'</span>: <span class="hljs-string">'2025-07-12T10:58:49.182159-07:00'</span>,
 <span class="hljs-string">'updated_at'</span>: <span class="hljs-string">'2025-07-12T11:02:34.594521-07:00'</span>,
 <span class="hljs-string">'user_id'</span>: <span class="hljs-string">'pradip'</span>}
</code></pre>
<h3 id="heading-update-a-memory">Update a memory</h3>
<pre><code class="lang-python">memory.update(memory_id=mem_id, data=<span class="hljs-string">"Builds custom Gen AI solutions"</span>)
<span class="hljs-comment"># → {'message': 'Memory updated successfully!'}</span>
</code></pre>
<h3 id="heading-view-change-history">View change history</h3>
<pre><code class="lang-python">history = memory.history(memory_id=mem_id)
</code></pre>
<p>Full output:</p>
<pre><code class="lang-python">[{<span class="hljs-string">'id'</span>: <span class="hljs-string">'e7242249-430a-4bf2-b4df-ca0e4b99e69a'</span>,
  <span class="hljs-string">'memory_id'</span>: <span class="hljs-string">'1baa6793-507f-46b1-8e01-b90dfa1e73b6'</span>,
  <span class="hljs-string">'old_memory'</span>: <span class="hljs-literal">None</span>,
  <span class="hljs-string">'new_memory'</span>: <span class="hljs-string">'Builds custom AI solutions'</span>,
  <span class="hljs-string">'event'</span>: <span class="hljs-string">'ADD'</span>,
  <span class="hljs-string">'created_at'</span>: <span class="hljs-string">'2025-07-12T10:58:49.182159-07:00'</span>,
  <span class="hljs-string">'updated_at'</span>: <span class="hljs-literal">None</span>,
  <span class="hljs-string">'is_deleted'</span>: <span class="hljs-literal">False</span>,
  <span class="hljs-string">'actor_id'</span>: <span class="hljs-literal">None</span>,
  <span class="hljs-string">'role'</span>: <span class="hljs-literal">None</span>},
 {<span class="hljs-string">'id'</span>: <span class="hljs-string">'9d2e9706-b0c2-480b-b37a-07bb6143767d'</span>,
  <span class="hljs-string">'memory_id'</span>: <span class="hljs-string">'1baa6793-507f-46b1-8e01-b90dfa1e73b6'</span>,
  <span class="hljs-string">'old_memory'</span>: <span class="hljs-string">'Builds custom AI solutions'</span>,
  <span class="hljs-string">'new_memory'</span>: <span class="hljs-string">'Builds custom Gen AI solutions'</span>,
  <span class="hljs-string">'event'</span>: <span class="hljs-string">'UPDATE'</span>,
  <span class="hljs-string">'created_at'</span>: <span class="hljs-string">'2025-07-12T10:58:49.182159-07:00'</span>,
  <span class="hljs-string">'updated_at'</span>: <span class="hljs-string">'2025-07-12T11:02:34.594521-07:00'</span>,
  <span class="hljs-string">'is_deleted'</span>: <span class="hljs-literal">False</span>,
  <span class="hljs-string">'actor_id'</span>: <span class="hljs-literal">None</span>,
  <span class="hljs-string">'role'</span>: <span class="hljs-literal">None</span>}]
</code></pre>
<p>Each entry records the old &amp; new value plus timestamp—handy for auditing:</p>
<pre><code class="lang-python">[{<span class="hljs-string">'event'</span>: <span class="hljs-string">'ADD'</span>,    <span class="hljs-string">'old_memory'</span>: <span class="hljs-literal">None</span>,                     <span class="hljs-string">'new_memory'</span>: <span class="hljs-string">'Builds custom AI solutions'</span>},
 {<span class="hljs-string">'event'</span>: <span class="hljs-string">'UPDATE'</span>, <span class="hljs-string">'old_memory'</span>: <span class="hljs-string">'Builds custom AI solutions'</span>,
  <span class="hljs-string">'new_memory'</span>: <span class="hljs-string">'Builds custom Gen AI solutions'</span>}]
</code></pre>
<p>That wraps up the core CRUD API.</p>
<hr />
<h2 id="heading-2-langgraph-integration-wiring-mem0-into-an-agent">2. LangGraph Integration – Wiring Mem0 into an Agent</h2>
<p><strong>New to LangGraph?</strong> Watch my YouTube walkthrough that covers LangGraph basics all the way to advanced patterns.</p>
<div class="embed-wrapper"><div class="embed-loading"><div class="loadingRow"></div><div class="loadingRow"></div></div><a class="embed-card" href="https://youtu.be/60XDTWhklLA?si=wrxb_SJ7XaMsel2B">https://youtu.be/60XDTWhklLA?si=wrxb_SJ7XaMsel2B</a></div>
<p> </p>
<p>Below we build the simplest possible LangGraph agent that:</p>
<ol>
<li><p>Accepts user messages.</p>
</li>
<li><p>Retrieves relevant memories from Mem0.</p>
</li>
<li><p>Injects them into the system prompt for personalised replies.</p>
</li>
<li><p>Writes the new interaction back to Mem0.</p>
</li>
</ol>
<h3 id="heading-a-define-the-shared-state">a) Define the shared state</h3>
<pre><code class="lang-python"><span class="hljs-keyword">from</span> typing <span class="hljs-keyword">import</span> Annotated, TypedDict
<span class="hljs-keyword">from</span> langgraph.graph.message <span class="hljs-keyword">import</span> add_messages
<span class="hljs-keyword">from</span> langchain_core.messages <span class="hljs-keyword">import</span> BaseMessage

<span class="hljs-class"><span class="hljs-keyword">class</span> <span class="hljs-title">State</span>(<span class="hljs-params">TypedDict</span>):</span>
    <span class="hljs-string">"""Conversation state passed between nodes"""</span>
    messages: Annotated[list[BaseMessage], add_messages]  <span class="hljs-comment"># chat history for this request</span>
    mem0_user_id: str                                     <span class="hljs-comment"># maps to Mem0 user record</span>
</code></pre>
<h3 id="heading-b-init-the-llm">b) Init the LLM</h3>
<pre><code class="lang-python"><span class="hljs-keyword">from</span> langchain_openai <span class="hljs-keyword">import</span> ChatOpenAI
llm = ChatOpenAI(model=<span class="hljs-string">"gpt-4.1-mini"</span>, temperature=<span class="hljs-number">0.7</span>)
</code></pre>
<h3 id="heading-c-create-the-chatbot-node">c) Create the chatbot node</h3>
<pre><code class="lang-python"><span class="hljs-keyword">from</span> langchain_core.messages <span class="hljs-keyword">import</span> HumanMessage, AIMessage, SystemMessage

<span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">chatbot</span>(<span class="hljs-params">state: State</span>):</span>
    <span class="hljs-keyword">global</span> memory  <span class="hljs-comment"># re‑use the Mem0 instance from Section 1</span>
    msgs = state[<span class="hljs-string">"messages"</span>]
    uid = state[<span class="hljs-string">"mem0_user_id"</span>]

    <span class="hljs-comment"># 1️⃣ Retrieve memories relevant to the latest user msg</span>
    mems = memory.search(msgs[<span class="hljs-number">-1</span>].content, user_id=uid)
    print(<span class="hljs-string">f"Retrieved Memories: <span class="hljs-subst">{mems}</span>"</span>)

    <span class="hljs-comment"># Build context string</span>
    <span class="hljs-keyword">if</span> mems[<span class="hljs-string">"results"</span>]:
        context = <span class="hljs-string">"\n"</span>.join(<span class="hljs-string">f"- <span class="hljs-subst">{m[<span class="hljs-string">'memory'</span>]}</span>"</span> <span class="hljs-keyword">for</span> m <span class="hljs-keyword">in</span> mems[<span class="hljs-string">"results"</span>])
    <span class="hljs-keyword">else</span>:
        context = <span class="hljs-string">"No relevant information found."</span>

    system = SystemMessage(content=<span class="hljs-string">f"""You are a helpful assistant. Use the provided context to personalise your responses.
Relevant information from previous conversations:
<span class="hljs-subst">{context}</span>"""</span>)

    <span class="hljs-comment"># 2️⃣ Invoke the LLM</span>
    response = llm.invoke([system] + msgs)

    <span class="hljs-comment"># 3️⃣ Persist the new turn</span>
    memory.add([
        {<span class="hljs-string">"role"</span>: <span class="hljs-string">"user"</span>, <span class="hljs-string">"content"</span>: msgs[<span class="hljs-number">-1</span>].content},
        {<span class="hljs-string">"role"</span>: <span class="hljs-string">"assistant"</span>, <span class="hljs-string">"content"</span>: response.content}
    ], user_id=uid)

    <span class="hljs-keyword">return</span> {<span class="hljs-string">"messages"</span>: [response]}
</code></pre>
<p><strong>How the node uses Mem0</strong></p>
<ol>
<li><p><strong>Search:</strong> For every incoming user message, we call <code>memory.search()</code> with the text and <code>user_id</code>. This performs a vector‑similarity lookup and returns any facts previously stored about the user.</p>
</li>
<li><p><strong>Prompt injection:</strong> Those facts are concatenated into a bullet list (<code>context</code>) and inserted into a system prompt so the LLM can personalise its reply.</p>
</li>
<li><p><strong>Add:</strong> After the LLM responds, we persist <strong>both</strong> the latest user message and the assistant reply via <code>memory.add()</code>. Mem0 distils them into new memories ready for the next turn.</p>
</li>
</ol>
<h3 id="heading-d-build-amp-compile-the-graph">d) Build &amp; compile the graph</h3>
<pre><code class="lang-python"><span class="hljs-keyword">from</span> langgraph.graph <span class="hljs-keyword">import</span> StateGraph, START, END

graph_builder = StateGraph(State)

graph_builder.add_node(<span class="hljs-string">"chatbot"</span>, chatbot)

graph_builder.add_edge(START, <span class="hljs-string">"chatbot"</span>)

graph_builder.add_edge(<span class="hljs-string">"chatbot"</span>, END)

graph = graph_builder.compile()
print(<span class="hljs-string">"Graph compiled successfully ✅"</span>)
</code></pre>
<h3 id="heading-e-commandline-loop-for-quick-testing">e) Command‑line loop for quick testing</h3>
<pre><code class="lang-python"><span class="hljs-keyword">from</span> langchain_core.messages <span class="hljs-keyword">import</span> HumanMessage

<span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">run_conversation</span>(<span class="hljs-params">user_input: str, mem0_user_id: str</span>):</span>
    state = {<span class="hljs-string">"messages"</span>: [HumanMessage(content=user_input)], <span class="hljs-string">"mem0_user_id"</span>: mem0_user_id}
    result = graph.invoke(state)
    print(<span class="hljs-string">"🤖"</span>, result[<span class="hljs-string">"messages"</span>][<span class="hljs-number">-1</span>].content)

<span class="hljs-keyword">if</span> __name__ == <span class="hljs-string">"__main__"</span>:
    uid = <span class="hljs-string">"customer_pradip"</span>
    <span class="hljs-keyword">while</span> <span class="hljs-literal">True</span>:
        inp = input(<span class="hljs-string">"You: "</span>)
        <span class="hljs-keyword">if</span> inp.lower() <span class="hljs-keyword">in</span> {<span class="hljs-string">"quit"</span>, <span class="hljs-string">"exit"</span>, <span class="hljs-string">"bye"</span>}:
            <span class="hljs-keyword">break</span>
        run_conversation(inp, uid)
</code></pre>
<p>Run it, send two or three messages, then restart the script and ask <em>“who am I?”</em>—you’ll see the agent recall facts from the earlier run thanks to Mem0’s long‑term store.</p>
<hr />
<h2 id="heading-3-vector-db-setup-configuring-mem0-with-qdrant">3. Vector DB Setup – Configuring Mem0 with Qdrant</h2>
<p>SQLite works for quick tests, but once memories grow you’ll want a proper vector store. <strong>Qdrant Cloud</strong> offers a generous free tier and plugs straight into Mem0.</p>
<h3 id="heading-a-spin-up-locate-a-qdrant-cloud-cluster">a) Spin up / locate a Qdrant Cloud cluster</h3>
<p>Grab the cluster URL and create an API key from the Qdrant dashboard.</p>
<pre><code class="lang-bash"><span class="hljs-comment"># Install the Python client</span>
!pip -q install qdrant_client
</code></pre>
<h3 id="heading-b-verify-connectivity-optional">b) Verify connectivity (optional)</h3>
<pre><code class="lang-python"><span class="hljs-keyword">from</span> qdrant_client <span class="hljs-keyword">import</span> QdrantClient
<span class="hljs-keyword">from</span> google.colab <span class="hljs-keyword">import</span> userdata  <span class="hljs-comment"># Colab secrets helper (assumes a Colab notebook)</span>

qdrant = QdrantClient(
    url=<span class="hljs-string">"https://&lt;cluster-id&gt;.&lt;region&gt;.aws.cloud.qdrant.io:6333"</span>,
    api_key=userdata.get(<span class="hljs-string">"Qdrant_API_KEY"</span>)
)
print(qdrant.get_collections())  <span class="hljs-comment"># sanity‑check</span>
</code></pre>
<h3 id="heading-c-tell-mem0-to-use-qdrant">c) Tell Mem0 to use Qdrant</h3>
<pre><code class="lang-python">collection_name = <span class="hljs-string">"mem0_yt"</span>

config = {
    <span class="hljs-string">"vector_store"</span>: {
        <span class="hljs-string">"provider"</span>: <span class="hljs-string">"qdrant"</span>,
        <span class="hljs-string">"config"</span>: {
            <span class="hljs-string">"collection_name"</span>: collection_name,
            <span class="hljs-string">"host"</span>: <span class="hljs-string">"&lt;cluster-host&gt;"</span>,
            <span class="hljs-string">"port"</span>: <span class="hljs-number">6333</span>,
            <span class="hljs-string">"api_key"</span>: userdata.get(<span class="hljs-string">"Qdrant_API_KEY"</span>)
        }
    }
}

memory = Memory.from_config(config)
</code></pre>
<h3 id="heading-d-onetime-payload-index">d) One‑time payload index</h3>
<p>Mem0 filters by <code>user_id</code> when searching, so Qdrant needs a keyword index on that field. If you skip this step you’ll get:</p>
<pre><code class="lang-text">400 Bad Request – Index required but not found for "user_id" of type [keyword]
</code></pre>
<p>Create it once, then you’re good:</p>
<pre><code class="lang-python">qdrant.create_payload_index(
    collection_name=collection_name,
    field_name=<span class="hljs-string">"user_id"</span>,
    field_schema=<span class="hljs-string">"keyword"</span>
)
</code></pre>
<h3 id="heading-e-insert-and-query-as-usual">e) Insert and query as usual</h3>
<pre><code class="lang-python">messages = [
    {<span class="hljs-string">"role"</span>: <span class="hljs-string">"user"</span>, <span class="hljs-string">"content"</span>: <span class="hljs-string">"Hi, I'm Pradip Nichite. I run FutureSmart AI."</span>},
    {<span class="hljs-string">"role"</span>: <span class="hljs-string">"user"</span>, <span class="hljs-string">"content"</span>: <span class="hljs-string">"I love building RAG and AI Agent solutions that work in production."</span>}
]
memory.add(messages, user_id=<span class="hljs-string">"pradip"</span>)
</code></pre>
<p>From here all CRUD and LangGraph logic stays exactly the same—only the storage layer has changed.</p>
<hr />
<h2 id="heading-4-cloud-usage-using-the-mem0-cloud-platform">4. Cloud Usage – Using the Mem0 Cloud Platform</h2>
<p>If you’d rather skip managing your own DBs, Mem0 offers a hosted platform with a clean UI to inspect and edit memories.</p>
<h3 id="heading-a-authenticate">a) Authenticate</h3>
<pre><code class="lang-python"><span class="hljs-keyword">from</span> mem0 <span class="hljs-keyword">import</span> MemoryClient
<span class="hljs-keyword">from</span> google.colab <span class="hljs-keyword">import</span> userdata  <span class="hljs-comment"># Colab secrets helper (assumes a Colab notebook)</span>

client = MemoryClient(api_key=userdata.get(<span class="hljs-string">"Mem0_API_KEY"</span>))
</code></pre>
<h3 id="heading-b-add-messages-same-schema-as-before">b) Add messages (same schema as before)</h3>
<pre><code class="lang-python">messages = [
    {<span class="hljs-string">"role"</span>: <span class="hljs-string">"user"</span>,      <span class="hljs-string">"content"</span>: <span class="hljs-string">"Hi, I am Pradip. I am Founder of FutureSmart AI"</span>},
    {<span class="hljs-string">"role"</span>: <span class="hljs-string">"assistant"</span>, <span class="hljs-string">"content"</span>: <span class="hljs-string">"Hi Pradip"</span>}
]

client.add(messages, user_id=<span class="hljs-string">"Pradip_Founder"</span>)
</code></pre>
<h3 id="heading-c-inspect-in-the-dashboard">c) Inspect in the dashboard</h3>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1752475996169/f275c471-8fd7-4dc4-bf00-8e5308d124cf.png" alt class="image--center mx-auto" /></p>
<blockquote>
<p>You’ll see two distilled memories automatically extracted, complete with timestamps and editable fields.</p>
</blockquote>
<p>The hosted store supports the same search/update/history API, so you can swap <code>Memory</code> for <code>MemoryClient</code> with minimal changes.</p>
<hr />
<h3 id="heading-watch-the-full-walkthrough">Watch the full walkthrough</h3>
<p>Prefer video? I recorded a step‑by‑step YouTube demo that mirrors this blog, including live coding and UI tours – check it out here 👇</p>
<div class="embed-wrapper"><div class="embed-loading"><div class="loadingRow"></div><div class="loadingRow"></div></div><a class="embed-card" href="https://youtu.be/e-wBojpJrrQ">https://youtu.be/e-wBojpJrrQ</a></div>
<hr />
<h2 id="heading-need-a-custom-ai-solution">Need a Custom AI Solution?</h2>
<p>At <strong>FutureSmart AI</strong> we specialise in designing and shipping production‑grade AI systems—RAG agents, document parsers, NL2SQL bots, multi‑agent workflows, and more.</p>
<p>→ <strong>See our case studies:</strong> <a target="_blank" href="https://futuresmart.ai/case-studies">https://futuresmart.ai/case-studies</a><br />→ <strong>Try the LangGraph‑powered FutureSmart Agent:</strong> <a target="_blank" href="https://agent.futuresmart.ai/">https://agent.futuresmart.ai/</a><br />→ <strong>Get in touch:</strong> email us at <a target="_blank" href="mailto:contact@futuresmart.ai"><strong>contact@futuresmart.ai</strong></a> to discuss how we can build or fine‑tune an AI solution for your business.</p>
]]></content:encoded></item><item><title><![CDATA[LangGraph RAG Agent Tutorial | Basics to Advanced Multi-Agent AI Chatbot]]></title><description><![CDATA[Retrieval-Augmented Generation (RAG) is becoming the go-to pattern for building AI systems that can fetch real-time or domain-specific knowledge on demand. But RAG alone doesn’t make your chatbot smart.
With LangGraph, you can build stateful, agent-l...]]></description><link>https://blog.futuresmart.ai/langgraph-rag-agent-tutorial-basics-to-advanced-multi-agent-ai-chatbot</link><guid isPermaLink="true">https://blog.futuresmart.ai/langgraph-rag-agent-tutorial-basics-to-advanced-multi-agent-ai-chatbot</guid><category><![CDATA[langgraph]]></category><category><![CDATA[langchain]]></category><category><![CDATA[RAG ]]></category><category><![CDATA[agentic rag]]></category><category><![CDATA[ai-agent]]></category><dc:creator><![CDATA[Pradip Nichite]]></dc:creator><pubDate>Sat, 21 Jun 2025 13:40:26 GMT</pubDate><enclosure url="https://cdn.hashnode.com/res/hashnode/image/upload/v1750513172885/08c560e5-1e70-4433-8ede-da5208269de6.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Retrieval-Augmented Generation (RAG) is becoming the go-to pattern for building AI systems that can fetch real-time or domain-specific knowledge on demand. But RAG alone doesn’t make your chatbot smart.</p>
<p>With LangGraph, you can build stateful, agent-like flows that combine tools, memory, structured decision logic, and retrieval—all driven by LLMs.</p>
<p>In this blog, we’ll build up to a full LangGraph-based RAG Agent from scratch. We'll follow a practical path:</p>
<ol>
<li><p>Start with basic LLM usage</p>
</li>
<li><p>Bind tools to the LLM</p>
</li>
<li><p>Use LangGraph to build stateful agents</p>
</li>
<li><p>Add memory, routing logic, and tool execution</p>
</li>
<li><p>Finally, combine all of it with document retrieval to create a RAG-powered agent</p>
</li>
</ol>
<p>Each section mirrors what you’d build in a notebook, but with clear explanations to help you understand <em>why</em> each piece matters.</p>
<p>Let’s start with the first building block: invoking an LLM.</p>
<hr />
<h3 id="heading-step-1-invoking-a-language-model-llm">🧠 Step 1: Invoking a Language Model (LLM)</h3>
<p>To begin, we’ll use <code>ChatOpenAI</code> from LangChain to invoke a language model. We’ll keep it simple:</p>
<pre><code class="lang-python"><span class="hljs-keyword">from</span> langchain_openai <span class="hljs-keyword">import</span> ChatOpenAI

<span class="hljs-comment"># Initialize the LLM</span>
llm = ChatOpenAI(model=<span class="hljs-string">"gpt-4.1-mini"</span>, temperature=<span class="hljs-number">0.7</span>)

<span class="hljs-comment"># Basic prompt</span>
response = llm.invoke(<span class="hljs-string">"What is artificial intelligence?"</span>)
print(response.content)
</code></pre>
<p>This returns a standard response from the LLM. But the real value comes when you treat the LLM like a conversation partner using message objects:</p>
<pre><code class="lang-python"><span class="hljs-keyword">from</span> langchain_core.messages <span class="hljs-keyword">import</span> HumanMessage, SystemMessage

messages = [
    SystemMessage(content=<span class="hljs-string">"You are a helpful AI assistant that explains complex topics simply."</span>),
    HumanMessage(content=<span class="hljs-string">"Explain machine learning in 2 sentences."</span>)
]

response = llm.invoke(messages)
print(response.content)
</code></pre>
<p>Using <code>SystemMessage</code> and <code>HumanMessage</code> gives you more control over behavior and tone. It’s also how you’ll structure inputs later when building memory-enabled and multi-step agents.</p>
<p>Now that we can invoke an LLM in both simple and structured ways, we’re ready to start integrating tools.</p>
<hr />
<h3 id="heading-step-2-extending-llms-with-tools">🔧 Step 2: Extending LLMs with Tools</h3>
<p>LLMs are powerful, but they can’t do math or fetch real-time information on their own. To make your LLM truly useful, you can bind it with external tools. Here’s how:</p>
<pre><code class="lang-python"><span class="hljs-keyword">from</span> langchain_core.tools <span class="hljs-keyword">import</span> tool
<span class="hljs-keyword">from</span> langchain_community.tools <span class="hljs-keyword">import</span> DuckDuckGoSearchRun

<span class="hljs-meta">@tool</span>
<span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">calculator</span>(<span class="hljs-params">expression: str</span>) -&gt; str:</span>
    <span class="hljs-string">"""Calculate mathematical expressions. Use this for any math calculations."""</span>
    <span class="hljs-keyword">try</span>:
        result = eval(expression)  <span class="hljs-comment"># demo only – eval() on untrusted input is unsafe</span>
        <span class="hljs-keyword">return</span> <span class="hljs-string">f"The result of <span class="hljs-subst">{expression}</span> is <span class="hljs-subst">{result}</span>"</span>
    <span class="hljs-keyword">except</span> Exception <span class="hljs-keyword">as</span> e:
        <span class="hljs-keyword">return</span> <span class="hljs-string">f"Error calculating <span class="hljs-subst">{expression}</span>: <span class="hljs-subst">{str(e)}</span>"</span>

search_tool = DuckDuckGoSearchRun()
</code></pre>
<p>We now have two tools:</p>
<ul>
<li><p><code>calculator</code> to perform basic arithmetic</p>
</li>
<li><p><code>search_tool</code> to fetch info from the web</p>
</li>
</ul>
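<p>One caveat: <code>eval()</code> on model-generated text is fine for a demo but unsafe in production. A minimal safer sketch using the standard-library <code>ast</code> module, restricted to plain arithmetic (my own illustration, not part of LangChain):</p>
<pre><code class="lang-python">import ast
import operator

# Whitelist of allowed binary operators
_OPS = {
    ast.Add: operator.add,
    ast.Sub: operator.sub,
    ast.Mult: operator.mul,
    ast.Div: operator.truediv,
    ast.Pow: operator.pow,
    ast.Mod: operator.mod,
}

def safe_eval(expression):
    """Evaluate a basic arithmetic expression without calling eval()."""
    def _eval(node):
        if isinstance(node, ast.Expression):
            return _eval(node.body)
        if isinstance(node, ast.Constant) and isinstance(node.value, (int, float)):
            return node.value
        if isinstance(node, ast.BinOp) and type(node.op) in _OPS:
            return _OPS[type(node.op)](_eval(node.left), _eval(node.right))
        if isinstance(node, ast.UnaryOp) and isinstance(node.op, ast.USub):
            return -_eval(node.operand)
        raise ValueError(f"Unsupported expression: {expression}")
    return _eval(ast.parse(expression, mode="eval"))

print(safe_eval("25 * 4 + 17"))  # 117
</code></pre>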
<p>To bind these tools to the LLM:</p>
<pre><code class="lang-python"><span class="hljs-comment"># Bind tools to the LLM</span>
tools = [calculator, search_tool]
llm_with_tools = llm.bind_tools(tools)
</code></pre>
<p>Let’s test the LLM with tools:</p>
<pre><code class="lang-python">response = llm_with_tools.invoke(<span class="hljs-string">"What's 25 * 4 + 17?"</span>)
print(response.content)
</code></pre>
<p>However, when an LLM is tool-enabled, its response might include <code>tool_calls</code> instead of just plain text. To handle that:</p>
<pre><code class="lang-python"><span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">handle_tool_calls</span>(<span class="hljs-params">response, tool_map</span>):</span>
    <span class="hljs-keyword">if</span> <span class="hljs-keyword">not</span> getattr(response, <span class="hljs-string">'tool_calls'</span>, <span class="hljs-literal">None</span>):
        <span class="hljs-keyword">return</span>

    <span class="hljs-keyword">for</span> tool_call <span class="hljs-keyword">in</span> response.tool_calls:
        tool_name = tool_call[<span class="hljs-string">'name'</span>]
        args = tool_call[<span class="hljs-string">'args'</span>]
        tool = tool_map.get(tool_name)
        <span class="hljs-keyword">if</span> tool:
            result = tool.invoke(args)
            print(<span class="hljs-string">f"Tool result: <span class="hljs-subst">{result}</span>"</span>)
</code></pre>
<p>Then:</p>
<pre><code class="lang-python">tool_map = {
    <span class="hljs-string">'calculator'</span>: calculator,
    <span class="hljs-string">'duckduckgo_search'</span>: search_tool,
}

<span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">test_llm_tool</span>(<span class="hljs-params">query</span>):</span>
    response = llm_with_tools.invoke(query)
    print(response.content)
    handle_tool_calls(response, tool_map)

<span class="hljs-comment"># Run some queries</span>
test_llm_tool(<span class="hljs-string">"What's 25 * 4 + 17?"</span>)
test_llm_tool(<span class="hljs-string">"Search for recent news about artificial intelligence"</span>)
</code></pre>
<p>With this setup, your LLM is now a <em>tool-using agent</em>.</p>
<p>Next, we’ll take this a step further by wiring everything into a LangGraph to make it stateful and multi-turn.</p>
<hr />
<h3 id="heading-step-3-building-a-basic-langgraph-chatbot">🧩 Step 3: Building a Basic LangGraph Chatbot</h3>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1750513436662/a0dde8f7-3b8d-40df-a6c4-526914621c70.png" alt class="image--center mx-auto" /></p>
<p>At its core, LangGraph lets you define a graph of nodes that process conversational state. Let’s begin with a minimal chatbot graph.</p>
<h4 id="heading-define-chatbot-state">Define Chatbot State</h4>
<pre><code class="lang-python"><span class="hljs-keyword">from</span> typing <span class="hljs-keyword">import</span> Annotated, TypedDict
<span class="hljs-keyword">from</span> langchain_core.messages <span class="hljs-keyword">import</span> BaseMessage, HumanMessage, AIMessage
<span class="hljs-keyword">from</span> langgraph.graph.message <span class="hljs-keyword">import</span> add_messages

<span class="hljs-class"><span class="hljs-keyword">class</span> <span class="hljs-title">State</span>(<span class="hljs-params">TypedDict</span>):</span>
    messages: Annotated[list[BaseMessage], add_messages]
</code></pre>
<p>Here, we define a <code>State</code> object that will carry the conversation. The <code>add_messages</code> function ensures new messages are appended correctly.</p>
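<p>Under the hood, <code>add_messages</code> is a reducer: an update whose <code>id</code> matches an existing message replaces it, and anything else is appended. A toy stdlib sketch of that merge rule (plain dicts stand in for message objects):</p>
<pre><code class="lang-python">def merge_messages(existing, updates):
    """Toy version of the add_messages reducer: replace by id, else append."""
    merged = list(existing)
    index_by_id = {m["id"]: i for i, m in enumerate(merged)}
    for msg in updates:
        if msg["id"] in index_by_id:
            merged[index_by_id[msg["id"]]] = msg  # same id – replace in place
        else:
            merged.append(msg)                    # new id – append
    return merged

history = [{"id": "1", "role": "user", "content": "Hello!"}]
history = merge_messages(history, [{"id": "2", "role": "assistant", "content": "Hi there"}])
print(len(history))  # 2
</code></pre>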
<h4 id="heading-create-the-chatbot-node">Create the Chatbot Node</h4>
<pre><code class="lang-python"><span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">chatbot_node</span>(<span class="hljs-params">state: State</span>) -&gt; State:</span>
    response = llm.invoke(state[<span class="hljs-string">"messages"</span>])
    <span class="hljs-keyword">return</span> {<span class="hljs-string">"messages"</span>: [response]}
</code></pre>
<p>This node accepts messages and returns the updated state with the AI's response.</p>
<h4 id="heading-build-and-compile-the-graph">Build and Compile the Graph</h4>
<pre><code class="lang-python"><span class="hljs-keyword">from</span> langgraph.graph <span class="hljs-keyword">import</span> StateGraph, START, END

graph_builder = StateGraph(State)
graph_builder.add_node(<span class="hljs-string">"chatbot"</span>, chatbot_node)
graph_builder.add_edge(START, <span class="hljs-string">"chatbot"</span>)
graph_builder.add_edge(<span class="hljs-string">"chatbot"</span>, END)
graph = graph_builder.compile()
</code></pre>
<p>This sets up a simple one-node chatbot pipeline. You can now test it:</p>
<pre><code class="lang-python"><span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">test_chatbot</span>(<span class="hljs-params">message: str</span>):</span>
    initial_state = {<span class="hljs-string">"messages"</span>: [HumanMessage(content=message)]}
    result = graph.invoke(initial_state)
    print(<span class="hljs-string">"🤖 Assistant:"</span>, result[<span class="hljs-string">"messages"</span>][<span class="hljs-number">-1</span>].content)

test_chatbot(<span class="hljs-string">"Hello! My name is Pradip"</span>)
test_chatbot(<span class="hljs-string">"Do you remember my name?"</span>)
</code></pre>
<p>You’ll notice it doesn’t remember past messages yet. That’s what we’ll fix in the next step—by adding memory.</p>
<hr />
<h3 id="heading-step-4-adding-memory-to-the-chatbot">🧠 Step 4: Adding Memory to the Chatbot</h3>
<p>To make the chatbot remember previous conversations, we need to add a memory backend.<br />LangGraph provides <code>MemorySaver</code> for this purpose.</p>
<pre><code class="lang-python"><span class="hljs-keyword">from</span> langgraph.checkpoint.memory <span class="hljs-keyword">import</span> MemorySaver

memory = MemorySaver()

<span class="hljs-comment"># Compile the graph again with memory enabled</span>
graph_with_memory = graph_builder.compile(checkpointer=memory)
</code></pre>
<p>We can now run the chatbot in a threaded manner, and it will retain context:</p>
<pre><code class="lang-python"><span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">chat_with_memory</span>(<span class="hljs-params">message: str, thread_id: str</span>):</span>
    config = {<span class="hljs-string">"configurable"</span>: {<span class="hljs-string">"thread_id"</span>: thread_id}}
    initial_state = {<span class="hljs-string">"messages"</span>: [HumanMessage(content=message)]}
    result = graph_with_memory.invoke(initial_state, config)
    print(<span class="hljs-string">"🤖 Assistant:"</span>, result[<span class="hljs-string">"messages"</span>][<span class="hljs-number">-1</span>].content)

<span class="hljs-comment"># Start a conversation</span>
chat_with_memory(<span class="hljs-string">"Hi, my name is Pradip"</span>, thread_id=<span class="hljs-string">"thread-1"</span>)
chat_with_memory(<span class="hljs-string">"What's my name?"</span>, thread_id=<span class="hljs-string">"thread-1"</span>)
</code></pre>
<p>With memory in place, the assistant can now recall previous messages.<br />This forms the foundation for building multi-turn, context-aware agents.</p>
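<p>The <code>thread_id</code> is what scopes the saved state: conceptually, the checkpointer keys each conversation's history by that id, so two threads never see each other's messages. A toy sketch of the idea:</p>
<pre><code class="lang-python"># Toy sketch: a checkpointer conceptually keys saved state by thread_id
checkpoints = {}

def save_turn(thread_id, message):
    checkpoints.setdefault(thread_id, []).append(message)

save_turn("thread-1", "Hi, my name is Pradip")
save_turn("thread-1", "What's my name?")
save_turn("thread-2", "Unrelated conversation")

print(len(checkpoints["thread-1"]))  # 2
print(len(checkpoints["thread-2"]))  # 1
</code></pre>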
<p>Next, we’ll add more intelligence to the flow using routing and tools.</p>
<h2 id="heading-step-5-langgraph-agent-with-tools">🛠️ Step 5 – LangGraph Agent with Tools</h2>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1750513455175/b7ea4e76-8e68-4341-b9e7-6d2925859b8f.png" alt class="image--center mx-auto" /></p>
<p>So far, our chatbot can talk (Step 3) and remember context (Step 4). Now we want it to <strong>recognise when a tool is needed and call it automatically</strong>.</p>
<p>At a high level, we’ll add two new pieces:</p>
<ol>
<li><p><code>chatbot</code> node – decides whether it can answer directly or should call a tool.</p>
</li>
<li><p><code>tools</code> node – actually runs the requested tool‑call and passes the result back.</p>
</li>
</ol>
<p>The conversation state stays the same – a list of LangChain <code>Message</code> objects – so we just rename it to emphasise the agent role:</p>
<pre><code class="lang-python"><span class="hljs-keyword">from</span> typing <span class="hljs-keyword">import</span> Annotated, TypedDict
<span class="hljs-keyword">from</span> langgraph.graph.message <span class="hljs-keyword">import</span> add_messages
<span class="hljs-keyword">from</span> langchain_core.messages <span class="hljs-keyword">import</span> BaseMessage

<span class="hljs-class"><span class="hljs-keyword">class</span> <span class="hljs-title">AgentState</span>(<span class="hljs-params">TypedDict</span>):</span>
    <span class="hljs-string">"""State for our two‑node agent"""</span>
    messages: Annotated[list[BaseMessage], add_messages]
</code></pre>
<hr />
<h3 id="heading-1-bind-the-llm-to-our-existing-tools">1. Bind the LLM to our existing tools</h3>
<pre><code class="lang-python">llm = ChatOpenAI(model=<span class="hljs-string">"gpt-4.1-mini"</span>, temperature=<span class="hljs-number">0.7</span>)
llm_with_tools = llm.bind_tools(tools)  <span class="hljs-comment"># `tools` already contains `calculator` and `search_tool`</span>
</code></pre>
<p>Binding keeps the API exactly the same – we just swap <code>llm</code> for <code>llm_with_tools</code> when we need tool‑usage.</p>
<hr />
<h3 id="heading-2-the-chatbot-node-decide-answer-vs-tool">2. The <strong>chatbot</strong> node – decide <em>answer vs. tool</em></h3>
<pre><code class="lang-python"><span class="hljs-keyword">from</span> langchain_core.messages <span class="hljs-keyword">import</span> HumanMessage, AIMessage

<span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">chatbot_node</span>(<span class="hljs-params">state: AgentState</span>) -&gt; AgentState:</span>
    <span class="hljs-string">"""Gatekeeper: answer directly or request a tool"""</span>
    system_message = (
        <span class="hljs-string">"You are a helpful assistant.\n"</span>
        <span class="hljs-string">"Use the `duckduckgo_search` tool for real‑time facts and `calculator` for maths.\n"</span>
        <span class="hljs-string">"Otherwise answer directly."</span>
    )

    messages = [
        {<span class="hljs-string">"role"</span>: <span class="hljs-string">"system"</span>, <span class="hljs-string">"content"</span>: system_message},
        *state[<span class="hljs-string">"messages"</span>],
    ]

    response = llm_with_tools.invoke(messages)
    <span class="hljs-keyword">return</span> {<span class="hljs-string">"messages"</span>: [response]}  <span class="hljs-comment"># LangGraph merges this into the running state</span>
</code></pre>
<p><em>Key idea</em>: we embed the routing logic inside the prompt – the LLM decides whether tool calls are needed and, if so, emits a <code>tool_calls</code> entry in its JSON response.</p>
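<p>For intuition, an assistant turn that requests a tool roughly looks like the dict below (the exact shape is illustrative; LangChain parses the provider response into a <code>tool_calls</code> list for you). Routing then only has to check whether that list is present:</p>
<pre><code class="lang-python"># Illustrative (assumed) shape of an assistant turn requesting a tool
assistant_turn = {
    "role": "assistant",
    "content": "",
    "tool_calls": [
        {"name": "calculator", "args": {"expression": "0.15 * 240"}, "id": "call_1"},
    ],
}

def route(turn):
    """Send the turn to the tools node if it carries tool calls."""
    return "tools" if turn.get("tool_calls") else "end"

print(route(assistant_turn))                                 # tools
print(route({"role": "assistant", "content": "All done."}))  # end
</code></pre>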
<hr />
<h3 id="heading-3-the-tools-node-run-any-requested-toolcalls">3. The <strong>tools</strong> node – run any requested tool‑calls</h3>
<p>Instead of re‑implementing the execution loop, we reuse the pre‑built <code>ToolNode</code>:</p>
<pre><code class="lang-python"><span class="hljs-keyword">from</span> langgraph.prebuilt <span class="hljs-keyword">import</span> ToolNode

tool_node = ToolNode(tools)  <span class="hljs-comment"># automatically dispatches and streams results back</span>
</code></pre>
<hr />
<h3 id="heading-4-routing-logic">4. Routing logic</h3>
<p>We just need a small helper that checks whether the last message contains tool calls:</p>
<pre><code class="lang-python"><span class="hljs-keyword">from</span> typing <span class="hljs-keyword">import</span> Literal

<span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">should_continue</span>(<span class="hljs-params">state: AgentState</span>) -&gt; Literal["tools", "end"]:</span>
    last = state[<span class="hljs-string">"messages"</span>][<span class="hljs-number">-1</span>]
    <span class="hljs-keyword">return</span> <span class="hljs-string">"tools"</span> <span class="hljs-keyword">if</span> getattr(last, <span class="hljs-string">"tool_calls"</span>, <span class="hljs-literal">None</span>) <span class="hljs-keyword">else</span> <span class="hljs-string">"end"</span>
</code></pre>
<hr />
<h3 id="heading-5-wire-it-all-together-with-stategraph">5. Wire it all together with <code>StateGraph</code></h3>
<pre><code class="lang-python"><span class="hljs-keyword">from</span> langgraph.graph <span class="hljs-keyword">import</span> StateGraph, START, END
<span class="hljs-keyword">from</span> langgraph.checkpoint.memory <span class="hljs-keyword">import</span> MemorySaver

workflow = StateGraph(AgentState)
workflow.add_node(<span class="hljs-string">"chatbot"</span>, chatbot_node)
workflow.add_node(<span class="hljs-string">"tools"</span>,   tool_node)

workflow.add_edge(START, <span class="hljs-string">"chatbot"</span>)
workflow.add_conditional_edges(<span class="hljs-string">"chatbot"</span>, should_continue, {<span class="hljs-string">"tools"</span>: <span class="hljs-string">"tools"</span>, <span class="hljs-string">"end"</span>: END})
workflow.add_edge(<span class="hljs-string">"tools"</span>, <span class="hljs-string">"chatbot"</span>)  <span class="hljs-comment"># come back after tools run</span>

app = workflow.compile(checkpointer=MemorySaver())
</code></pre>
<blockquote>
<p><strong>Why a loop back to</strong> <code>chatbot</code>? After a tool runs we want the LLM to integrate the tool output and craft the final answer – so the graph cycles once.</p>
</blockquote>
<hr />
<h3 id="heading-6-quick-manual-test">6. Quick manual test</h3>
<pre><code class="lang-python"><span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">chat_with_agent</span>(<span class="hljs-params">msg: str, thread_id=<span class="hljs-string">"demo"</span></span>):</span>
    cfg = {<span class="hljs-string">"configurable"</span>: {<span class="hljs-string">"thread_id"</span>: thread_id}}
    state = {<span class="hljs-string">"messages"</span>: [HumanMessage(content=msg)]}
    result = app.invoke(state, cfg)
    print(result[<span class="hljs-string">"messages"</span>][<span class="hljs-number">-1</span>].content)

chat_with_agent(<span class="hljs-string">"What's 15% of 240?"</span>)
chat_with_agent(<span class="hljs-string">"Search for recent news about artificial intelligence"</span>)
</code></pre>
<p>You should see the <em>calculator</em> and <em>duckduckgo_search</em> tools being triggered automatically, followed by a neat, fully‑formed answer.</p>
<hr />
<p>That’s a self‑routing, tool‑aware agent. In the next step we’ll <strong>plug a knowledge‑base retriever into the tool‑chain</strong> and teach the agent when to switch from web search to internal RAG – bringing us one step closer to a production‑ready assistant.</p>
<h2 id="heading-step-6-langgraph-rag-agent">🔍 Step 6 – LangGraph RAG Agent</h2>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1750513305336/aa14ccd5-f220-4d5f-b7e8-2007004b5979.png" alt class="image--center mx-auto" /></p>
<blockquote>
<p><strong>Goal:</strong> Give your agent up‑to‑date, domain‑specific knowledge so it can answer beyond the LLM’s training data.</p>
<p>We’ll layer <strong>retrieval</strong>, <strong>routing</strong>, and an optional <strong>web‑search fallback</strong> on top of the tool‑enabled agent from Step 5.</p>
</blockquote>
<h3 id="heading-1-index-your-documents-once">1️⃣ Index your documents once</h3>
<pre><code class="lang-python"><span class="hljs-comment"># ── Build &amp; persist a Chroma index ────────────────────────────────</span>
<span class="hljs-keyword">from</span> pathlib <span class="hljs-keyword">import</span> Path
<span class="hljs-keyword">from</span> langchain_community.document_loaders <span class="hljs-keyword">import</span> PyPDFLoader, Docx2txtLoader
<span class="hljs-keyword">from</span> langchain_text_splitters <span class="hljs-keyword">import</span> RecursiveCharacterTextSplitter
<span class="hljs-keyword">from</span> langchain_openai <span class="hljs-keyword">import</span> OpenAIEmbeddings
<span class="hljs-keyword">from</span> langchain_community.vectorstores <span class="hljs-keyword">import</span> Chroma

SOURCE_DIR   = Path(<span class="hljs-string">"docs"</span>)          <span class="hljs-comment"># put your files here</span>
INDEX_DIR    = Path(<span class="hljs-string">"chroma_db_1"</span>)   <span class="hljs-comment"># will be created if missing</span>
EMBED_MODEL  = <span class="hljs-string">"text-embedding-3-small"</span>

<span class="hljs-comment"># Load docs (keep only pdf/docx for brevity)</span>
docs = []
<span class="hljs-keyword">for</span> f <span class="hljs-keyword">in</span> SOURCE_DIR.glob(<span class="hljs-string">"*.*"</span>):
    <span class="hljs-keyword">if</span> f.suffix == <span class="hljs-string">".pdf"</span>:
        docs += PyPDFLoader(str(f)).load()
    <span class="hljs-keyword">elif</span> f.suffix == <span class="hljs-string">".docx"</span>:
        docs += Docx2txtLoader(str(f)).load()

<span class="hljs-comment"># Split &amp; embed</span>
chunks     = RecursiveCharacterTextSplitter(chunk_size=<span class="hljs-number">1_000</span>, chunk_overlap=<span class="hljs-number">200</span>).split_documents(docs)
embeddings = OpenAIEmbeddings(model=EMBED_MODEL)

vectordb = Chroma.from_documents(
    documents         = chunks,
    embedding         = embeddings,
    persist_directory = str(INDEX_DIR),
    collection_name   = <span class="hljs-string">"kb_collection"</span>,
)
vectordb.persist()
print(<span class="hljs-string">"✅ Index built →"</span>, INDEX_DIR.resolve())
</code></pre>
<p><em>Run this once; the agent will query the saved index at runtime.</em></p>
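<p>The two numbers passed to the splitter are a window size and an overlap. The real <code>RecursiveCharacterTextSplitter</code> prefers natural boundaries (paragraphs, then sentences), but a fixed-window toy version shows what the parameters mean:</p>
<pre><code class="lang-python">def chunk_text(text, chunk_size=1000, chunk_overlap=200):
    """Toy fixed-window splitter: each chunk shares chunk_overlap
    characters with the previous one so context is not cut mid-thought."""
    step = chunk_size - chunk_overlap
    return [text[i:i + chunk_size] for i in range(0, max(len(text) - chunk_overlap, 1), step)]

chunks = chunk_text("x" * 2200, chunk_size=1000, chunk_overlap=200)
print(len(chunks))  # 3
</code></pre>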
<h3 id="heading-2-expose-a-retriever-as-a-langchain-tool">2️⃣ Expose a Retriever as a LangChain Tool</h3>
<pre><code class="lang-python">retriever = vectordb.as_retriever(search_kwargs={<span class="hljs-string">"k"</span>: <span class="hljs-number">2</span>})

<span class="hljs-meta">@tool</span>
<span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">rag_search_tool</span>(<span class="hljs-params">query: str</span>) -&gt; str:</span>
    <span class="hljs-string">"""Search the knowledge‑base for relevant chunks"""</span>
    results = retriever.invoke(query)
    <span class="hljs-keyword">return</span> <span class="hljs-string">"\n\n"</span>.join(d.page_content <span class="hljs-keyword">for</span> d <span class="hljs-keyword">in</span> results)
</code></pre>
<h3 id="heading-3-optional-fallback-realtime-web-search">3️⃣ Optional fallback → real‑time web search</h3>
<pre><code class="lang-python"><span class="hljs-keyword">from</span> langchain_tavily <span class="hljs-keyword">import</span> TavilySearch

tavily = TavilySearch(max_results=<span class="hljs-number">3</span>, topic=<span class="hljs-string">"general"</span>)

<span class="hljs-meta">@tool</span>
<span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">web_search_tool</span>(<span class="hljs-params">query: str</span>) -&gt; str:</span>
    <span class="hljs-string">"""Up‑to‑date web info via Tavily"""</span>
    <span class="hljs-keyword">return</span> <span class="hljs-string">"\n\n"</span>.join(r[<span class="hljs-string">"content"</span>] <span class="hljs-keyword">for</span> r <span class="hljs-keyword">in</span> tavily.invoke({<span class="hljs-string">"query"</span>: query})[<span class="hljs-string">"results"</span>])  <span class="hljs-comment"># simplified</span>
</code></pre>
<h3 id="heading-4-extend-the-agent-state">4️⃣ Extend the Agent State</h3>
<pre><code class="lang-python"><span class="hljs-class"><span class="hljs-keyword">class</span> <span class="hljs-title">AgentState</span>(<span class="hljs-params">State</span>):</span>          <span class="hljs-comment"># add to previous `State`</span>
    route:    str          <span class="hljs-comment"># "rag", "answer", "web", "end"</span>
    rag:      str | <span class="hljs-literal">None</span>   <span class="hljs-comment"># KB result</span>
    web:      str | <span class="hljs-literal">None</span>   <span class="hljs-comment"># web‑search snippets</span>
</code></pre>
<h3 id="heading-5-decision-execution-nodes">5️⃣ Decision / Execution Nodes</h3>
<div class="hn-table">
<table>
<thead>
<tr>
<th>Node</th><th>What it does</th></tr>
</thead>
<tbody>
<tr>
<td><strong>router_node</strong></td><td>Uses an LLM with structured output to decide the <code>route</code> – <em>rag</em>, <em>answer</em>, or <em>end</em>.</td></tr>
<tr>
<td><strong>rag_node</strong></td><td>Runs <code>rag_search_tool</code>, then asks a <em>judge</em> LLM if the chunks are <strong>sufficient</strong>. Sets <code>route</code> to <em>answer</em> or <em>web</em>.</td></tr>
<tr>
<td><strong>web_node</strong></td><td>Calls <code>web_search_tool</code> and passes snippets along.</td></tr>
<tr>
<td><strong>answer_node</strong></td><td>Crafts the final reply, combining any <code>rag</code> and/or <code>web</code> context.</td></tr>
</tbody>
</table>
</div><p>Key implementation points (condensed):</p>
<pre><code class="lang-python"><span class="hljs-comment"># ── Structured helpers ─────────────────</span>
<span class="hljs-class"><span class="hljs-keyword">class</span> <span class="hljs-title">RouteDecision</span>(<span class="hljs-params">BaseModel</span>):</span>
    route: Literal[<span class="hljs-string">"rag"</span>, <span class="hljs-string">"answer"</span>, <span class="hljs-string">"end"</span>]
    reply: str | <span class="hljs-literal">None</span> = <span class="hljs-literal">None</span>

<span class="hljs-class"><span class="hljs-keyword">class</span> <span class="hljs-title">RagJudge</span>(<span class="hljs-params">BaseModel</span>):</span>
    sufficient: bool

router_llm = ChatOpenAI(model=<span class="hljs-string">"gpt-4.1-mini"</span>, temperature=<span class="hljs-number">0</span>).with_structured_output(RouteDecision)
judge_llm  = ChatOpenAI(model=<span class="hljs-string">"gpt-4.1-mini"</span>, temperature=<span class="hljs-number">0</span>).with_structured_output(RagJudge)
answer_llm = ChatOpenAI(model=<span class="hljs-string">"gpt-4.1-mini"</span>, temperature=<span class="hljs-number">0.7</span>)

<span class="hljs-comment"># ── Router ─────────────────────────────</span>
<span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">router_node</span>(<span class="hljs-params">state: AgentState</span>) -&gt; AgentState:</span>
    q = state[<span class="hljs-string">"messages"</span>][<span class="hljs-number">-1</span>].content
    decision = router_llm.invoke([
        (<span class="hljs-string">"system"</span>, <span class="hljs-string">"Decide route: rag / answer / end"</span>),
        (<span class="hljs-string">"user"</span>, q)
    ])
    new_state = {**state, <span class="hljs-string">"route"</span>: decision.route}
    <span class="hljs-keyword">if</span> decision.route == <span class="hljs-string">"end"</span>:
        new_state[<span class="hljs-string">"messages"</span>] += [AIMessage(content=decision.reply <span class="hljs-keyword">or</span> <span class="hljs-string">"Hello!"</span>)]
    <span class="hljs-keyword">return</span> new_state

<span class="hljs-comment"># ── RAG lookup ─────────────────────────</span>
<span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">rag_node</span>(<span class="hljs-params">state: AgentState</span>) -&gt; AgentState:</span>
    q = state[<span class="hljs-string">"messages"</span>][<span class="hljs-number">-1</span>].content
    chunks = rag_search_tool.invoke(q)
    verdict = judge_llm.invoke([(<span class="hljs-string">"user"</span>, <span class="hljs-string">f"Question: <span class="hljs-subst">{q}</span>\nDocs: <span class="hljs-subst">{chunks[:<span class="hljs-number">300</span>]}</span>…"</span>)])
    <span class="hljs-keyword">return</span> {**state, <span class="hljs-string">"rag"</span>: chunks, <span class="hljs-string">"route"</span>: <span class="hljs-string">"answer"</span> <span class="hljs-keyword">if</span> verdict.sufficient <span class="hljs-keyword">else</span> <span class="hljs-string">"web"</span>}

<span class="hljs-comment"># ── Web search &amp; Answer nodes omitted for brevity (same as notebook) ──</span>
</code></pre>
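<p>The omitted nodes follow the same state-in, state-out pattern as <code>rag_node</code>. Below is a minimal sketch of what they plausibly look like; the <code>search</code> and <code>llm</code> parameters are stand-ins for <code>web_search_tool.invoke</code> and <code>answer_llm.invoke</code> (injected here so the logic can run without API keys), and the <code>Msg</code> dataclass stands in for LangChain's message objects — the actual notebook calls the globals directly:</p>

```python
from dataclasses import dataclass

# Stand-in for langchain_core.messages (HumanMessage / AIMessage).
@dataclass
class Msg:
    content: str

def web_node(state, search=None):
    """Fetch web snippets for the latest question; `search` stands in for web_search_tool.invoke."""
    q = state["messages"][-1].content
    return {**state, "web": search(q), "route": "answer"}

def answer_node(state, llm=None):
    """Compose the final reply from whatever rag/web context is available; `llm` stands in for answer_llm.invoke."""
    q = state["messages"][-1].content
    context = "\n\n".join(c for c in (state.get("rag"), state.get("web")) if c)
    prompt = f"Context:\n{context}\n\nQuestion: {q}"
    reply = llm(prompt)
    return {**state, "messages": state["messages"] + [reply]}
```

<p>Keeping every node a pure function of the state dict is what makes the graph easy to unit-test before wiring in real LLM calls.</p>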
<h3 id="heading-6-wire-up-the-graph">6️⃣ Wire up the Graph</h3>
<pre><code class="lang-python">agent_graph = StateGraph(AgentState)
agent_graph.add_node(<span class="hljs-string">"router"</span>,      router_node)
agent_graph.add_node(<span class="hljs-string">"rag_lookup"</span>,  rag_node)
agent_graph.add_node(<span class="hljs-string">"web_search"</span>,  web_node)
agent_graph.add_node(<span class="hljs-string">"answer"</span>,      answer_node)

agent_graph.set_entry_point(<span class="hljs-string">"router"</span>)
agent_graph.add_conditional_edges(<span class="hljs-string">"router"</span>, from_router,
        {<span class="hljs-string">"rag"</span>: <span class="hljs-string">"rag_lookup"</span>, <span class="hljs-string">"answer"</span>: <span class="hljs-string">"answer"</span>, <span class="hljs-string">"end"</span>: END})
agent_graph.add_conditional_edges(<span class="hljs-string">"rag_lookup"</span>, after_rag,
        {<span class="hljs-string">"answer"</span>: <span class="hljs-string">"answer"</span>, <span class="hljs-string">"web"</span>: <span class="hljs-string">"web_search"</span>})
agent_graph.add_edge(<span class="hljs-string">"web_search"</span>, <span class="hljs-string">"answer"</span>)
agent_graph.add_edge(<span class="hljs-string">"answer"</span>, END)

agent = agent_graph.compile(checkpointer=MemorySaver())
</code></pre>
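<p>The conditional-edge callbacks <code>from_router</code> and <code>after_rag</code> are referenced above but not shown. Since each node already writes its decision into <code>state["route"]</code>, both callbacks most likely just read that key back out (a sketch, assuming the <code>AgentState</code> shape from step 4️⃣):</p>

```python
def from_router(state) -> str:
    # router_node stored its decision in state["route"]; map it to the next node.
    return state["route"]

def after_rag(state) -> str:
    # rag_node set route to "answer" (chunks sufficient) or "web" (fall back to search).
    return state["route"]
```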
<h3 id="heading-7-quick-cli-test">7️⃣ Quick CLI test</h3>
<pre><code class="lang-python"><span class="hljs-keyword">if</span> __name__ == <span class="hljs-string">"__main__"</span>:
    config = {<span class="hljs-string">"configurable"</span>: {<span class="hljs-string">"thread_id"</span>: <span class="hljs-string">"thread-12"</span>}}
    <span class="hljs-keyword">while</span> <span class="hljs-literal">True</span>:
        q = input(<span class="hljs-string">"You: "</span>).strip()
        <span class="hljs-keyword">if</span> q <span class="hljs-keyword">in</span> {<span class="hljs-string">"quit"</span>, <span class="hljs-string">"exit"</span>}: <span class="hljs-keyword">break</span>
        result = agent.invoke({<span class="hljs-string">"messages"</span>: [HumanMessage(content=q)]}, config)
        print(result[<span class="hljs-string">"messages"</span>][<span class="hljs-number">-1</span>].content)
</code></pre>
<p>Now your LangGraph agent:</p>
<ul>
<li><p><strong>Routes</strong> intelligently</p>
</li>
<li><p><strong>Retrieves</strong> domain knowledge with RAG</p>
</li>
<li><p><strong>Falls back</strong> to web search when KB is insufficient</p>
</li>
<li><p><strong>Remembers</strong> multi‑turn conversations via the checkpointer</p>
</li>
</ul>
<p>In short, this is a <em>production‑ready skeleton</em> you can plug into any project.</p>
<hr />
<h2 id="heading-conclusion-amp-resources">🚀 Conclusion &amp; Resources</h2>
<p>In this tutorial we climbed the ladder from <strong>basic LLM calls</strong> ➜ <strong>tool‑aware agents</strong> ➜ <strong>memory</strong> ➜ <strong>RAG</strong> ➜ <strong>full multi‑step routing</strong> with LangGraph. You now have a production‑ready skeleton that can:</p>
<ul>
<li><p>Chat naturally across turns (memory)</p>
</li>
<li><p>Decide when to use internal knowledge vs. external tools (router)</p>
</li>
<li><p>Pull trusted data from your own docs (RAG)</p>
</li>
<li><p>Fall back to real‑time web search when the KB is lacking</p>
</li>
</ul>
<hr />
<h3 id="heading-grab-the-code">📂 Grab the code</h3>
<ul>
<li><strong>Full Notebook on GitHub:</strong> <a target="_blank" href="https://github.com/PradipNichite/Youtube-Tutorials/blob/main/RAG_AI_Agent_using_LangGraph.ipynb">LangGraph RAG Agent Notebook</a></li>
</ul>
<p>🕹 Try the live RAG Agent: <a target="_blank" href="https://agent.futuresmart.ai/">https://agent.futuresmart.ai/</a></p>
<h3 id="heading-watch-the-build-walkthrough">🎥 Watch the build walkthrough</h3>
<p><a target="_blank" href="https://youtu.be/60XDTWhklLA">https://youtu.be/60XDTWhklLA</a></p>
<hr />
<h3 id="heading-whats-next">What’s next?</h3>
<ol>
<li><p><strong>Swap in your own docs.</strong> Point the loader at your knowledge base and rebuild the index.</p>
</li>
<li><p><strong>Add streaming.</strong> LangGraph supports async generators so you can pipe partial answers to the UI.</p>
</li>
<li><p><strong>Deploy.</strong> Package the graph inside a FastAPI endpoint or a serverless function and wire up a front‑end.</p>
</li>
</ol>
<p>Got questions or improvement ideas? Drop a comment under the YouTube video – I’d love to hear how you extend this skeleton!</p>
<p>Happy building 🛠️🤖</p>
]]></content:encoded></item><item><title><![CDATA[Building a RAG System with Async FastAPI, Qdrant, Langchain and OpenAI]]></title><description><![CDATA[Introduction
In the era of advanced AI applications, Retrieval-Augmented Generation (RAG) stands out as a game-changing approach. By combining retrieval techniques with generative models, RAG enhances the quality, accuracy, and relevance of generated...]]></description><link>https://blog.futuresmart.ai/rag-system-with-async-fastapi-qdrant-langchain-and-openai</link><guid isPermaLink="true">https://blog.futuresmart.ai/rag-system-with-async-fastapi-qdrant-langchain-and-openai</guid><category><![CDATA[FastAPI]]></category><category><![CDATA[async]]></category><category><![CDATA[asynchronous]]></category><category><![CDATA[qdrant]]></category><category><![CDATA[RAG ]]></category><category><![CDATA[langchain]]></category><category><![CDATA[vector database]]></category><category><![CDATA[llm]]></category><dc:creator><![CDATA[Shreyas Dhaware]]></dc:creator><pubDate>Fri, 31 Jan 2025 06:05:24 GMT</pubDate><enclosure url="https://cdn.hashnode.com/res/hashnode/image/upload/v1737126418664/40813509-5077-4a41-a561-60a43a5cfe1f.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h2 id="heading-introduction">Introduction</h2>
<p>In the era of advanced AI applications, <a target="_blank" href="https://blog.futuresmart.ai/building-rag-applications-without-langchain-or-llamaindex"><strong>Retrieval-Augmented Generation (RAG)</strong></a> stands out as a game-changing approach. By combining retrieval techniques with generative models, RAG enhances the quality, accuracy, and relevance of generated outputs. This blog walks you through building a scalable and efficient RAG system using <strong>FastAPI</strong>, <strong>Qdrant</strong>, <strong>LangChain</strong>, and <strong>OpenAI</strong>, all while leveraging asynchronous capabilities for improved performance.</p>
<p>At <a target="_blank" href="https://www.futuresmart.ai/"><strong>FutureSmart AI</strong></a>, we are committed to pioneering innovative solutions and leveraging cutting-edge technologies. Building RAG systems with async FastAPI, Qdrant, LangChain, and OpenAI has helped us create efficient and <strong>highly scalable</strong> AI-powered applications for our clients. This blog reflects our dedication to equipping developers and organizations with the actionable knowledge needed to implement high-performance systems.</p>
<p>While this blog focuses on an on-premise setup for a hands-on approach, we can assure you from experience that these tools also support scalable cloud-based deployments, ensuring flexibility for production-ready solutions. At FutureSmart AI, we’re always exploring and refining methods to push the boundaries of what AI can achieve.</p>
<h2 id="heading-overview-of-retrieval-augmented-generation-rag"><strong>Overview of Retrieval-Augmented Generation (RAG)</strong></h2>
<p>RAG combines two essential components:</p>
<ol>
<li><p><strong>Retrieval:</strong> Finds relevant documents from a large dataset. This stage uses a search mechanism to identify the passages most relevant to the input query.</p>
</li>
<li><p><strong>Generation:</strong> Uses a language model to generate context-aware answers. Once relevant information is retrieved, a language model generates the final response by incorporating the retrieved context into the generated text.</p>
</li>
</ol>
<p>This integration empowers Retrieval-Augmented Generation (RAG) to deliver more accurate and contextually relevant responses compared to standalone Large Language Models (LLMs).</p>
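<p>To make the two stages concrete, here is a deliberately tiny, dependency-free illustration: retrieval ranks documents by word overlap with the query (production systems use vector similarity instead), and the generation step simply prepends the best match to the prompt handed to the LLM:</p>

```python
def retrieve(query, docs, k=1):
    """Toy retrieval: rank docs by how many query words they share."""
    words = set(query.lower().split())
    scored = sorted(docs, key=lambda d: len(words & set(d.lower().split())), reverse=True)
    return scored[:k]

def build_prompt(query, docs):
    """Toy generation step: retrieved context is prepended to the question."""
    context = "\n".join(retrieve(query, docs))
    return f"Answer using this context:\n{context}\n\nQuestion: {query}"

docs = [
    "Qdrant is a vector database for similarity search.",
    "FastAPI is a Python web framework.",
]
print(build_prompt("What is a vector database?", docs))
```

<p>Everything that follows in this post replaces the word-overlap scorer with real embeddings stored in Qdrant, but the retrieve-then-prompt shape stays the same.</p>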
<p>For a comprehensive understanding, explore our <a target="_blank" href="https://www.youtube.com/watch?v=38aMTXY2usU"><strong>Langchain RAG Course: From Basics to Production-Ready RAG Chatbot</strong></a> or, if you prefer reading, visit our detailed <a target="_blank" href="https://blog.futuresmart.ai/langchain-rag-from-basics-to-production-ready-rag-chatbot">Blog</a> for more insights.</p>
<h2 id="heading-the-tech-stack-what-you-need-amp-why">The Tech Stack: What You Need &amp; Why</h2>
<p>Let's break down our tools and why we chose them. Each one plays a crucial role in building a powerful RAG system.</p>
<p><strong>FastAPI</strong></p>
<p>FastAPI enables the rapid development of performant web APIs with asynchronous capabilities. Its support for Python type hints makes it developer-friendly and robust.</p>
<p>For more information, check out the <a target="_blank" href="https://youtu.be/KVdP4SpWcc4?si=ILIRI398bMPlugv6">FastAPI Tutorial</a>.</p>
<p><a target="_blank" href="https://qdrant.tech/"><strong>Qdrant</strong></a></p>
<p>Qdrant excels in high-dimensional vector storage and retrieval operations. In our enterprise implementations, it has proven invaluable for:</p>
<ul>
<li><p>Efficient management of large-scale vector datasets</p>
</li>
<li><p>Optimal performance in similarity search operations</p>
</li>
<li><p>Seamless horizontal scaling capabilities</p>
</li>
</ul>
<p>For a detailed and in-depth explanation please refer to our <a target="_blank" href="https://blog.futuresmart.ai/comprehensive-guide-to-qdrant-vector-db-installation-and-setup"><strong>Comprehensive Guide to Installing and Using Qdrant VectorDB with Docker Server and Local Setup</strong></a></p>
<p><strong>LangChain</strong></p>
<p>LangChain and its components, such as chains, prompts, and memory, enable efficient interaction with LLMs.</p>
<p><strong>OpenAI</strong></p>
<p>We will use OpenAI’s language models in this tutorial. You'll also need a basic understanding of how to send queries to OpenAI’s API and interpret responses.</p>
<h2 id="heading-prerequisites"><strong>Prerequisites</strong></h2>
<ol>
<li><strong>Create a Python Virtual Environment</strong></li>
</ol>
<p>It’s recommended to use a virtual environment to isolate your dependencies.</p>
<ol start="2">
<li><strong>Install Dependencies</strong></li>
</ol>
<p>Use the provided <code>requirements.txt</code> file to install the necessary Python packages.</p>
<pre><code class="lang-bash">pip install -r requirements.txt
</code></pre>
<ol start="3">
<li><strong>Setting up API Keys</strong></li>
</ol>
<p>To connect to external services like OpenAI and Qdrant, you need to set up API keys securely.</p>
<ol start="4">
<li><strong>Create a</strong> <code>.env</code> File</li>
</ol>
<p>Create a <code>.env</code> file in the root of your project directory to store sensitive information like API keys and configuration details.</p>
<p>Example <code>.env</code> file:</p>
<pre><code class="lang-yaml"><span class="hljs-string">OPENAI_API_KEY=your_openai_api_key</span>
<span class="hljs-string">qdrant_db_path=http://localhost:6333</span>  <span class="hljs-comment"># Replace with your Qdrant URL</span>
<span class="hljs-string">llm_provider="openai"</span>
<span class="hljs-string">model="gpt-4o-mini"</span>
<span class="hljs-string">temperature="0.1"</span>
<span class="hljs-string">chunk_size=</span><span class="hljs-number">2000</span>
<span class="hljs-string">no_of_chunks=</span><span class="hljs-number">3</span>
</code></pre>
<ol start="5">
<li><strong>Load Environment Variables</strong></li>
</ol>
<p>Use libraries like <code>python-dotenv</code> to load the <code>.env</code> file into your application.</p>
<pre><code class="lang-python"><span class="hljs-keyword">from</span> dotenv <span class="hljs-keyword">import</span> load_dotenv
<span class="hljs-keyword">import</span> os

load_dotenv()

OPENAI_API_KEY = os.getenv(<span class="hljs-string">"OPENAI_API_KEY"</span>)
qdrant_db_path = os.getenv(<span class="hljs-string">"qdrant_db_path"</span>)  <span class="hljs-comment"># matches the key defined in .env</span>
</code></pre>
<h2 id="heading-project-structure">Project Structure</h2>
<pre><code class="lang-yaml"><span class="hljs-string">services/</span>
    <span class="hljs-string">logger.py</span>
    <span class="hljs-string">pydantic_models.py</span>
<span class="hljs-string">uploads/</span>
    <span class="hljs-string">xxx.txt</span>
    <span class="hljs-string">yyy.pdf</span>
    <span class="hljs-string">zzz.docx</span>
<span class="hljs-string">utils/</span>
    <span class="hljs-string">__init__.py</span>
    <span class="hljs-string">db_utils.py</span>
    <span class="hljs-string">langchain_utils.py</span>
    <span class="hljs-string">prompts.py</span>
    <span class="hljs-string">qdrant_utils.py</span>
    <span class="hljs-string">utils.py</span>
<span class="hljs-string">.env</span>
<span class="hljs-string">api.py</span>
</code></pre>
<ul>
<li><p><code>services/:</code> This folder houses essential services that support core functionalities:</p>
<ul>
<li><p><code>logger.py:</code> Manages the logging setup for the application. Logging is critical for debugging, monitoring, and tracking the application's behavior.</p>
</li>
<li><p><code>pydantic_models.py</code>: Defines Pydantic models used for data validation and serialization. <strong>Pydantic</strong> ensures data entering the application is valid and formatted correctly.</p>
</li>
</ul>
</li>
<li><p><code>uploads/:</code> A dedicated folder for file uploads. This is where the application stores temporary or permanent files uploaded by users.</p>
</li>
<li><p><code>utils/:</code> A utility module containing helper scripts that encapsulate reusable logic:</p>
<ul>
<li><p><code>__init__.py</code>: Marks the folder as a Python package.</p>
</li>
<li><p><code>db_utils.py</code>: Contains functions for interacting with the database.</p>
</li>
<li><p><code>langchain_utils.py</code>: Provides utility functions for integrating LangChain, a framework for language model applications.</p>
</li>
<li><p><code>prompts.py</code>: Stores pre-defined prompts for interacting with language models or other AI systems.</p>
</li>
<li><p><code>qdrant_utils.py</code>: Handles operations with Qdrant, a vector search engine for similarity-based search.</p>
</li>
<li><p><code>utils.py</code>: General-purpose utility functions used across the project.</p>
</li>
</ul>
</li>
<li><p><code>.env:</code> A configuration file storing environment variables like database credentials, API keys, and other sensitive data.</p>
</li>
<li><p><code>api.py:</code> The application’s entry point, where the FastAPI app is initialized and routes are defined. This file connects all the components and exposes the API endpoints.</p>
</li>
</ul>
<h2 id="heading-setting-up-qdrant-for-efficient-retrieval"><strong>Setting Up Qdrant for Efficient Retrieval</strong></h2>
<h3 id="heading-imports-and-configuration"><strong>Imports and Configuration</strong></h3>
<pre><code class="lang-python"><span class="hljs-keyword">import</span> os
<span class="hljs-keyword">import</span> time
<span class="hljs-keyword">from</span> dotenv <span class="hljs-keyword">import</span> load_dotenv
<span class="hljs-keyword">from</span> uuid <span class="hljs-keyword">import</span> uuid4
<span class="hljs-keyword">import</span> asyncio

<span class="hljs-comment"># Langchain imports</span>
<span class="hljs-keyword">from</span> langchain.text_splitter <span class="hljs-keyword">import</span> RecursiveCharacterTextSplitter
<span class="hljs-keyword">from</span> langchain_core.documents <span class="hljs-keyword">import</span> Document
<span class="hljs-keyword">from</span> langchain_qdrant <span class="hljs-keyword">import</span> QdrantVectorStore
<span class="hljs-keyword">from</span> langchain_openai <span class="hljs-keyword">import</span> OpenAIEmbeddings

<span class="hljs-comment"># Qdrant imports</span>
<span class="hljs-keyword">from</span> qdrant_client <span class="hljs-keyword">import</span> QdrantClient, AsyncQdrantClient
<span class="hljs-keyword">from</span> qdrant_client.http.models <span class="hljs-keyword">import</span> Distance, VectorParams

<span class="hljs-keyword">from</span> services.logger <span class="hljs-keyword">import</span> logger

load_dotenv(override=<span class="hljs-literal">True</span>)

OPENAI_API_KEY = os.getenv(<span class="hljs-string">"OPENAI_API_KEY"</span>)
qdrant_db_path=os.getenv(<span class="hljs-string">"qdrant_db_path"</span>)
</code></pre>
<p>The setup begins with importing essential libraries, loading environment variables (like API keys), and initializing necessary configurations. Notable imports include LangChain's <code>RecursiveCharacterTextSplitter</code> for chunking documents and Qdrant’s async client for vector database interactions.</p>
<h3 id="heading-documentindexer-class"><strong>DocumentIndexer Class</strong></h3>
<p>The <code>DocumentIndexer</code> class handles indexing and retrieval in Qdrant. Let’s break it down step-by-step.</p>
<p><strong>Initialization</strong></p>
<pre><code class="lang-python"><span class="hljs-class"><span class="hljs-keyword">class</span> <span class="hljs-title">DocumentIndexer</span>:</span>
    <span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">__init__</span>(<span class="hljs-params">self, qdrant_db_path</span>):</span>
        self.db_path = qdrant_db_path
        self.embedding_function = OpenAIEmbeddings(model=<span class="hljs-string">"text-embedding-3-large"</span>, api_key=OPENAI_API_KEY)
        self.vector_store = <span class="hljs-literal">None</span>
        self.client = AsyncQdrantClient(self.db_path)
</code></pre>
<ul>
<li><p><code>embedding_function</code>: Uses OpenAI’s embeddings to convert text into dense vector representations.</p>
</li>
<li><p><code>client</code>: Initializes an async Qdrant client to manage the vector database.</p>
</li>
<li><p><code>vector_store</code>: Qdrant vector store is used to add documents and manage their vector representations.</p>
</li>
</ul>
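<p>Because the collection below is configured with <code>Distance.COSINE</code>, "similarity" here means the cosine of the angle between two embedding vectors. A dependency-free sketch of the metric (toy 3-dimensional vectors; the real embeddings from <code>text-embedding-3-large</code> have 3072 dimensions):</p>

```python
import math

def cosine_similarity(a, b):
    """Cosine similarity: the metric Qdrant applies when Distance.COSINE is configured."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm

query_vec = [1.0, 0.0, 1.0]
doc_vec   = [1.0, 0.0, 0.9]   # nearly parallel to the query -> score near 1
unrelated = [0.0, 1.0, 0.0]   # orthogonal to the query -> score 0
```

<p>Documents whose embeddings point in nearly the same direction as the query embedding score close to 1 and are returned first.</p>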
<p><strong>Indexing Text in Qdrant</strong></p>
<p>The method <code>index_in_qdrantdb</code> handles the extraction and indexing of document text. Here’s how it works:</p>
<pre><code class="lang-python"><span class="hljs-keyword">async</span> <span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">index_in_qdrantdb</span>(<span class="hljs-params">self, extracted_text, file_name, doc_type, chunk_size</span>):</span>
    <span class="hljs-keyword">try</span>:
        <span class="hljs-comment"># Create a Document object</span>
        doc = Document(
            page_content=extracted_text,
            metadata={
                <span class="hljs-string">"file_name"</span>: file_name,
                <span class="hljs-string">"doc_type"</span>: doc_type
            }
        )


        chunk_size = int(os.getenv(<span class="hljs-string">"chunk_size"</span>, chunk_size))  <span class="hljs-comment"># env value overrides the argument, falling back to it</span>
        logger.info(<span class="hljs-string">f"Using dynamic chunk size: <span class="hljs-subst">{chunk_size}</span>"</span>)

        <span class="hljs-comment"># Split the document</span>
        text_splitter = RecursiveCharacterTextSplitter(
            separators=[<span class="hljs-string">'\n\n'</span>, <span class="hljs-string">'\n'</span>, <span class="hljs-string">','</span>],
            chunk_size=chunk_size,
            chunk_overlap=<span class="hljs-number">200</span>
        )
        docus = text_splitter.split_documents([doc])

        <span class="hljs-comment"># Generate UUIDs for all chunks</span>
        uuids = [<span class="hljs-string">f"<span class="hljs-subst">{str(uuid4())}</span>"</span> <span class="hljs-keyword">for</span> _ <span class="hljs-keyword">in</span> range(len(docus))]
        collection = <span class="hljs-string">"rag_demo_collection"</span>

        collections = <span class="hljs-keyword">await</span> self.client.get_collections()

        <span class="hljs-keyword">if</span> collection <span class="hljs-keyword">in</span> [collection_name.name <span class="hljs-keyword">for</span> collection_name <span class="hljs-keyword">in</span> collections.collections]:
            logger.info(<span class="hljs-string">f"Collection <span class="hljs-subst">{collection}</span> already exists in QdrantDB"</span>)
        <span class="hljs-keyword">else</span>:
            <span class="hljs-keyword">await</span> self.client.create_collection(
                collection_name=collection,
                vectors_config=VectorParams(size=<span class="hljs-number">3072</span>, distance=Distance.COSINE))

        self.vector_store =  QdrantVectorStore.from_existing_collection(collection_name=collection, embedding=self.embedding_function, url=self.db_path)

        <span class="hljs-keyword">await</span> self.vector_store.aadd_documents(documents=docus, ids=uuids)

        logger.info(<span class="hljs-string">f"Successfully indexed document in QdrantDB"</span>)
        <span class="hljs-keyword">return</span> <span class="hljs-literal">True</span>

    <span class="hljs-keyword">except</span> Exception <span class="hljs-keyword">as</span> e:
        logger.error(<span class="hljs-string">f"Error indexing document in QdrantDB: <span class="hljs-subst">{e}</span>"</span>)
        <span class="hljs-keyword">raise</span>
</code></pre>
<p><strong>Key Points</strong>:</p>
<ol>
<li><p><strong>Document Creation</strong>: A <code>Document</code> object is created to hold the extracted text along with its metadata (file name and document type).</p>
</li>
<li><p><strong>Chunking of Document</strong>: The document is divided into manageable chunks using <code>RecursiveCharacterTextSplitter</code>.</p>
</li>
<li><p><strong>Collection Management</strong>: The Qdrant collection is created only if it doesn’t already exist.</p>
</li>
<li><p><strong>Batch Indexing</strong>: Chunks are added to the Qdrant database with unique UUIDs.</p>
</li>
<li><p>Using the <strong>OpenAI</strong> embedding model (<code>text-embedding-3-large</code>) to create vector representations of documents.</p>
</li>
<li><p><strong>Asynchronously</strong> uploading documents to Qdrant for similarity search.</p>
</li>
</ol>
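<p>The interplay of <code>chunk_size</code> and <code>chunk_overlap</code> is worth seeing in isolation. <code>RecursiveCharacterTextSplitter</code> additionally tries to break on the configured separators, but a plain sliding window captures the core idea (a simplified illustration, not LangChain's actual algorithm):</p>

```python
def sliding_chunks(text, chunk_size, chunk_overlap):
    """Naive splitter: fixed-size windows that each repeat the last
    chunk_overlap characters of the previous window."""
    step = chunk_size - chunk_overlap  # assumes chunk_size > chunk_overlap
    return [text[i:i + chunk_size] for i in range(0, len(text), step)]

chunks = sliding_chunks("abcdefghij" * 10, chunk_size=40, chunk_overlap=10)
# Consecutive chunks share 10 characters, so a sentence split at a
# boundary still appears with some surrounding context in both chunks.
```

<p>The overlap is what keeps retrieval robust: a fact straddling a chunk boundary is never lost entirely to either side.</p>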
<p><strong>Retrieving Documents</strong></p>
<p>To enable querying of indexed data, the <code>get_retriever</code> method returns a retriever:</p>
<pre><code class="lang-python"><span class="hljs-keyword">async</span> <span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">get_retriever</span>(<span class="hljs-params">self, top_k</span>):</span>
    <span class="hljs-keyword">try</span>:
        collection = <span class="hljs-string">"rag_demo_collection"</span>
        <span class="hljs-keyword">if</span> self.vector_store <span class="hljs-keyword">is</span> <span class="hljs-literal">None</span>:
            self.vector_store =  QdrantVectorStore.from_existing_collection(collection_name=collection, embedding=self.embedding_function, url=self.db_path)

        <span class="hljs-keyword">return</span> self.vector_store.as_retriever(search_type=<span class="hljs-string">"similarity"</span>, search_kwargs={<span class="hljs-string">"k"</span>: top_k})
    <span class="hljs-keyword">except</span> Exception <span class="hljs-keyword">as</span> e:
        logger.error(<span class="hljs-string">f"Error creating retriever: <span class="hljs-subst">{e}</span>"</span>)
        <span class="hljs-keyword">raise</span>
</code></pre>
<p><strong>Key Points</strong>:</p>
<ol>
<li><p><strong>Retriever Initialization</strong>: If the <code>vector_store</code> hasn’t been initialized yet, it is created from the existing collection.</p>
</li>
<li><p><strong>Search Parameters</strong>: Supports similarity-based searches with a configurable <code>top_k</code> parameter.</p>
</li>
</ol>
<h2 id="heading-implementing-the-asynchronous-fastapi-endpoint"><strong>Implementing the Asynchronous FastAPI Endpoint</strong></h2>
<p>Asynchronous endpoints allow the server to handle multiple requests simultaneously, which is essential for applications that process large files or perform complex computations.</p>
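<p>The payoff is easiest to see with two slow coroutines: run sequentially they would take the sum of their delays, but awaited together with <code>asyncio.gather</code> they overlap. A self-contained sketch of the principle behind async endpoints (with <code>asyncio.sleep</code> standing in for a network call such as an embedding or LLM request):</p>

```python
import asyncio
import time

async def slow_io(label, delay):
    """Pretend network call; control returns to the event loop while it waits."""
    await asyncio.sleep(delay)
    return label

async def main():
    start = time.perf_counter()
    # Both "requests" wait concurrently: total time is about max(delays), not their sum.
    results = await asyncio.gather(slow_io("req-1", 0.2), slow_io("req-2", 0.2))
    elapsed = time.perf_counter() - start
    return results, elapsed

results, elapsed = asyncio.run(main())
```

<p>FastAPI does the equivalent for every <code>async def</code> route: while one request awaits Qdrant or OpenAI, the event loop serves other requests.</p>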
<h3 id="heading-setting-up-fastapi"><strong>Setting Up FastAPI</strong></h3>
<p>The first step is initializing a FastAPI application that supports asynchronous request handling. This allows the server to process multiple incoming requests without blocking other operations, essential for high-performance APIs.</p>
<pre><code class="lang-python">app = FastAPI()
</code></pre>
<p>The application is initialized with the <code>FastAPI()</code> class, which serves as the primary entry point for defining routes and handling requests.</p>
<h3 id="heading-defining-api-routes"><strong>Defining API Routes</strong></h3>
<p>The code leverages <code>async def</code> for efficient non-blocking request handling, ensuring high performance under concurrent loads. Two main routes are implemented: the <code>/upload-knowledge</code> endpoint and the <code>/chat</code> endpoint. These routes demonstrate seamless integration of file processing, database operations, and conversational AI.</p>
<ol>
<li><p><strong>Document Ingestion Endpoint (</strong><code>/upload-knowledge</code>)</p>
<ul>
<li><p>Allows users to upload files containing knowledge documents.</p>
</li>
<li><p>Extracts text from the uploaded file and indexes it in a database for future query responses.</p>
</li>
</ul>
</li>
</ol>
<pre><code class="lang-python"><span class="hljs-meta">    @app.post("/upload-knowledge")</span>
    <span class="hljs-keyword">async</span> <span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">upload_knowledge</span>(<span class="hljs-params">
        username: str = Form(<span class="hljs-params">...</span>),
        file: Optional[UploadFile] = File(<span class="hljs-params">None</span>)
    </span>):</span>
        <span class="hljs-keyword">try</span>:
            <span class="hljs-comment"># Handle file extraction and indexing</span>
            extracted_text = <span class="hljs-string">""</span>
            <span class="hljs-keyword">if</span> file:
                logger.info(<span class="hljs-string">f"File uploaded: <span class="hljs-subst">{file.filename}</span>"</span>)
                file_content = <span class="hljs-keyword">await</span> file.read()
                file_extension = file.filename.split(<span class="hljs-string">'.'</span>)[<span class="hljs-number">-1</span>].lower()
                extracted_text = <span class="hljs-keyword">await</span> extract_text_from_file(file_content, file_extension)
                logger.info(<span class="hljs-string">f"Extracted text from file: <span class="hljs-subst">{extracted_text}</span>"</span>)
                <span class="hljs-keyword">await</span> index_documents(username, extracted_text, file.filename, file_extension)
            <span class="hljs-keyword">return</span> {<span class="hljs-string">'response'</span>: <span class="hljs-string">'Indexed Documents Successfully'</span>, <span class="hljs-string">'extracted_text'</span>: extracted_text}
        <span class="hljs-keyword">except</span> ValueError <span class="hljs-keyword">as</span> e:
            <span class="hljs-keyword">raise</span> HTTPException(status_code=<span class="hljs-number">400</span>, detail=str(e))
        <span class="hljs-keyword">except</span> Exception <span class="hljs-keyword">as</span> e:
            logger.error(<span class="hljs-string">f"Error processing indexing request: <span class="hljs-subst">{str(e)}</span>"</span>)
            <span class="hljs-keyword">raise</span> HTTPException(status_code=<span class="hljs-number">500</span>, detail=<span class="hljs-string">f"Unexpected error: <span class="hljs-subst">{e}</span>"</span>)
</code></pre>
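One caveat in the endpoint above: <code>file.filename.split('.')[-1]</code> returns the whole filename when there is no dot at all. A <code>pathlib</code>-based helper sidesteps that edge case (the name <code>get_extension</code> is illustrative, not from the original code):

```python
from pathlib import Path

def get_extension(filename: str) -> str:
    """Return the lowercase extension without the dot, or '' if there is none."""
    return Path(filename).suffix.lstrip(".").lower()

# get_extension("report.PDF")     -> "pdf"
# get_extension("notes")          -> ""   (split('.')[-1] would return "notes")
# get_extension("archive.tar.gz") -> "gz"
```

Swapping this helper into the endpoint keeps the dispatch in <code>extract_text_from_file</code> unchanged.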
<p>    <strong>File Text Extraction</strong></p>
<p>    The function <code>extract_text_from_file</code> extracts text from different file types (e.g., TXT, PDF, DOCX).</p>
<pre><code class="lang-python">    <span class="hljs-comment"># Asynchronous file text extraction</span>
    <span class="hljs-keyword">async</span> <span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">extract_text_from_file</span>(<span class="hljs-params">file_content: bytes, file_type: str</span>) -&gt; str:</span>
        <span class="hljs-string">"""
        Extract text from different file types based on the file type.
        """</span>
        <span class="hljs-keyword">if</span> file_type == <span class="hljs-string">"txt"</span>:
            <span class="hljs-keyword">return</span> <span class="hljs-keyword">await</span> extract_text_from_txt(file_content)
        <span class="hljs-keyword">elif</span> file_type == <span class="hljs-string">"pdf"</span>:
            <span class="hljs-keyword">return</span> <span class="hljs-keyword">await</span> extract_text_from_pdf(file_content)
        <span class="hljs-keyword">elif</span> file_type == <span class="hljs-string">"docx"</span>:
            <span class="hljs-keyword">return</span> <span class="hljs-keyword">await</span> extract_text_from_docx(file_content)
        <span class="hljs-keyword">else</span>:
            <span class="hljs-keyword">raise</span> HTTPException(status_code=<span class="hljs-number">400</span>, detail=<span class="hljs-string">"Unsupported file type"</span>)
</code></pre>
<p>    <strong>PDF Text Extraction</strong></p>
<p>    For PDF files, text extraction requires libraries like <code>PyPDF2</code>. Here's the asynchronous implementation:</p>
<pre><code class="lang-python">    <span class="hljs-comment"># Async version of the extract_text_from_pdf</span>
    <span class="hljs-keyword">async</span> <span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">extract_text_from_pdf</span>(<span class="hljs-params">file_content: bytes</span>) -&gt; str:</span>
        <span class="hljs-string">"""
        Extract text from a PDF file.
        """</span>
        <span class="hljs-keyword">return</span> <span class="hljs-keyword">await</span> asyncio.to_thread(extract_text_from_pdf_sync, file_content)

    <span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">extract_text_from_pdf_sync</span>(<span class="hljs-params">file_content: bytes</span>) -&gt; str:</span>
        <span class="hljs-string">"""
        Extract text from a PDF file (blocking version).
        """</span>
        content = <span class="hljs-string">""</span>
        pdf_reader = PyPDF2.PdfReader(io.BytesIO(file_content))  <span class="hljs-comment"># PdfReader expects a file-like object, not raw bytes; requires "import io"</span>
        num_pages = len(pdf_reader.pages)
        <span class="hljs-keyword">for</span> i <span class="hljs-keyword">in</span> range(num_pages):
            page = pdf_reader.pages[i]
            content += page.extract_text()
        <span class="hljs-keyword">return</span> content
</code></pre>
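The <code>asyncio.to_thread</code> wrapper above is the general pattern for calling a blocking library from async code without stalling the event loop. A minimal, dependency-free sketch of the same pattern (here <code>blocking_parse</code> is a stand-in for a parser like PyPDF2, not real extraction code):

```python
import asyncio
import time

def blocking_parse(data: bytes) -> str:
    """Stand-in for a blocking parser such as PyPDF2."""
    time.sleep(0.05)  # simulate blocking work
    return data.decode("utf-8")

async def main() -> list[str]:
    # Both parses run in worker threads; the event loop stays free
    # to serve other requests while they execute.
    results = await asyncio.gather(
        asyncio.to_thread(blocking_parse, b"page one"),
        asyncio.to_thread(blocking_parse, b"page two"),
    )
    return results

print(asyncio.run(main()))  # ['page one', 'page two']
```

Note that <code>asyncio.to_thread</code> requires Python 3.9+; on older versions, <code>loop.run_in_executor</code> achieves the same effect.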
<p>    <strong>Indexing Documents</strong></p>
<p>    The <code>index_documents</code> function stores the extracted text in a Qdrant database, optimized for vector search and similarity queries.</p>
<pre><code class="lang-python">    <span class="hljs-keyword">async</span> <span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">index_documents</span>(<span class="hljs-params">username,extracted_text,filename,file_extension</span>):</span>
        <span class="hljs-keyword">try</span>:
            indexer = DocumentIndexer(qdrant_db_path)
            start_time = time.time()
            logger.info(<span class="hljs-string">"Searching for similar documents in Qdrant..."</span>)

            <span class="hljs-keyword">await</span> indexer.index_in_qdrantdb(
                extracted_text=extracted_text,
                file_name=filename,
                doc_type=file_extension,
                chunk_size=<span class="hljs-number">1500</span>  
            )
            logger.info(<span class="hljs-string">f"Document indexing completed in <span class="hljs-subst">{time.time() - start_time:<span class="hljs-number">.2</span>f}</span> seconds"</span>)

        <span class="hljs-keyword">except</span> Exception <span class="hljs-keyword">as</span> e:
            logger.error(<span class="hljs-string">f"Error processing documents: <span class="hljs-subst">{str(e)}</span>"</span>)
            <span class="hljs-keyword">raise</span> RuntimeError(<span class="hljs-string">f"Failed to process documents: <span class="hljs-subst">{str(e)}</span>"</span>)
</code></pre>
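The <code>chunk_size=1500</code> argument suggests character-based splitting. The internals of <code>index_in_qdrantdb</code> aren't shown in this article, but a typical fixed-size splitter with overlap looks roughly like this (a sketch, not the actual DocumentIndexer implementation):

```python
def chunk_text(text: str, chunk_size: int = 1500, overlap: int = 200) -> list[str]:
    """Split text into fixed-size character chunks; neighboring chunks
    share `overlap` characters so sentences cut at a boundary still
    appear whole in at least one chunk."""
    if chunk_size <= overlap:
        raise ValueError("chunk_size must exceed overlap")
    chunks = []
    step = chunk_size - overlap
    for start in range(0, len(text), step):
        chunk = text[start:start + chunk_size]
        if chunk:
            chunks.append(chunk)
    return chunks

chunks = chunk_text("a" * 3200, chunk_size=1500, overlap=200)
# starts at 0, 1300, 2600 -> three chunks of 1500, 1500, 600 characters
```

Each chunk would then be embedded and upserted into the Qdrant collection as its own point.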
<p>    Refer to the GitHub code at the end of this article for text extraction from different sources.</p>
<ol start="2">
<li><p><strong>Chat Query Endpoint (</strong><code>/chat</code>)</p>
<ul>
<li><p>Accepts user queries and provides responses based on the ingested knowledge.</p>
</li>
<li><p>Handles previous session data to maintain conversational context.</p>
</li>
</ul>
</li>
</ol>
<pre><code class="lang-python"><span class="hljs-meta">    @app.post("/chat", response_model=ChatResponse)</span>
    <span class="hljs-keyword">async</span> <span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">chat</span>(<span class="hljs-params">request: ChatRequest</span>):</span>
        <span class="hljs-keyword">try</span>:
            <span class="hljs-comment"># Process chat request</span>
            <span class="hljs-keyword">if</span> request.session_id <span class="hljs-keyword">is</span> <span class="hljs-keyword">not</span> <span class="hljs-literal">None</span>:
                past_messages = <span class="hljs-keyword">await</span> get_past_conversation_async(request.session_id)
            <span class="hljs-keyword">else</span>:
                request.session_id = str(uuid4())
                past_messages = []

            response, refined_query, extracted_documents = <span class="hljs-keyword">await</span> generate_chatbot_response(
                request.query, past_messages, request.no_of_chunks, request.username
            )
            <span class="hljs-keyword">await</span> add_conversation_async(request.session_id, request.query, response)
            <span class="hljs-keyword">return</span> {
                <span class="hljs-string">"username"</span>: request.username,
                <span class="hljs-string">"query"</span>: request.query,
                <span class="hljs-string">"refine_query"</span>: refined_query,
                <span class="hljs-string">"response"</span>: response,
                <span class="hljs-string">"session_id"</span>: request.session_id,
            }
        <span class="hljs-keyword">except</span> ValueError <span class="hljs-keyword">as</span> e:
            <span class="hljs-keyword">raise</span> HTTPException(status_code=<span class="hljs-number">400</span>, detail=str(e))
        <span class="hljs-keyword">except</span> Exception <span class="hljs-keyword">as</span> e:
            logger.error(<span class="hljs-string">f"Error processing chat request: <span class="hljs-subst">{str(e)}</span>"</span>)
            <span class="hljs-keyword">raise</span> HTTPException(status_code=<span class="hljs-number">500</span>, detail=<span class="hljs-string">f"Unexpected error: <span class="hljs-subst">{e}</span>"</span>)
</code></pre>
<h2 id="heading-session-context-management"><strong>Session Context Management</strong></h2>
<p>    <strong>Fetching Past Conversations</strong></p>
<p>    To retain context, we retrieve past conversations from the SQLite database. Each session ID serves as a key to fetch previous interactions.</p>
<pre><code class="lang-python">    <span class="hljs-keyword">async</span> <span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">get_past_conversation_async</span>(<span class="hljs-params">session_id: str</span>) -&gt; List[dict]:</span>
        start_time = asyncio.get_event_loop().time()
        messages = []

        <span class="hljs-keyword">try</span>:
            <span class="hljs-comment"># Open an async SQLite connection</span>
            <span class="hljs-keyword">async</span> <span class="hljs-keyword">with</span> aiosqlite.connect(<span class="hljs-string">"chat_log.db"</span>) <span class="hljs-keyword">as</span> connection:
                <span class="hljs-keyword">await</span> connection.execute(<span class="hljs-string">'''CREATE TABLE IF NOT EXISTS chat_logs (
                    session_id TEXT,
                    user_query TEXT,
                    gpt_response TEXT
                )'''</span>)
                logger.info(<span class="hljs-string">"Database schema ensured."</span>)

                <span class="hljs-comment"># Fetch chat logs for the given session_id</span>
                <span class="hljs-keyword">async</span> <span class="hljs-keyword">with</span> connection.execute(
                    <span class="hljs-string">"SELECT user_query, gpt_response FROM chat_logs WHERE session_id=?"</span>, (session_id,)
                ) <span class="hljs-keyword">as</span> cursor:
                    <span class="hljs-keyword">async</span> <span class="hljs-keyword">for</span> row <span class="hljs-keyword">in</span> cursor:
                        message_user = {<span class="hljs-string">"role"</span>: <span class="hljs-string">"user"</span>, <span class="hljs-string">"content"</span>: row[<span class="hljs-number">0</span>]}
                        message_assistant = {<span class="hljs-string">"role"</span>: <span class="hljs-string">"assistant"</span>, <span class="hljs-string">"content"</span>: row[<span class="hljs-number">1</span>]}
                        messages.extend([message_user, message_assistant])

            elapsed_time = asyncio.get_event_loop().time() - start_time
            logger.info(<span class="hljs-string">f"History For Context (get_conversation): <span class="hljs-subst">{messages}</span> in <span class="hljs-subst">{elapsed_time:<span class="hljs-number">.2</span>f}</span>s"</span>)
            <span class="hljs-keyword">return</span> messages

        <span class="hljs-keyword">except</span> Exception <span class="hljs-keyword">as</span> e:
            logger.exception(<span class="hljs-string">f"Error occurred: <span class="hljs-subst">{str(e)}</span>"</span>)
            <span class="hljs-keyword">raise</span> e
</code></pre>
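The same role-alternating history format can be exercised with the stdlib's synchronous <code>sqlite3</code> driver, which is convenient for unit tests since it needs no <code>aiosqlite</code> dependency (the <code>fetch_history</code> helper below is illustrative, not from the original code):

```python
import sqlite3

def fetch_history(conn: sqlite3.Connection, session_id: str) -> list[dict]:
    """Rebuild chat history as alternating user/assistant messages,
    mirroring get_past_conversation_async."""
    messages = []
    rows = conn.execute(
        "SELECT user_query, gpt_response FROM chat_logs WHERE session_id=?",
        (session_id,),
    )
    for user_query, gpt_response in rows:
        messages.append({"role": "user", "content": user_query})
        messages.append({"role": "assistant", "content": gpt_response})
    return messages

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE chat_logs (session_id TEXT, user_query TEXT, gpt_response TEXT)")
conn.execute("INSERT INTO chat_logs VALUES (?, ?, ?)", ("s1", "Hi", "Hello!"))
history = fetch_history(conn, "s1")
# [{'role': 'user', 'content': 'Hi'}, {'role': 'assistant', 'content': 'Hello!'}]
```

The alternating user/assistant structure matches the message format most chat LLM APIs expect, so the list can be passed to the model with no further transformation.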
<p>    <strong>Adding New Conversations</strong></p>
<p>    New conversations are stored in the database after processing. This ensures the chatbot can build upon prior interactions.</p>
<pre><code class="lang-python">    <span class="hljs-keyword">async</span> <span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">add_conversation_async</span>(<span class="hljs-params">session_id, user_query, gpt_response</span>):</span>
        <span class="hljs-keyword">try</span>:
            <span class="hljs-comment"># Open an async SQLite connection</span>
            <span class="hljs-keyword">async</span> <span class="hljs-keyword">with</span> aiosqlite.connect(<span class="hljs-string">"chat_log.db"</span>) <span class="hljs-keyword">as</span> connection:  <span class="hljs-comment"># same file the history reader uses; ":memory:" would discard writes on close</span>
                cursor = <span class="hljs-keyword">await</span> connection.cursor()

                <span class="hljs-comment"># Create table if it doesn't exist</span>
                <span class="hljs-keyword">await</span> cursor.execute(<span class="hljs-string">'''CREATE TABLE IF NOT EXISTS chat_logs (
                                            session_id TEXT,
                                            user_query TEXT,
                                            gpt_response TEXT)'''</span>)

                <span class="hljs-comment"># Insert new conversation</span>
                <span class="hljs-keyword">await</span> cursor.execute(<span class="hljs-string">"INSERT INTO chat_logs (session_id, user_query, gpt_response) VALUES (?, ?, ?)"</span>,
                                    (session_id, user_query, gpt_response))

                <span class="hljs-keyword">await</span> connection.commit()
                logger.info(<span class="hljs-string">f"Conversation added for session <span class="hljs-subst">{session_id}</span>"</span>)

        <span class="hljs-keyword">except</span> Exception <span class="hljs-keyword">as</span> e:
            logger.exception(<span class="hljs-string">f"Error occurred while adding conversation: <span class="hljs-subst">{str(e)}</span>"</span>)
            <span class="hljs-keyword">raise</span> e
</code></pre>
<h3 id="heading-request-and-response-models"><strong>Request and Response Models</strong></h3>
<p>FastAPI leverages Pydantic models for robust data validation and serialization, ensuring input data adheres to the expected format. For example:</p>
<ul>
<li><p><code>ChatRequest</code> Model</p>
<ul>
<li>Defines the structure of incoming requests to the chat endpoint, including the <code>username</code>, <code>query</code>, and optional <code>session_id</code>.</li>
</ul>
</li>
</ul>
<pre><code class="lang-python">    <span class="hljs-keyword">from</span> pydantic <span class="hljs-keyword">import</span> BaseModel, field_validator
    <span class="hljs-keyword">from</span> typing <span class="hljs-keyword">import</span> Optional, List

    <span class="hljs-class"><span class="hljs-keyword">class</span> <span class="hljs-title">ChatRequest</span>(<span class="hljs-params">BaseModel</span>):</span>
        username: str
        query: str
        session_id: Optional[str] = <span class="hljs-literal">None</span>
        no_of_chunks: Optional[int] = <span class="hljs-number">3</span>
</code></pre>
<ul>
<li><p><code>ChatResponse</code> Model</p>
<ul>
<li>Specifies the format of the API response, including the query, refined query, and chatbot response.</li>
</ul>
</li>
</ul>
<pre><code class="lang-python">    <span class="hljs-class"><span class="hljs-keyword">class</span> <span class="hljs-title">ChatResponse</span>(<span class="hljs-params">BaseModel</span>):</span>
        username: str
        query: str
        refine_query: str
        response: str
        session_id: str
        debug_info: Optional[dict] = <span class="hljs-literal">None</span>
</code></pre>
<h2 id="heading-orchestrating-with-langchain"><strong>Orchestrating with LangChain</strong></h2>
<h3 id="heading-implementing-a-rag-chain"><strong>Implementing a RAG Chain</strong></h3>
<p>The RAG chain combines vector search results with prompt engineering so that the model answers from retrieved context rather than from memory alone.</p>
<pre><code class="lang-python"><span class="hljs-meta">@ls.traceable(run_type="chain", name="Chat Pipeline")</span>
<span class="hljs-keyword">async</span> <span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">generate_chatbot_response</span>(<span class="hljs-params">query, past_messages, no_of_chunks,username</span>):</span>
    <span class="hljs-string">"""Main function to generate chatbot responses asynchronously."""</span>
    logger.info(<span class="hljs-string">"Refining user query"</span>)
    refined_query = <span class="hljs-keyword">await</span> refine_user_query(query, past_messages)  <span class="hljs-comment"># Async call</span>
    logger.info(<span class="hljs-string">f"Generated refined query: <span class="hljs-subst">{refined_query}</span>"</span>)

    extracted_text_data, extracted_documents = <span class="hljs-keyword">await</span> retrieve_similar_documents(refined_query, int(no_of_chunks),username)  <span class="hljs-comment"># Async call</span>
    <span class="hljs-comment"># logger.info(f"Extracted text data: {extracted_text_data}")</span>
    logger.info(<span class="hljs-string">"Extracted text data"</span>)


    llm = initialize_llm()  <span class="hljs-comment"># Synchronous initialization</span>
    history = create_history(past_messages)
    logger.info(<span class="hljs-string">f"Created history for session: <span class="hljs-subst">{history}</span>"</span>)

    logger.info(<span class="hljs-string">"Fetching response"</span>)
    start_time = time.time()
    final_response, cb = <span class="hljs-keyword">await</span> invoke_chain(query, extracted_text_data, history, llm)  <span class="hljs-comment"># Async call</span>
    response_time = time.time() - start_time

    <span class="hljs-comment"># logger.info(f"Got response from chain: {final_response}")</span>
    logger.info(<span class="hljs-string">"Got response from chain"</span>)

    <span class="hljs-keyword">return</span> final_response, refined_query, extracted_documents  <span class="hljs-comment"># matches the three values unpacked in the /chat endpoint</span>
</code></pre>
<h3 id="heading-query-refinement">Query Refinement</h3>
<p>User queries are often ambiguous, relying on prior interactions or chat history. To address this, we design a refinement mechanism that converts user input into a standalone question that can be understood without the chat history.</p>
<pre><code class="lang-python"><span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">get_query_refiner_prompt</span>():</span>
    contextualize_q_system_prompt = (<span class="hljs-string">"""
    "Given a chat history and the latest user question "
    "which might reference context in the chat history, "
    "formulate a standalone question which can be understood "
    "without the chat history. Do NOT answer the question, "
    "just reformulate it if needed and otherwise return it as it is."
    """</span>)

    final_prompt = ChatPromptTemplate.from_messages(
        [
            (<span class="hljs-string">"system"</span>, contextualize_q_system_prompt),
            MessagesPlaceholder(variable_name=<span class="hljs-string">"messages"</span>),
            (<span class="hljs-string">"human"</span>,<span class="hljs-string">"{query}"</span>),
        ]
    )
    <span class="hljs-comment"># print(final_prompt)</span>
    <span class="hljs-keyword">return</span> final_prompt

<span class="hljs-keyword">async</span> <span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">refine_user_query</span>(<span class="hljs-params">query, messages</span>):</span>
    <span class="hljs-string">"""Refines the user query asynchronously."""</span>
    llm = ChatOpenAI(temperature=<span class="hljs-number">0</span>, model_name=<span class="hljs-string">"gpt-4o"</span>)
    history = create_history(messages)
    prompt = get_query_refiner_prompt()
    refined_query_chain = prompt | llm | StrOutputParser()
    refined_query = <span class="hljs-keyword">await</span> refined_query_chain.ainvoke({<span class="hljs-string">"query"</span>: query, <span class="hljs-string">"messages"</span>: history.messages})  <span class="hljs-comment"># Async method</span>
    <span class="hljs-keyword">return</span> refined_query
</code></pre>
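The <code>prompt | llm | StrOutputParser()</code> syntax is LangChain's LCEL pipe operator, which is essentially left-to-right function composition: the output of each runnable feeds the next. A tiny stdlib imitation (the <code>Step</code> class is illustrative, not LangChain code) shows the idea:

```python
class Step:
    """Tiny stand-in for an LCEL runnable: wraps a function and composes with |."""
    def __init__(self, fn):
        self.fn = fn
    def __or__(self, other: "Step") -> "Step":
        # (self | other) applies self first, then other.
        return Step(lambda x: other.fn(self.fn(x)))
    def invoke(self, x):
        return self.fn(x)

prompt = Step(lambda q: f"Standalone question: {q}")  # format the input
llm = Step(lambda p: p.upper())                       # pretend "model" call
parser = Step(lambda s: s.strip())                    # parse the output

chain = prompt | llm | parser
chain.invoke("what about pricing?")  # 'STANDALONE QUESTION: WHAT ABOUT PRICING?'
```

LangChain's real runnables add much more on top of this (async <code>ainvoke</code>, batching, streaming, tracing), but the composition semantics are the same.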
<h3 id="heading-document-retrieval">Document Retrieval</h3>
<p>The refined query serves as input for a retrieval mechanism. Using Qdrant, we extract contextually similar documents for subsequent processing.</p>
<pre><code class="lang-python"><span class="hljs-keyword">async</span> <span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">retrieve_similar_documents</span>(<span class="hljs-params">refined_query: str, num_of_chunks: int, username: str</span>) -&gt; tuple:</span>
    <span class="hljs-keyword">try</span>:
        indexer = DocumentIndexer(qdrant_db_path)
        start_time = time.time()
        logger.info(<span class="hljs-string">"Searching for similar documents in Qdrant..."</span>)

        <span class="hljs-keyword">if</span> num_of_chunks <span class="hljs-keyword">is</span> <span class="hljs-literal">None</span>:
            num_of_chunks = int(os.getenv(<span class="hljs-string">'no_of_chunks'</span>, <span class="hljs-number">3</span>))  <span class="hljs-comment"># env vars are strings; cast so the type check below passes</span>
        <span class="hljs-keyword">if</span> <span class="hljs-keyword">not</span> isinstance(num_of_chunks, int) <span class="hljs-keyword">or</span> num_of_chunks &lt;= <span class="hljs-number">0</span>:
            <span class="hljs-keyword">raise</span> ValueError(<span class="hljs-string">f"Invalid number of chunks: <span class="hljs-subst">{num_of_chunks}</span>"</span>)
        retriever = <span class="hljs-keyword">await</span> indexer.get_retriever(top_k=num_of_chunks)
        <span class="hljs-keyword">if</span> <span class="hljs-keyword">not</span> retriever:
            <span class="hljs-keyword">raise</span> ValueError(<span class="hljs-string">"Failed to initialize document retriever"</span>)
        extracted_documents = <span class="hljs-keyword">await</span> retriever.ainvoke(refined_query)
        <span class="hljs-keyword">if</span> <span class="hljs-keyword">not</span> extracted_documents:
            extracted_text_data=<span class="hljs-string">""</span>
        <span class="hljs-keyword">else</span>:
            extracted_text_data = <span class="hljs-keyword">await</span> format_docs(extracted_documents)
        logger.info(<span class="hljs-string">f"Document retrieval and formatting completed in <span class="hljs-subst">{time.time() - start_time:<span class="hljs-number">.2</span>f}</span> seconds"</span>)
        <span class="hljs-keyword">return</span> extracted_text_data, extracted_documents

    <span class="hljs-keyword">except</span> Exception <span class="hljs-keyword">as</span> e:
        logger.error(<span class="hljs-string">f"Error processing documents: <span class="hljs-subst">{str(e)}</span>"</span>)
        <span class="hljs-keyword">raise</span> RuntimeError(<span class="hljs-string">f"Failed to process documents: <span class="hljs-subst">{str(e)}</span>"</span>)
</code></pre>
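The <code>format_docs</code> helper called above isn't defined in this article; a common implementation simply joins each retrieved document's <code>page_content</code> with blank lines, made async here to match the call site. A sketch under that assumption (the <code>Document</code> dataclass stands in for LangChain's):

```python
import asyncio
from dataclasses import dataclass, field

@dataclass
class Document:
    """Minimal stand-in for a LangChain Document."""
    page_content: str
    metadata: dict = field(default_factory=dict)

async def format_docs(docs: list[Document]) -> str:
    """Join retrieved chunks into one context string for the prompt."""
    return "\n\n".join(doc.page_content for doc in docs)

docs = [Document("Chunk one."), Document("Chunk two.")]
context = asyncio.run(format_docs(docs))
# 'Chunk one.\n\nChunk two.'
```

The blank-line separator keeps chunk boundaries visible to the model, which helps it attribute facts to the right passage.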
<h3 id="heading-prompt-engineering"><strong>Prompt Engineering</strong></h3>
<p>Effective prompts are the backbone of any LLM-based pipeline. We design a system prompt that combines user inputs with the retrieved context.</p>
<pre><code class="lang-python"><span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">get_main_prompt</span>():</span>
    prompt = <span class="hljs-string">""" 
    "You are an assistant for question-answering tasks."
    "Use the following pieces of retrieved context and user information to answer the question."
    "If you don't find the answer to the query, just say: I don't have that information at hand. Please provide more details or check your sources."
    """</span>
    prompt = prompt + <span class="hljs-string">"\n\n"</span> + <span class="hljs-string">"{context}"</span>

    final_prompt = ChatPromptTemplate.from_messages(
    [
        (<span class="hljs-string">"system"</span>, prompt),
        MessagesPlaceholder(variable_name=<span class="hljs-string">"messages"</span>),
        (<span class="hljs-string">"human"</span>, <span class="hljs-string">"{user_query}"</span>)
    ])
    <span class="hljs-keyword">return</span> final_prompt

<span class="hljs-keyword">async</span> <span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">invoke_chain</span>(<span class="hljs-params">query, context, history, llm</span>):</span>
    <span class="hljs-string">"""Handles the streamed response asynchronously."""</span>
    logger.info(<span class="hljs-string">"Initializing chain..."</span>)
    final_chain = get_main_prompt() | llm | StrOutputParser()
    logger.info(<span class="hljs-string">"Chain initialized."</span>)
    input_data = {<span class="hljs-string">"user_query"</span>: query, <span class="hljs-string">"context"</span>: context, <span class="hljs-string">"messages"</span>: history.messages}

    <span class="hljs-keyword">with</span> get_openai_callback() <span class="hljs-keyword">as</span> cb:
        final_response = <span class="hljs-keyword">await</span> final_chain.ainvoke(input_data)  <span class="hljs-comment"># Asynchronous method</span>

    <span class="hljs-keyword">return</span> final_response, cb
</code></pre>
<p>Learn how to leverage <strong>LangChain Expression Language (LCEL)</strong> for seamless chain composition, including prompt formatting, retrieval-augmented generation (RAG), and efficient batching, with practical examples. Discover how LCEL simplifies building advanced LLM applications with features like streaming, parallelism, and async support. Check out the video below:</p>
<div class="embed-wrapper"><div class="embed-loading"><div class="loadingRow"></div><div class="loadingRow"></div></div><a class="embed-card" href="https://www.youtube.com/watch?v=NQWfvhw7OcI&amp;list=PLAMHV77MSKJ7Pn_OwuGzbDPs_MOibBRP-&amp;index=8">https://www.youtube.com/watch?v=NQWfvhw7OcI&amp;list=PLAMHV77MSKJ7Pn_OwuGzbDPs_MOibBRP-&amp;index=8</a></div>
<h2 id="heading-future-improvements"><strong>Future Improvements</strong></h2>
<ul>
<li><p><strong>Enhanced Document Preprocessing</strong>: Implement advanced text chunking methods incorporating summarization, document-based chunking, semantic and agentic chunking, and multimodal support.</p>
</li>
<li><p><strong>Storage:</strong> Migrate the SQLite chat log to <strong>async PostgreSQL</strong> for better durability across sessions, plus improved scalability and performance under larger datasets and higher user concurrency.</p>
</li>
<li><p><strong>Dynamic Few-Shot Learning</strong>: Automatically generate examples based on query type.</p>
</li>
<li><p><strong>Adaptive Retrieval</strong>: Use feedback loops to improve retrieval accuracy over time.</p>
</li>
<li><p><strong>Real-Time User Feedback</strong>: Allow users to fine-tune the response in real-time.</p>
</li>
</ul>
<h2 id="heading-conclusion"><strong>Conclusion</strong></h2>
<p>Through our extensive experience with <strong>asynchronous FastAPI</strong> and building RAG systems, we have successfully optimized every operation in the pipeline to work seamlessly in an asynchronous manner. From document ingestion and indexing in <strong>Qdrant</strong> to efficient retrieval of relevant context and history storage, we have made each operation highly efficient by adopting <strong>async-first</strong> principles.</p>
<p>By adopting this approach, developers can craft intelligent systems capable of providing contextually accurate and highly relevant responses. From document indexing to dynamic query refinement and real-time conversational AI, the RAG architecture represents a significant leap forward in harnessing the capabilities of large language models.</p>
<p>At FutureSmart AI, we specialize in delivering state-of-the-art AI solutions tailored to the unique needs of businesses. Leveraging advanced technologies such as RAG, NL2SQL, <a target="_blank" href="https://blog.futuresmart.ai/multi-agent-system-with-langgraph">multi-agent architectures</a>, LangChain, LangGraph, Qdrant, Chroma vector databases, and OpenAI, we have successfully implemented cutting-edge systems for multiple clients—from intelligent customer service automation to advanced AI-driven interview platforms.</p>
<p>If you want to leverage the power of <strong>AI Applications with asynchronous FastAPI</strong>, we’re here to help. Reach out to us at <a target="_blank" href="mailto:contact@futuresmart.ai"><strong>contact@futuresmart.ai</strong></a> and discover how our experience can translate into practical, cutting-edge solutions tailored for your needs.</p>
<div data-node-type="callout">
<div data-node-type="callout-emoji">💡</div>
<div data-node-type="callout-text"><a target="_self" href="https://github.com/PradipNichite/FutureSmart-AI-Blog/tree/main/Langchain%20RAG%20using%20Async%20Fastapi%20and%20Qdrant"><strong>Get the Full Code in our GitHub</strong></a></div>
</div>]]></content:encoded></item><item><title><![CDATA[Building an Async Similarity Search System from scratch with FastAPI and Qdrant VectorDB]]></title><description><![CDATA[Introduction
Search engines and retrieval systems have evolved to become remarkably intelligent. They no longer rely on exact keyword matches or rigid rules to find what you're looking for. Instead, they understand the context and meaning behind your...]]></description><link>https://blog.futuresmart.ai/building-an-async-similarity-search-system-from-scratch-with-fastapi-and-qdrant-vectordb</link><guid isPermaLink="true">https://blog.futuresmart.ai/building-an-async-similarity-search-system-from-scratch-with-fastapi-and-qdrant-vectordb</guid><category><![CDATA[qdrant]]></category><category><![CDATA[vector database]]></category><category><![CDATA[FastAPI]]></category><category><![CDATA[asynchronous]]></category><dc:creator><![CDATA[Pruthviraj Mahalunge]]></dc:creator><pubDate>Fri, 31 Jan 2025 05:57:48 GMT</pubDate><enclosure url="https://cdn.hashnode.com/res/hashnode/image/upload/v1738255394578/53ec63cb-8d4a-4fa1-966f-bf4e3f95c02f.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h1 id="heading-introduction">Introduction</h1>
<p>Search engines and retrieval systems have evolved to become remarkably intelligent. They no longer rely on exact keyword matches or rigid rules to find what you're looking for. Instead, they understand the <strong>context</strong> and <strong>meaning</strong> behind your query, allowing you to retrieve information or objects from a database with just a vague hint. For example, consider an e-commerce platform where users search for products. A traditional keyword-based search might struggle if the search terms do not exactly match product descriptions. However, with semantic search, a user looking for "comfortable running shoes for long-distance" can receive relevant results, even if the product descriptions contain phrases like "marathon-ready sneakers with superior cushioning.”</p>
<p>The capabilities of semantic search systems extend far beyond text-based queries. By taking it a step further, you can enable users to search not just with text, but also with images, voice, and other forms of input. <strong>Semantic search</strong> relies on <strong>vector embeddings</strong>, which capture the deeper relationships between different data points to deliver highly relevant and contextually accurate results. This makes it a powerful tool for a wide range of applications, from <strong>E-commerce</strong> and <strong>Media &amp; Entertainment</strong> to <strong>Smart Assistants</strong>. Let’s explore the key components needed to build such a system and unlock its potential across diverse industries.</p>
<h3 id="heading-implementing-a-semantic-search-system">Implementing a Semantic Search system</h3>
<p>In this blog, we’ll walk you through implementing a <strong>semantic search system</strong> using <strong>Async FastAPI</strong>, <strong>Qdrant</strong>, and <strong>Sentence-Transformers</strong> to deliver efficient, context-aware search capabilities. Among the many vector database options available, we’ve chosen <strong>Qdrant</strong> for its exceptional support for <strong>fast, asynchronous retrieval</strong>, ensuring high performance and scalability—perfect for real-time applications. Let’s get started.</p>
<h1 id="heading-technologies-well-use">Technologies We’ll Use</h1>
<p>To build this system, we’ll leverage:</p>
<ul>
<li><p><strong>Qdrant</strong>: A powerful, open-source vector database for efficient similarity searches.</p>
</li>
<li><p><strong>Sentence-Transformers</strong>: For generating high-quality embeddings from text data.</p>
</li>
<li><p><strong>Async FastAPI</strong>: To create a lightweight, asynchronous API for seamless query handling.</p>
</li>
</ul>
<h3 id="heading-qdrant">Qdrant</h3>
<p>Qdrant is an advanced vector database designed to store and search high-dimensional vector embeddings. Its scalability and efficiency make it ideal for semantic search use cases, providing powerful support for similarity-based queries. We chose Qdrant not only for its speed and accuracy but also for its async support, which reduces memory usage, serves many users simultaneously, and increases the number of concurrent requests the system can handle. You can read more about the Qdrant Async API and what it brings to the table in the official <a target="_blank" href="https://qdrant.tech/documentation/database-tutorials/async-api/">documentation</a>.</p>
<p>After thoroughly evaluating Qdrant and adopting it into <a target="_blank" href="https://futuresmart.ai"><strong>FutureSmart AI's</strong></a> workflows, we’ve seen the value it brings firsthand. We encourage you to <a target="_blank" href="https://try.qdrant.tech/high-performance-vector-search">explore Qdrant</a> further and visit their official page to learn more about their offerings and capabilities.</p>
<h3 id="heading-sentence-transformers">Sentence-Transformers</h3>
<p>Sentence-Transformers is a powerful library designed to transform text into dense vector embeddings, enabling a deeper semantic understanding of textual data. These high-dimensional vectors are essential for tasks like semantic search, similarity matching, and clustering.</p>
<p>Want to explore Sentence-Transformers in action? Check out this <a target="_blank" href="https://youtu.be/OlhNZg4gOvA?si=ICcsTZoW-9kIq8S0">YouTube tutorial</a> for a hands-on guide to real-world applications. 🚀</p>
<h3 id="heading-fastapi">FastAPI</h3>
<p>FastAPI is a cutting-edge, high-performance web framework for building APIs with Python. It leverages asynchronous programming for creating fast, scalable applications and uses Python type hints to ensure clarity, robustness, and developer productivity.</p>
<p>Want to master FastAPI? Follow this <a target="_blank" href="https://youtu.be/KVdP4SpWcc4?si=ILIRI398bMPlugv6">FastAPI tutorial</a> for step-by-step guidance on its core features and best practices. 🌟</p>
<h1 id="heading-why-use-qdrant-for-semantic-search">Why Use Qdrant for Semantic Search?</h1>
<p><strong>Qdrant</strong> is a high-performance vector database designed specifically to power semantic search systems. It allows efficient and scalable storage and retrieval of embeddings, which represent the meaning of data points as vectors.</p>
<h3 id="heading-key-advantages-of-using-qdrant">Key Advantages of Using Qdrant:</h3>
<ul>
<li><p><strong>Efficient Vector Search:</strong> 🚀 Qdrant is optimized for handling large-scale vector search queries with low latency, making it ideal for real-time semantic search applications.</p>
</li>
<li><p><strong>Scalable Architecture:</strong> 🌍 Built for horizontal scaling, Qdrant can support massive datasets while maintaining quick query responses, ensuring it handles growing data demands efficiently.</p>
</li>
<li><p><strong>Real-Time Updates:</strong> 🔄 Qdrant supports real-time indexing and updating of vector embeddings, allowing quick integration of new data points as they become available.</p>
</li>
<li><p><strong>Advanced Indexing Techniques:</strong> 🔍 Qdrant uses efficient indexing algorithms like HNSW (Hierarchical Navigable Small World), making vector searches fast even with a huge volume of data.</p>
</li>
<li><p><strong>Seamless Integration:</strong> 🔗 Qdrant integrates easily with frameworks like FastAPI, enabling smooth deployment of semantic search systems in modern applications.</p>
</li>
</ul>
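<p>For intuition about what HNSW is optimizing away, here is a small, dependency-free sketch of the exact alternative: a brute-force scan that scores every stored vector against the query with cosine similarity. This linear scan costs O(n) per query; HNSW's layered graph reaches comparable results in roughly logarithmic time at the price of being approximate. The three-dimensional vectors below are illustrative toy values, not real embeddings:</p>

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm

def brute_force_top_k(query, points, k=2):
    """Exact k-NN: score every stored vector against the query.

    This O(n) scan is what an HNSW index approximates in sub-linear time.
    """
    scored = [(pid, cosine(query, vec)) for pid, vec in points.items()]
    return sorted(scored, key=lambda s: s[1], reverse=True)[:k]

# Toy 3-dimensional "embeddings" (illustrative only)
points = {
    "sedan":      [0.9, 0.1, 0.0],
    "suv":        [0.6, 0.5, 0.2],
    "motorcycle": [0.1, 0.9, 0.2],
}
print(brute_force_top_k([0.85, 0.2, 0.05], points))
```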
<h1 id="heading-why-use-async">Why Use Async?</h1>
<p>Incorporating <strong>asynchronous processing</strong> enhances the efficiency and scalability of semantic search. By allowing multiple search queries to be processed concurrently without blocking the system, async offers significant benefits, particularly for large-scale applications.</p>
<h3 id="heading-benefits-of-async-in-semantic-search-with-qdrant">Benefits of Async in Semantic Search with Qdrant:</h3>
<ul>
<li><p><strong>High-Volume Applications:</strong> 📈 Async enables high-throughput systems, allowing efficient handling of large numbers of simultaneous search queries—critical for large-scale or enterprise-level applications.</p>
</li>
<li><p><strong>Low Latency and Seamless User Experience:</strong> ⏱️ Non-blocking asynchronous operations reduce response times, improving the user experience by minimizing delays in real-time semantic search, especially when powered by Qdrant.</p>
</li>
<li><p><strong>Optimized Resource Utilization:</strong> ⚙️ Async operations maximize system efficiency by handling multiple tasks concurrently, leveraging non-blocking I/O, and minimizing resource wastage.</p>
</li>
<li><p><strong>Native Support in FastAPI:</strong> 🌐 FastAPI’s built-in async support makes it a natural fit for building scalable, high-performance semantic search systems.</p>
</li>
</ul>
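<p>The effect is easy to demonstrate in miniature with Python's own asyncio. In the sketch below, <code>fake_search</code> is a hypothetical stand-in for an async Qdrant call that simply sleeps to simulate I/O; three queries run concurrently, so total wall time stays close to the slowest single query instead of the sum of all three:</p>

```python
import asyncio
import time

async def fake_search(query: str) -> str:
    """Stand-in for an async Qdrant call: 'waits on I/O' for 0.1 s."""
    await asyncio.sleep(0.1)
    return f"results for {query!r}"

async def main():
    start = time.perf_counter()
    # Three queries processed concurrently, not one after another
    results = await asyncio.gather(
        fake_search("sporty bikes"),
        fake_search("family SUV"),
        fake_search("fuel-efficient sedan"),
    )
    elapsed = time.perf_counter() - start
    return results, elapsed

results, elapsed = asyncio.run(main())
print(f"{len(results)} queries in {elapsed:.2f}s")  # ~0.1 s, not ~0.3 s
```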
<h1 id="heading-understanding-the-workflow">Understanding the Workflow</h1>
<ol>
<li><p>The user submits a search query via an <strong>asynchronous FastAPI endpoint</strong>.</p>
</li>
<li><p><strong>Qdrant</strong> processes the query and performs <strong>semantic similarity matching</strong> using precomputed embeddings.</p>
</li>
<li><p>Results are ranked by <strong>relevance</strong>, ensuring accurate information delivery.</p>
</li>
</ol>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1738219970526/9c80013f-eaa5-4843-a200-dd9d3232d3f7.png" alt class="image--center mx-auto" /></p>
<hr />
<h1 id="heading-prerequisites">Prerequisites</h1>
<h2 id="heading-1-system-setup">1. System Setup</h2>
<p>Before you begin, ensure that the following components are installed:</p>
<h3 id="heading-11-python">1.1 Python</h3>
<ul>
<li>Ensure you're using <strong>Python 3.8 or later</strong>.</li>
</ul>
<h3 id="heading-12-qdrant">1.2 Qdrant</h3>
<ul>
<li><p>Qdrant should be up and running, either locally or hosted on a cloud service.</p>
</li>
<li><p>If you're unfamiliar with <strong>Qdrant</strong> or need assistance with its setup, check out the in-depth guide we wrote as the first part of this series: <a target="_blank" href="https://blog.futuresmart.ai/comprehensive-guide-to-qdrant-vector-db-installation-and-setup"><strong>Comprehensive Guide to Qdrant Vector DB: Installation and Setup</strong></a>.</p>
</li>
</ul>
<h3 id="heading-13-required-libraries">1.3 Required Libraries</h3>
<ul>
<li><p>Install the necessary libraries using the following command:</p>
<pre><code class="lang-bash">  pip install fastapi uvicorn sentence-transformers qdrant-client
</code></pre>
</li>
</ul>
<hr />
<h1 id="heading-project-structure">Project Structure</h1>
<p>Here’s the project structure:</p>
<pre><code class="lang-plaintext">
semantic-search-system/
├── app/
│   ├── __init__.py
│   ├── main.py          # FastAPI application and routes
│   ├── utils.py         # Helper functions (e.g., read_and_store_data)
│   ├── qdrant_utils.py  # Qdrant client setup and collection creation
│   ├── search.py        # Semantic search helper (qdrant_search)
│   └── models.py        # Pydantic request models
├── vehicle.csv          # CSV file with vehicle data
└── requirements.txt     # List of dependencies
</code></pre>
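<p>For completeness, a <code>requirements.txt</code> consistent with the install command above could look like this (pandas is included because the data-loading step reads the CSV with it; pin exact versions as appropriate for your environment):</p>

```plaintext
fastapi
uvicorn
sentence-transformers
qdrant-client
pandas
```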
<hr />
<h1 id="heading-adding-data-to-qdrant-via-csv">Adding Data to Qdrant via CSV</h1>
<h2 id="heading-1-csv-file-preparation">1. CSV File Preparation</h2>
<p>The CSV file supplies the records we will embed and store in Qdrant for semantic search. Here is a sample of the <code>vehicle.csv</code> file, which contains the following columns:</p>
<div class="hn-table">
<table>
<thead>
<tr>
<td>id</td><td>name</td><td>category</td><td>description</td></tr>
</thead>
<tbody>
<tr>
<td>1</td><td>Toyota Camry</td><td>Sedan</td><td>A reliable and fuel-efficient car</td></tr>
<tr>
<td>2</td><td>Ford Explorer</td><td>SUV</td><td>A spacious SUV ideal for family trips</td></tr>
<tr>
<td>3</td><td>Chevrolet Silverado</td><td>Pickup Truck</td><td>A rugged and versatile pickup truck, perfect for heavy-duty tasks</td></tr>
</tbody>
</table>
</div><p>This .csv file should be saved in your project directory for easy access during data import.</p>
<h2 id="heading-2-initializing-the-qdrant-client">2. Initializing the Qdrant Client</h2>
<p>Before inserting data into Qdrant, initialize the Qdrant client with the AsyncQdrantClient from the qdrant-client library. The create_qdrant_collection function will connect to Qdrant and create a collection for your data, provided one doesn't already exist. A <strong>vector size</strong> and a <strong>distance metric</strong> (Cosine similarity in this case) need to be defined for accurate semantic search.</p>
<p><code>app/qdrant_utils.py</code>:</p>
<pre><code class="lang-python">from qdrant_client import AsyncQdrantClient
from qdrant_client.models import VectorParams, Distance

# Initialize the async Qdrant client
qdrant = AsyncQdrantClient("http://localhost:6333")

# Create the collection in the Qdrant vector database,
# skipping creation if it already exists
async def create_qdrant_collection():
    if not await qdrant.collection_exists("vehicles"):
        await qdrant.create_collection(
            collection_name="vehicles",
            vectors_config=VectorParams(size=384, distance=Distance.COSINE),
        )
</code></pre>
<ul>
<li><p><strong>Vector Size</strong>: The dimensionality of the embedding vector, which must match the output of the Sentence-Transformer model. The <code>all-MiniLM-L6-v2</code> model used here produces 384-dimensional embeddings.</p>
</li>
<li><p><strong>Cosine</strong>: A similarity measure used to compute how similar two vectors are, which is perfect for comparing semantic meaning.</p>
</li>
</ul>
<h2 id="heading-3-async-data-insertion">3. Async Data Insertion</h2>
<p>Now that we have set up the Qdrant collection, it's time to insert data from the CSV file. Each textual column in the CSV (name, category, description) is passed through the Sentence-Transformer model to create vector embeddings, which are then stored in Qdrant.</p>
<p><code>app/utils.py:</code></p>
<pre><code class="lang-python">import pandas as pd
from qdrant_client.models import PointStruct
from sentence_transformers import SentenceTransformer

# Initialize the Sentence-Transformer model globally (384-dim output)
model = SentenceTransformer("all-MiniLM-L6-v2")

def generate_vector(text: str):
    """Generate vector embeddings for a given text."""
    return model.encode(text).tolist()

async def upsert_data_to_qdrant(qdrant, df):
    """Upsert data from a DataFrame into the Qdrant vector database."""
    points = [
        PointStruct(
            id=int(row["id"]),
            vector=generate_vector(f"{row['name']} {row['category']} {row['description']}"),
            payload={"name": row["name"],
                     "category": row["category"],
                     "description": row["description"]},
        )
        for _, row in df.iterrows()
    ]
    try:
        # Insert the data into Qdrant
        await qdrant.upsert(collection_name="vehicles", points=points)
    except Exception as e:
        print(f"Error inserting data into Qdrant: {e}")
        raise

async def read_and_store_data(csv_file_path, qdrant):
    """Reads the CSV and upserts its rows into Qdrant."""
    df = pd.read_csv(csv_file_path)  # Load CSV into a DataFrame

    # Upsert to Qdrant
    await upsert_data_to_qdrant(qdrant, df)
</code></pre>
<p>This approach ensures efficient handling of large datasets with asynchronous processing. The <code>generate_vector</code> function utilizes the <code>all-MiniLM-L6-v2</code> Sentence-Transformer model to create meaningful vector embeddings by combining the name, category, and description fields. These embeddings capture the semantic meaning of each vehicle, improving search accuracy.</p>
<p>By using Qdrant's upsert operation, we prevent duplicate entries while allowing seamless data updates. Additionally, you can manage and inspect your Qdrant collection using the Qdrant dashboard, where you can view inserted data, monitor collection health, and perform advanced searches.</p>
<p>To learn more about accessing the Qdrant Web UI, refer to our previous blog: <a target="_blank" href="https://blog.futuresmart.ai/comprehensive-guide-to-qdrant-vector-db-installation-and-setup#heading-accessing-the-qdrant-web-ui">Comprehensive Guide to Qdrant Vector DB: Installation and Setup</a>.</p>
<h1 id="heading-building-the-semantic-search-api">Building the Semantic Search API</h1>
<h2 id="heading-pydantic-model">Pydantic Model</h2>
<p>Pydantic models provide a robust way to validate and serialize data, and they are integral to FastAPI's request handling. We define a <code>QueryRequest</code> model to structure incoming data for the search endpoint. This model will specify the expected query string and the limit for the number of search results:</p>
<p><code>app/models.py:</code></p>
<pre><code class="lang-python"><span class="hljs-keyword">from</span> pydantic <span class="hljs-keyword">import</span> BaseModel

<span class="hljs-class"><span class="hljs-keyword">class</span> <span class="hljs-title">QueryRequest</span>(<span class="hljs-params">BaseModel</span>):</span>
    query: str  <span class="hljs-comment"># The search query entered by the user</span>
    limit: int  <span class="hljs-comment"># The number of search results to return</span>
</code></pre>
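<p>As a quick, standalone illustration of what this model buys us (assuming only that pydantic is installed), malformed input is rejected before it ever reaches the search logic, and FastAPI turns such failures into automatic 422 responses:</p>

```python
from pydantic import BaseModel, ValidationError

class QueryRequest(BaseModel):
    query: str  # The search query entered by the user
    limit: int  # The number of search results to return

# Well-formed input parses cleanly; the string "3" is coerced to int 3
ok = QueryRequest(query="sporty bikes", limit="3")
print(ok.limit)  # 3

# A missing field raises ValidationError instead of reaching the handler
try:
    QueryRequest(query="sporty bikes")
except ValidationError as exc:
    print("rejected, missing:", exc.errors()[0]["loc"])
```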
<h2 id="heading-qdrant-search-function">Qdrant Search Function</h2>
<p>For semantic search, we leverage <strong>Qdrant</strong>, a vector database. The function generates a query vector from the input query using the SentenceTransformer model, which translates the user's query into a vector representation. We then search Qdrant for vectors similar to the query vector.</p>
<p><code>app/search.py:</code></p>
<pre><code class="lang-python">from app.qdrant_utils import qdrant
from sentence_transformers import SentenceTransformer

model = SentenceTransformer("all-MiniLM-L6-v2")

async def qdrant_search(query: str, limit: int):
    # Convert the user's query into a vector
    vector = model.encode(query).tolist()

    search_result = await qdrant.search(
        collection_name="vehicles",  # The collection we're searching in
        query_vector=vector,
        limit=limit,  # Limit the number of results returned by Qdrant
    )

    # Format the search results for consistency
    results = [{
        "id": res.id,
        "name": res.payload.get("name"),
        "category": res.payload.get("category"),
        "description": res.payload.get("description"),
        "score": res.score,  # Semantic similarity score
    } for res in search_result]

    return results
</code></pre>
<h2 id="heading-fastapi-endpoint-for-search">FastAPI Endpoint for Search</h2>
<p>Now, we implement the FastAPI endpoint that ties everything together. This endpoint listens for a POST request at <code>/semantic_search</code>, where it receives a query and a limit for the results. It fetches results from Qdrant using vector-based semantic search.</p>
<h3 id="heading-api-definition">API Definition:</h3>
<p><code>app/main.py:</code></p>
<pre><code class="lang-python">from contextlib import asynccontextmanager

from fastapi import FastAPI
from app.search import qdrant_search
from app.models import QueryRequest
from app.qdrant_utils import qdrant, create_qdrant_collection
from app.utils import read_and_store_data

# FastAPI lifespan event to initialize Qdrant on startup
@asynccontextmanager
async def async_lifespan(app: FastAPI):
    await create_qdrant_collection()  # Ensures the Qdrant collection exists
    await read_and_store_data('vehicle.csv', qdrant)  # Load initial data
    yield  # Yield control; FastAPI will now handle incoming requests

app = FastAPI(lifespan=async_lifespan)

@app.post("/semantic_search")
async def search(query_request: QueryRequest):
    # Perform semantic search in Qdrant
    results = await qdrant_search(query_request.query, query_request.limit)

    # Return the results
    return {"results": results}
</code></pre>
<h3 id="heading-how-this-works">How This Works:</h3>
<ol>
<li><p><strong>Qdrant Search</strong>: The function queries Qdrant for results based on semantic similarity using vector-based search.</p>
</li>
<li><p><strong>Return Results</strong>: The search results from Qdrant are returned, respecting the limit set by the user.</p>
</li>
</ol>
<h1 id="heading-running-and-testing-the-application">Running and Testing the Application</h1>
<p>To run the application, use <strong>Uvicorn</strong>, a fast ASGI server:</p>
<pre><code class="lang-bash">uvicorn app.main:app --reload
</code></pre>
<p>Once the server is running, you can use the <code>/semantic_search</code> endpoint to query the Qdrant database.</p>
<p>If everything is configured correctly, you will receive semantic search results similar to the example shown below.</p>
<p><strong>Demo:</strong></p>
<center>
  <iframe width="800" height="450" src="https://www.youtube.com/embed/blSUFbqYtbI?si=3EGRaaW32x9JTL-3"></iframe>
</center>

<h1 id="heading-example-query-amp-result-breakdown">Example Query &amp; Result Breakdown</h1>
<h2 id="heading-sample-query">Sample Query:</h2>
<p>Make a POST request to <code>/semantic_search</code> with the following payload:</p>
<pre><code class="lang-json">{
<span class="hljs-attr">"query"</span>: <span class="hljs-string">"sporty bikes"</span>,
<span class="hljs-attr">"limit"</span>: <span class="hljs-number">3</span>
}
</code></pre>
<h2 id="heading-expected-output">Expected Output:</h2>
<pre><code class="lang-json">{
  <span class="hljs-attr">"results"</span>: [
    {
      <span class="hljs-attr">"id"</span>: <span class="hljs-number">17</span>,
      <span class="hljs-attr">"name"</span>: <span class="hljs-string">"Ducati Monster"</span>,
      <span class="hljs-attr">"category"</span>: <span class="hljs-string">"Motorcycle"</span>,
      <span class="hljs-attr">"description"</span>: <span class="hljs-string">"A sporty motorcycle with a distinctive design and powerful performance."</span>,
      <span class="hljs-attr">"score"</span>: <span class="hljs-number">0.5491828</span>
    },
    {
      <span class="hljs-attr">"id"</span>: <span class="hljs-number">29</span>,
      <span class="hljs-attr">"name"</span>: <span class="hljs-string">"Yamaha YZF-R3"</span>,
      <span class="hljs-attr">"category"</span>: <span class="hljs-string">"Motorcycle"</span>,
      <span class="hljs-attr">"description"</span>: <span class="hljs-string">"A lightweight sport motorcycle ideal for beginners and experienced riders alike."</span>,
      <span class="hljs-attr">"score"</span>: <span class="hljs-number">0.52757794</span>
    },
    {
      <span class="hljs-attr">"id"</span>: <span class="hljs-number">9</span>,
      <span class="hljs-attr">"name"</span>: <span class="hljs-string">"Harley-Davidson Street 750"</span>,
      <span class="hljs-attr">"category"</span>: <span class="hljs-string">"Motorcycle"</span>,
      <span class="hljs-attr">"description"</span>: <span class="hljs-string">"A cruiser motorcycle with a classic design and powerful engine."</span>,
      <span class="hljs-attr">"score"</span>: <span class="hljs-number">0.47901446</span>
    }
  ]
}
</code></pre>
<h1 id="heading-advantage-with-semantic-search">Advantage with Semantic Search</h1>
<p>Unlike traditional keyword-based search, which depends on exact keyword matches, <strong>vector search</strong> retrieves results based on <strong>semantic similarity</strong>. This means the system focuses on the intent behind queries and the contextual meaning of terms, providing a more intelligent and effective search experience for modern applications.</p>
<p><strong>Example Query:</strong> "<em>sporty bikes</em>"</p>
<p><strong>Traditional Keyword Search:</strong></p>
<ul>
<li><p>Returns only results that explicitly include the word <strong>"bikes"</strong> in their name or description.</p>
</li>
<li><p>Misses relevant items such as <strong>Ducati Monster</strong>, a motorcycle not explicitly labeled as "<strong>sporty bikes.</strong>"</p>
</li>
</ul>
<p><strong>Vector Search (Semantic Approach):</strong></p>
<ul>
<li><p>Recognizes that <strong>"bikes"</strong> and <strong>"motorcycles"</strong> are semantically similar and retrieves items that are contextually related, like <strong>sporty motorcycles</strong>, even when the word "bikes" is not directly mentioned.</p>
</li>
<li><p>The system also ranks results using <strong>relevance scores</strong>, indicating how closely each result aligns with the intent behind the query.</p>
</li>
</ul>
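<p>The contrast can be reproduced in a few lines of pure Python. The catalogue entries and the tiny hand-crafted "embeddings" below are illustrative stand-ins for real model output, but the mechanics are the same: the keyword scan misses the Ducati entirely, while the vector comparison ranks it first:</p>

```python
catalogue = {
    "Ducati Monster": "A sporty motorcycle with powerful performance.",
    "Toyota Camry": "A reliable and fuel-efficient car.",
}

def keyword_search(query: str) -> list[str]:
    """Return items whose description contains every query word."""
    words = query.lower().split()
    return [name for name, desc in catalogue.items()
            if all(w in desc.lower() for w in words)]

# Hand-crafted toy vectors: (two-wheeled, sporty) -- NOT real embeddings
embeddings = {
    "Ducati Monster": (0.9, 0.8),
    "Toyota Camry": (0.1, 0.2),
}
query_vec = (0.8, 0.9)  # pretend embedding of "sporty bikes"

def vector_search(qv) -> list[str]:
    """Rank items by dot-product similarity to the query vector."""
    score = lambda v: sum(a * b for a, b in zip(qv, v))
    return sorted(embeddings, key=lambda n: score(embeddings[n]), reverse=True)

print(keyword_search("sporty bikes"))  # [] -- "bikes" never appears
print(vector_search(query_vec)[0])     # Ducati Monster
```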
<h2 id="heading-comparison-of-search-approaches">Comparison of Search Approaches</h2>
<div class="hn-table">
<table>
<thead>
<tr>
<td><strong>Search Type</strong></td><td><strong>Search Process</strong></td><td><strong>Result Quality</strong></td></tr>
</thead>
<tbody>
<tr>
<td><strong>Keyword-Based (Traditional)</strong></td><td>Matches only exact terms in queries and results</td><td>Limited, misses semantically related results</td></tr>
<tr>
<td><strong>Vector Search (Semantic)</strong></td><td>Focuses on contextual and semantic meaning</td><td>Relevant, context-aware results</td></tr>
</tbody>
</table>
</div><h1 id="heading-improvements">Improvements</h1>
<ul>
<li><p>You can enhance the performance of the semantic search system by fine-tuning the embedding model on domain-specific data to improve result relevance.</p>
</li>
<li><p>You can experiment with multimodal search, allowing for querying with images, voice, or other media types, expanding the search functionality.</p>
</li>
<li><p>Adding filtering capabilities would allow for more precise search results, enabling users to refine searches based on metadata like date ranges.</p>
</li>
</ul>
<h1 id="heading-conclusion">Conclusion</h1>
<p>Through this step-by-step guide, you've learned how to build a semantic search system with FastAPI and Qdrant, integrating modern vector-based search techniques to create an efficient and scalable solution. By leveraging embeddings, you can enhance search relevance and deliver more accurate results for your applications.</p>
<p>But this is just the beginning. As you continue exploring Qdrant, consider diving into its advanced capabilities, such as hybrid search and large language model integrations, to further refine your search experience.</p>
<p>At <strong>FutureSmart AI</strong>, we specialize in developing AI-driven solutions tailored to your business needs, from semantic search to custom AI Solutions. If you're looking for expert guidance or custom AI solutions, reach out to us at <strong>contact@futuresmart.ai</strong>.</p>
<p>Let’s build the future of AI together! 🚀</p>
]]></content:encoded></item><item><title><![CDATA[Comprehensive guide to Qdrant Vector DB: Installation and Setup]]></title><description><![CDATA[Introduction:
In the era of AI-driven applications and unstructured data management, vector databases have become essential tools for enabling semantic search and similarity matching. Qdrant is one such cutting-edge, open-source vector database desig...]]></description><link>https://blog.futuresmart.ai/comprehensive-guide-to-qdrant-vector-db-installation-and-setup</link><guid isPermaLink="true">https://blog.futuresmart.ai/comprehensive-guide-to-qdrant-vector-db-installation-and-setup</guid><category><![CDATA[semantic search]]></category><category><![CDATA[vector database]]></category><category><![CDATA[qdrant]]></category><dc:creator><![CDATA[Shreyas Dhaware]]></dc:creator><pubDate>Fri, 24 Jan 2025 13:56:00 GMT</pubDate><enclosure url="https://cdn.hashnode.com/res/hashnode/image/upload/v1736763417210/95bc05d9-5e54-4a33-9497-ea12f12a15a4.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h2 id="heading-introduction"><strong>Introduction:</strong></h2>
<p>In the era of AI-driven applications and unstructured data management, <strong>vector databases</strong> have become essential tools for enabling semantic search and similarity matching. <a target="_blank" href="https://qdrant.tech/"><strong>Qdrant</strong></a> is one such cutting-edge, <strong>open-source vector database</strong> designed to simplify working with high-dimensional data embeddings. Whether you're building a recommendation system, integrating semantic search, or powering an AI chatbot, Qdrant makes it seamless to store, query, and manage vector embeddings at scale.</p>
<p>At <a target="_blank" href="https://futuresmart.ai">FutureSmart AI</a>, we pride ourselves on staying adaptable and embracing new technologies. If a solution has great potential and delivers value, we’re always eager to explore and adopt it. After testing Qdrant ourselves and seeing its impressive results, we’ve prepared this comprehensive guide to walk you through its <strong>installation and setup on Docker and locally</strong>. While the blog primarily focuses on on-premise deployment, it’s worth noting that Qdrant also offers a robust <strong>cloud platform</strong> that simplifies scaling and management for production use cases.</p>
<h3 id="heading-what-is-a-vector-database"><strong>What is a Vector Database</strong></h3>
<p>Let us first understand what a vector database is and what advantages it has over traditional databases. In simple terms, a <strong>vector database</strong> is a specialized database designed to store and work with <strong>vector embeddings</strong>. Vector embeddings are numerical representations of data that capture its meaning, features, or relationships. These embeddings are often generated by AI models and are used to process <strong>unstructured data</strong> like text, images, audio, or videos.</p>
<p>Unlike traditional relational databases, which rely on exact matches (like finding a name or ID), <strong>vector databases</strong> focus on finding <strong>similarity</strong> between data points, even when the input isn’t identical. This enables <strong>semantic search</strong>, advanced similarity retrieval, and <strong>unstructured data management</strong>.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1736753931814/6c185e57-0c30-491c-89d7-bdc49ade8842.png" alt="Unstructured data management with vector embeddings" class="image--center mx-auto" /></p>
<p>The image above shows how similarity search can find relevant or similar objects even when they are structurally completely different; it works just as well on JSON objects as on plain text. That’s the power of similarity search, or vector search. You can try out this <a target="_blank" href="https://aidemos.com/ai-playground/semantic-similarity-checker">similarity checker</a> yourself at the AI Demos Playground.</p>
<h3 id="heading-what-is-qdrant"><strong>What is Qdrant</strong></h3>
<p>Among the top <strong>open-source vector databases</strong>, <strong>Qdrant</strong> stands out as a powerful, Rust-based <strong>vector database</strong> and similarity search engine. It supports seamless <strong>integration with LangChain</strong> for building sophisticated AI solutions. It offers robust performance, a user-friendly API, and support for Python. With its use of indexing, Qdrant delivers both speed and precision, making it a competitive choice for modern applications.</p>
<p>Key Features of Qdrant:</p>
<ol>
<li><p><strong>HNSW Indexing</strong>: Uses the <strong>Hierarchical Navigable Small World (HNSW)</strong> algorithm for fast and accurate similarity searches.</p>
</li>
<li><p><strong>Distance Metrics</strong>: Supports <strong>Cosine Similarity</strong>, <strong>Dot Product</strong>, and <strong>Euclidean Distance</strong> for flexible vector search.</p>
</li>
<li><p><strong>Free Vector Database</strong>: Qdrant is <strong>open-source</strong> and perfect for creating a <strong>local vector database</strong>.</p>
</li>
<li><p><strong>Multi-Language APIs</strong>: Offers APIs for <strong>Python</strong>, <strong>JavaScript/TypeScript</strong>, <strong>Rust</strong>, and <strong>Go</strong>, ensuring smooth integration with various tech stacks.</p>
</li>
<li><p><strong>Recommendation API</strong>: Includes a built-in API for creating efficient recommendation systems.</p>
</li>
<li><p><strong>Scalability and Production-Ready</strong>: Designed for real-world applications, it scales to handle millions or billions of vectors seamlessly.</p>
</li>
<li><p><strong>Hybrid Compatibility</strong>: Works well with databases like <strong>PostgreSQL</strong> for blending relational and vector data.</p>
</li>
</ol>
<h2 id="heading-semantic-vs-vector-search">Semantic vs. Vector Search</h2>
<p><strong>Semantic Search</strong>: Interprets user intent and context to deliver relevant results beyond simple keyword matching. It utilizes vector search to provide more accurate and contextually relevant outcomes.</p>
<p><strong>Vector Search</strong>: Transforms text into vectors representing semantic meaning, enabling rapid similarity comparisons within large datasets.</p>
<p>In essence, vector search serves as a foundational component of semantic search, facilitating the understanding and retrieval of information based on meaning rather than mere keywords.</p>
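<p>Both build on a similarity metric between embedding vectors. A minimal, self-contained illustration of cosine similarity with toy 3-dimensional vectors (real embeddings typically have hundreds of dimensions; the vectors below are made up purely for demonstration):</p>

```python
import math

def cosine_similarity(a, b):
    # cos(theta) = (a . b) / (|a| * |b|)
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

# Toy vectors standing in for embeddings of three texts
v_movie = [0.9, 0.1, 0.0]
v_film = [0.8, 0.2, 0.1]
v_invoice = [0.0, 0.1, 0.9]

print(cosine_similarity(v_movie, v_film))     # high: related meaning
print(cosine_similarity(v_movie, v_invoice))  # low: unrelated meaning
```

<p>A vector database like Qdrant performs exactly this comparison, but over millions of vectors using approximate-nearest-neighbor indexes such as HNSW instead of a brute-force loop.</p>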
<h2 id="heading-setting-up-the-environment"><strong>Setting Up the Environment:</strong></h2>
<p>We'll cover two approaches to using Qdrant:</p>
<ol>
<li><p><strong>Local Setup</strong>: Create a <strong>local LangChain vector database</strong> for prototyping.</p>
</li>
<li><p><strong>Docker-based Server</strong>: Use a containerized approach for scalability and production.</p>
</li>
</ol>
<h3 id="heading-installing-qdrant-using-docker"><strong>Installing Qdrant Using Docker:</strong></h3>
<ul>
<li><strong>Pull the Qdrant Docker Image:</strong></li>
</ul>
<pre><code class="lang-bash">docker pull qdrant/qdrant
</code></pre>
<ul>
<li><strong>Run the Qdrant Container:</strong></li>
</ul>
<pre><code class="lang-bash">docker run -p 6333:6333 -p 6334:6334 -v <span class="hljs-string">"${PWD}/qdrant_storage:/qdrant/storage:z"</span> qdrant/qdrant
</code></pre>
<p>This Docker command runs the Qdrant container, exposing ports <code>6333</code> (REST API) and <code>6334</code> (gRPC API), while mapping the host directory <code>${PWD}/qdrant_storage</code> to <code>/qdrant/storage</code> inside the container to persist data.</p>
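<p>Before connecting a client, it can be useful to confirm that the container is reachable. The sketch below probes the REST endpoint with only the Python standard library; it assumes, as current Qdrant versions do, that the root endpoint returns a small JSON document with a <code>title</code> field:</p>

```python
import json
import urllib.request
import urllib.error

def qdrant_is_up(base_url="http://localhost:6333", timeout=3):
    """Return True if a Qdrant REST endpoint answers at base_url."""
    try:
        with urllib.request.urlopen(base_url, timeout=timeout) as resp:
            info = json.load(resp)
            # The root endpoint reports the server title and version
            return "qdrant" in str(info.get("title", "")).lower()
    except (urllib.error.URLError, OSError, ValueError):
        return False

print(qdrant_is_up())  # True once the container from the command above is running
```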
<h3 id="heading-python-client"><strong>Python Client</strong>:</h3>
<p>To create a <strong>vector database in Python</strong>, start by installing the Python client:</p>
<pre><code class="lang-bash">pip install "qdrant-client[fastembed]"
</code></pre>
<h2 id="heading-securing-your-qdrant-instance"><strong>Securing Your Qdrant Instance:</strong></h2>
<p>When running the Qdrant container, you can enable API key authentication by setting specific environment variables. This is an <strong>Optional Step</strong>, but it helps secure your <strong>Qdrant vector database</strong> from unauthorized requests, ensuring only trusted clients can access it.</p>
<ul>
<li><code>QDRANT__SERVICE__API_KEY</code>: Set this to the desired API key.</li>
</ul>
<p>Here’s the command:</p>
<pre><code class="lang-bash">docker run -d  -p 6333:6333  -e QDRANT__SERVICE__API_KEY=your-api-key-here -v <span class="hljs-string">"<span class="hljs-variable">${PWD}</span>/qdrant_storage:/qdrant/storage:z"</span>  qdrant/qdrant
</code></pre>
<p>Replace <code>your-api-key-here</code> with a strong API key of your choice.</p>
<h2 id="heading-understanding-the-data">Understanding the Data</h2>
<ul>
<li>The JSON data represents movie entries, each including key details like the name, description, director, and release year. In Qdrant, each entry will be stored as a <strong>point</strong>, where the <strong>vector</strong> is derived from the movie description (e.g., an embedding for semantic search), and the additional fields like director, year, and name are stored as <strong>metadata</strong> or <strong>payload</strong> for enriched querying and context.</li>
</ul>
<pre><code class="lang-json">[
    {
        <span class="hljs-attr">"name"</span>: <span class="hljs-string">"Sholay"</span>,
        <span class="hljs-attr">"description"</span>: <span class="hljs-string">"Two ex-convicts are hired by a retired policeman to capture a ruthless dacoit terrorizing a village."</span>,
        <span class="hljs-attr">"director"</span>: <span class="hljs-string">"Ramesh Sippy"</span>,
        <span class="hljs-attr">"year"</span>: <span class="hljs-number">1975</span>
    },
    {
        <span class="hljs-attr">"name"</span>: <span class="hljs-string">"Lagaan"</span>,
        <span class="hljs-attr">"description"</span>: <span class="hljs-string">"Villagers unite to play a cricket match against British officers to abolish oppressive taxes."</span>,
        <span class="hljs-attr">"director"</span>: <span class="hljs-string">"Ashutosh Gowariker"</span>,
        <span class="hljs-attr">"year"</span>: <span class="hljs-number">2001</span>
    }
        ...
        ...
]
</code></pre>
<h2 id="heading-loading-the-data">Loading the Data</h2>
<pre><code class="lang-python"><span class="hljs-keyword">import</span> json
<span class="hljs-keyword">import</span> os

<span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">read_files_from_folder</span>(<span class="hljs-params">folder_path</span>):</span>
    file_data = []

    <span class="hljs-keyword">for</span> file_name <span class="hljs-keyword">in</span> os.listdir(folder_path):
        <span class="hljs-keyword">if</span> file_name.endswith(<span class="hljs-string">".json"</span>):
            <span class="hljs-keyword">with</span> open(os.path.join(folder_path, file_name), <span class="hljs-string">'r'</span>) <span class="hljs-keyword">as</span> file:
                content = json.load(file)
                file_data.append({<span class="hljs-string">"file_name"</span>: file_name, <span class="hljs-string">"content"</span>: content})

    <span class="hljs-keyword">return</span> file_data

folder_path = <span class="hljs-string">"data"</span>
file_data = read_files_from_folder(folder_path)
</code></pre>
<p>Then we create separate lists for documents, metadata, and IDs, which we will add to our collection.</p>
<pre><code class="lang-python">documents = []
metadatas = []
ids = []

<span class="hljs-keyword">import</span> uuid 

<span class="hljs-keyword">for</span> file_index, data <span class="hljs-keyword">in</span> enumerate(file_data):
    context = data[<span class="hljs-string">"content"</span>]
    documents.extend(movie[<span class="hljs-string">"description"</span>] <span class="hljs-keyword">for</span> movie <span class="hljs-keyword">in</span> context)
    metadatas.extend(
        {**{key: value <span class="hljs-keyword">for</span> key, value <span class="hljs-keyword">in</span> movie.items() <span class="hljs-keyword">if</span> key != <span class="hljs-string">"description"</span>}, <span class="hljs-string">"source"</span>: data[<span class="hljs-string">"file_name"</span>]}
        <span class="hljs-keyword">for</span> movie <span class="hljs-keyword">in</span> context
    )
    ids.extend(str(uuid.uuid4()) <span class="hljs-keyword">for</span> _ <span class="hljs-keyword">in</span> context)  <span class="hljs-comment"># Generate a unique UUID for each movie</span>
</code></pre>
<pre><code class="lang-python">documents = [<span class="hljs-string">'Two ex-convicts are hired by a retired policeman to capture a ruthless dacoit terrorizing a village.'</span>,
...]

metadatas = [{<span class="hljs-string">'name'</span>: <span class="hljs-string">'Sholay'</span>,
 <span class="hljs-string">'director'</span>: <span class="hljs-string">'Ramesh Sippy'</span>,
 <span class="hljs-string">'year'</span>: <span class="hljs-number">1975</span>,
 <span class="hljs-string">'source'</span>: <span class="hljs-string">'movies.json'</span>},
...]

ids = [<span class="hljs-string">'7aa5f6f5-5de5-4776-8090-9a7c38a4cfcf'</span>,
...]
</code></pre>
<h2 id="heading-initializing-qdrant-client-through-local-or-docker">Initializing the Qdrant Client: Local or Docker</h2>
<h3 id="heading-setting-up-a-local-in-memory-qdrant-instance">Setting Up a Local In-Memory Qdrant Instance:</h3>
<p>This approach is ideal for development, prototyping, or testing without the need for a running server.</p>
<pre><code class="lang-python"><span class="hljs-keyword">from</span> qdrant_client <span class="hljs-keyword">import</span> QdrantClient, models
<span class="hljs-comment"># Initialize the local client</span>
qdrant = QdrantClient(<span class="hljs-string">":memory:"</span>)  <span class="hljs-comment"># or QdrantClient(path="path/to/db")</span>
</code></pre>
<h3 id="heading-connecting-to-a-qdrant-server-via-docker">Connecting to a Qdrant Server via Docker:</h3>
<p>Ensure that your Qdrant server is running, typically accessible at <code>http://localhost:6333</code>.</p>
<pre><code class="lang-python"><span class="hljs-keyword">from</span> qdrant_client <span class="hljs-keyword">import</span> QdrantClient, models
<span class="hljs-comment"># Connect to Docker server Qdrant instance</span>
qdrant = QdrantClient(<span class="hljs-string">"http://localhost:6333"</span>, api_key=<span class="hljs-string">"your-api-key-here"</span>)
</code></pre>
<blockquote>
<p>You can choose between running Qdrant in a Docker container or using the local in-memory setup, depending on your specific requirements and environment. The code that follows is the same for both approaches; only the client initialization differs.</p>
</blockquote>
<h2 id="heading-adding-documents"><strong>Adding Documents</strong></h2>
<p><strong>Inserting Data into Your Collection:</strong></p>
<p>The same call works whether <code>qdrant</code> points to the Docker server or the local in-memory client.</p>
<pre><code class="lang-python"><span class="hljs-comment"># Use the new add method</span>
qdrant.add(
    collection_name=<span class="hljs-string">"new_movie_collection"</span>,
    documents=documents,
    metadata=metadatas
)
</code></pre>
<pre><code class="lang-python"><span class="hljs-comment"># Output: list of IDs of the added documents; generated randomly if none are provided</span>
[<span class="hljs-string">'d8984772ed664b2b8e2da23b0660989c'</span>,...,<span class="hljs-string">'fb32ac5bc5a74e48ba0a8fbb578fedfe'</span>]
</code></pre>
<p>This method adds text documents to a Qdrant collection, creating the collection with the default vector configuration if it doesn’t exist. Documents are embedded using the default embedding model. If you need custom embeddings or vectors, we will cover a different approach shortly.</p>
<h2 id="heading-querying"><strong>Querying</strong></h2>
<p><strong>Performing Vector Searches:</strong></p>
<pre><code class="lang-python">search_result = qdrant.query(
    collection_name=<span class="hljs-string">"movie_collection"</span>,
    query_text=<span class="hljs-string">"for adults"</span>,
    limit = <span class="hljs-number">1</span>
)
print(search_result)
</code></pre>
<pre><code class="lang-python"><span class="hljs-comment"># List of QueryResponse object</span>
[QueryResponse(id=<span class="hljs-string">'96d0e002-bc96-47a5-9307-16e0924ff9f6'</span>, embedding=<span class="hljs-literal">None</span>, sparse_embedding=<span class="hljs-literal">None</span>, metadata={<span class="hljs-string">'document'</span>: <span class="hljs-string">'The heartwarming tale of a mute and deaf man and his relationships with two women.'</span>, <span class="hljs-string">'name'</span>: <span class="hljs-string">'Barfi!'</span>, <span class="hljs-string">'director'</span>: <span class="hljs-string">'Anurag Basu'</span>, <span class="hljs-string">'year'</span>: <span class="hljs-number">2012</span>}, document=<span class="hljs-string">'The heartwarming tale of a mute and deaf man and his relationships with two women.'</span>, score=<span class="hljs-number">0.76938045</span>)]
</code></pre>
<p>In Qdrant, a <code>QueryResponse</code> represents the outcome of a search query, typically including the point ID, document, similarity score, associated payload (metadata), and optionally the vector embedding. By default, the vector data is not included in the response; to include it, set the parameter <code>with_vector</code> to <code>True</code>.</p>
<h2 id="heading-using-a-different-embedding-model"><strong>Using a Different Embedding Model</strong></h2>
<p>Qdrant’s default embedding model is <strong>BAAI/bge-small-en</strong>, used for generating vector embeddings. You can use <strong>open-source embedding models</strong> like <code>all-MiniLM-L6-v2</code> for custom embeddings:</p>
<p>To use a different model when creating a new collection, you need to specify its configuration explicitly, including the embedding model's details, during the collection setup in the Qdrant API or dashboard.</p>
<pre><code class="lang-python"><span class="hljs-keyword">from</span> sentence_transformers <span class="hljs-keyword">import</span> SentenceTransformer

encoder = SentenceTransformer(<span class="hljs-string">"all-MiniLM-L6-v2"</span>)
</code></pre>
<p>This resource provides a <a target="_blank" href="https://www.youtube.com/watch?v=OlhNZg4gOvA">comprehensive guide on implementing sentence embeddings</a>, similarity measures, semantic search, and clustering using Sentence Transformers.</p>
<h3 id="heading-create-a-collection"><strong>Create a Collection:</strong></h3>
<pre><code class="lang-python">qdrant.create_collection(
    collection_name=<span class="hljs-string">"my_movies"</span>,
    vectors_config=models.VectorParams(
        size=encoder.get_sentence_embedding_dimension(),  <span class="hljs-comment"># Vector size is defined by used model</span>
        distance=models.Distance.COSINE,
    ),
)
</code></pre>
<h3 id="heading-format-data">Format Data</h3>
<pre><code class="lang-python">documents = []
metadatas = []
ids = []

<span class="hljs-keyword">import</span> uuid 

<span class="hljs-keyword">for</span> file_index, data <span class="hljs-keyword">in</span> enumerate(file_data):
    context = data[<span class="hljs-string">"content"</span>]
    documents.extend(movie[<span class="hljs-string">"description"</span>] <span class="hljs-keyword">for</span> movie <span class="hljs-keyword">in</span> context)
    metadatas.extend(
        {**{key: value <span class="hljs-keyword">for</span> key, value <span class="hljs-keyword">in</span> movie.items() }, <span class="hljs-string">"source"</span>: data[<span class="hljs-string">"file_name"</span>]}
        <span class="hljs-keyword">for</span> movie <span class="hljs-keyword">in</span> context
    )
    ids.extend(str(uuid.uuid4()) <span class="hljs-keyword">for</span> _ <span class="hljs-keyword">in</span> context)  <span class="hljs-comment"># Generate a unique UUID for each movie</span>
</code></pre>
<pre><code class="lang-python">documents = [<span class="hljs-string">'Two ex-convicts are hired by a retired policeman to capture a ruthless dacoit terrorizing a village.'</span>,
...]

metadatas = [{<span class="hljs-string">'name'</span>: <span class="hljs-string">'Sholay'</span>,
 <span class="hljs-string">'description'</span>: <span class="hljs-string">'Two ex-convicts are hired by a retired policeman to capture a ruthless dacoit terrorizing a village.'</span>,
 <span class="hljs-string">'director'</span>: <span class="hljs-string">'Ramesh Sippy'</span>,
 <span class="hljs-string">'year'</span>: <span class="hljs-number">1975</span>,
 <span class="hljs-string">'source'</span>: <span class="hljs-string">'movies.json'</span>},
...]

ids = [<span class="hljs-string">'7aa5f6f5-5de5-4776-8090-9a7c38a4cfcf'</span>,
...]
</code></pre>
<h3 id="heading-insert-data"><strong>Insert Data:</strong></h3>
<p>In Qdrant, you can insert data using two methods:</p>
<ul>
<li><p><strong>Record-Oriented Approach:</strong> Utilize the <code>upload_points</code> method with a list of points, each containing an <code>id</code>, <code>vector</code>embedding, and <code>payload</code>.</p>
</li>
<li><p><strong>Column-Oriented Approach:</strong> Employ the <code>upload_collection</code> method providing separate lists for <code>ids</code>, <code>vectors</code>, and <code>payload</code>.</p>
</li>
</ul>
<p>Both methods facilitate efficient data insertion, allowing you to choose the format that best suits your workflow.</p>
<pre><code class="lang-python">qdrant.upload_collection(
    collection_name=<span class="hljs-string">"my_movies"</span>,
    ids=ids,
    vectors=encoder.encode(documents),
    payload=metadatas
)
</code></pre>
<h3 id="heading-querying-1"><strong>Querying:</strong></h3>
<ul>
<li>In the provided query, a filter is applied on the payload data to retrieve points from the <code>my_movies</code> collection where the <code>year</code> field is <strong>greater than or equal to 2005</strong>. This refines the similarity search to include only relevant results.</li>
</ul>
<pre><code class="lang-python"><span class="hljs-keyword">from</span> qdrant_client.models <span class="hljs-keyword">import</span> Filter, FieldCondition, MatchValue

hits = qdrant.query_points(
    collection_name=<span class="hljs-string">"my_movies"</span>,
    query=encoder.encode(<span class="hljs-string">"engineering student life"</span>).tolist(),
    limit=<span class="hljs-number">3</span>,
    with_payload=<span class="hljs-literal">True</span>,
    query_filter=Filter(
        should=[FieldCondition(
            key=<span class="hljs-string">"year"</span>,
            range=models.Range(gte=<span class="hljs-number">2005</span>),
        )]
    ),
).points

<span class="hljs-keyword">for</span> hit <span class="hljs-keyword">in</span> hits:
    print(hit.payload, <span class="hljs-string">"score:"</span>, hit.score)
</code></pre>
<p>Output</p>
<pre><code class="lang-python">{<span class="hljs-string">'name'</span>: <span class="hljs-string">'3 Idiots'</span>, <span class="hljs-string">'description'</span>: <span class="hljs-string">'Three engineering students navigate the pressures of academia while challenging societal norms.'</span>, <span class="hljs-string">'director'</span>: <span class="hljs-string">'Rajkumar Hirani'</span>, <span class="hljs-string">'year'</span>: <span class="hljs-number">2009</span>} score: <span class="hljs-number">0.6074454</span>
{<span class="hljs-string">'name'</span>: <span class="hljs-string">'Dangal'</span>, <span class="hljs-string">'description'</span>: <span class="hljs-string">'A former wrestler trains his daughters to become world-class wrestlers against societal odds.'</span>, <span class="hljs-string">'director'</span>: <span class="hljs-string">'Nitesh Tiwari'</span>, <span class="hljs-string">'year'</span>: <span class="hljs-number">2016</span>} score: <span class="hljs-number">0.23663726</span>
{<span class="hljs-string">'name'</span>: <span class="hljs-string">'Taare Zameen Par'</span>, <span class="hljs-string">'description'</span>: <span class="hljs-string">'A dyslexic boy discovers his artistic talent with the help of a compassionate teacher.'</span>, <span class="hljs-string">'director'</span>: <span class="hljs-string">'Aamir Khan'</span>, <span class="hljs-string">'year'</span>: <span class="hljs-number">2007</span>} score: <span class="hljs-number">0.1795995</span>
</code></pre>
<blockquote>
<p>Our AI Demos Playground provides tools like the <a target="_blank" href="https://aidemos.com/ai-playground/semantic-similarity-checker"><strong>Semantic Similarity Checker</strong></a>, which allows you to upload text or files and receive similarity scores. It utilizes models from OpenAI, Hugging Face, Google, and Mistral to provide accurate assessments of semantic similarity.</p>
</blockquote>
<h2 id="heading-integration-with-openai">Integration with OpenAI</h2>
<p>To utilize OpenAI embeddings, encode your text using the OpenAI API and provide the resulting vector when uploading or querying points in your database. When creating a collection in Qdrant, set the vector size to 1536 to match the dimensionality of the embeddings generated by models like <code>text-embedding-ada-002</code> and <code>text-embedding-3-small</code>.</p>
<pre><code class="lang-python"><span class="hljs-keyword">import</span> openai

embedding_model = <span class="hljs-string">"text-embedding-3-small"</span>

openai_client = openai.Client(
    api_key=<span class="hljs-string">"&lt;YOUR_API_KEY&gt;"</span>
)

result = openai_client.embeddings.create(input=<span class="hljs-string">"hey texts how are you?"</span>, model=embedding_model)

<span class="hljs-comment"># Extract the embedding vector</span>
embedding_vector = result.data[<span class="hljs-number">0</span>].embedding

<span class="hljs-comment"># Determine the dimensionality</span>
embedding_dimension = len(embedding_vector)
print(<span class="hljs-string">f"The embedding dimensionality is: <span class="hljs-subst">{embedding_dimension}</span>"</span>)
<span class="hljs-comment"># The embedding dimensionality is: 1536</span>
</code></pre>
<h2 id="heading-accessing-the-qdrant-web-ui"><strong>Accessing the Qdrant Web UI:</strong></h2>
<p>You can manage Qdrant deployments through the WebUI, accessible at <a target="_blank" href="http://localhost:6333/dashboard">Dashboard</a>.</p>
<p>The dashboard provides two primary sections: <strong>Console</strong> and <strong>Collection</strong></p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1736753963071/f56e5738-ce8c-4fda-b1b9-8fa6f8231f46.png" alt class="image--center mx-auto" /></p>
<ol>
<li><p><strong>Console</strong>: Use the REST API to interact with Qdrant for tasks like querying or managing data.</p>
<p> <img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1736753987864/42f942d5-f40f-455c-a1e7-2d25fbf830ac.png" alt class="image--center mx-auto" /></p>
</li>
<li><p><strong>Collections</strong>: This section lets you organize, manage, and upload collections, as well as handle <strong>snapshots</strong> for backups or migrations.</p>
<p> <img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1736754027361/47976c24-7176-4dcc-a522-91033abec534.png" alt class="image--center mx-auto" /></p>
<p> <img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1736754052206/4cc226b5-8522-4c46-8baa-0151b46b1ac8.png" alt class="image--center mx-auto" /></p>
</li>
</ol>
<p>The WebUI also features a <strong>Graph Tool</strong> to visualize relationships within datasets. Found under the <strong>Graph Tab</strong> in Collections, it represents data points as an interactive tree graph. Clusters of similar points are grouped, helping users uncover hidden patterns and explore connections. The tool is flexible, allowing zooming and manipulation for enhanced clarity and deeper insights into the data structure.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1736754122049/6e56202b-aa19-441f-9ea2-fbbc5fd087a8.png" alt class="image--center mx-auto" /></p>
<h2 id="heading-key-improvements-for-using-qdrant">Key Improvements for Using Qdrant</h2>
<ol>
<li><p><strong>Caching</strong>: Cache frequent query results using <strong>Redis</strong> or store precomputed embeddings to reduce latency.</p>
</li>
<li><p><strong>LangChain Integration</strong>: Ensure embedding dimensions match Qdrant vectors and streamline query pipelines for LLM-based systems.</p>
</li>
<li><p><strong>Scalability</strong>: Use <strong>Qdrant Cloud</strong> for scaling and dividing large datasets into smaller collections to enhance performance.</p>
</li>
<li><p><strong>Vector Search Techniques</strong>: Experiment with distance metrics and chunk large documents to improve search relevance and precision.</p>
</li>
</ol>
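<p>As a minimal illustration of the caching idea: even process-local memoization avoids recomputing embeddings for repeated queries, and Redis extends the same pattern across processes. The <code>embed</code> function below is a stand-in for a real (and expensive) embedding model call:</p>

```python
from functools import lru_cache

@lru_cache(maxsize=1024)
def embed(text: str) -> tuple:
    # Stand-in for an expensive model call; returns a tuple so the result is hashable
    return tuple(float(ord(c)) for c in text[:8])

embed("for adults")  # computed on first call
embed("for adults")  # served from the cache
print(embed.cache_info().hits)  # 1
```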
<h2 id="heading-conclusion"><strong>Conclusion:</strong></h2>
<p>In this blog, we've explored how to set up and utilize Qdrant, a robust <strong>open-source vector database</strong>, using both local and Docker-based approaches. We've covered data preparation, insertion, and querying, demonstrating how Qdrant facilitates efficient similarity searches and <strong>unstructured data</strong> management. By following these steps, you can leverage Qdrant's capabilities to build sophisticated AI solutions tailored to your specific needs.</p>
<p>As you continue to work with Qdrant, consider exploring its advanced features, such as hybrid search capabilities and integration with large language models, to enhance your applications further. For more detailed information and documentation, visit Qdrant's official <a target="_blank" href="https://qdrant.tech/">website</a>.</p>
<hr />
<p>At <strong>FutureSmart AI</strong>, we specialize in developing advanced AI solutions tailored to your business needs. Our expertise encompasses building state-of-the-art vector databases and integrating LangChain-powered applications, enabling efficient semantic search and unstructured data management.</p>
<p>Have questions or need assistance? Contact us at <a target="_blank" href="mailto:contact@futuresmart.ai"><strong>contact@futuresmart.ai</strong></a> for a consultation! Visit our <a target="_blank" href="https://www.futuresmart.ai/case-studies">website</a> to discover how our AI technologies have delivered measurable business value.</p>
<p>Don’t miss our next tutorial, where we’ll dive deeper into semantic search optimizations, vector search, and more advanced LangChain-Qdrant integrations.</p>
<p>Let’s build the future together! 🌟</p>
<div data-node-type="callout">
<div data-node-type="callout-emoji">💡</div>
<div data-node-type="callout-text"><a target="_self" href="https://github.com/PradipNichite/FutureSmart-AI-Blog/tree/main/Comprehensive%20guide%20to%20Qdrant%20Vector%20DB"><strong>Get the Full Code in our GitHub</strong></a></div>
</div>]]></content:encoded></item><item><title><![CDATA[What AI Solutions Are Businesses Building Today? Trends & Real-World Applications]]></title><description><![CDATA[Introduction
AI adoption has shifted significantly in recent years. Earlier, businesses focused on classical machine learning models for tasks like classification and Named Entity Recognition (NER). Now, almost every project we build at FutureSmart A...]]></description><link>https://blog.futuresmart.ai/what-ai-solutions-are-businesses-building-today-trends-real-world-applications</link><guid isPermaLink="true">https://blog.futuresmart.ai/what-ai-solutions-are-businesses-building-today-trends-real-world-applications</guid><category><![CDATA[custom ai solutions ]]></category><category><![CDATA[generative AI solutions]]></category><category><![CDATA[ai agents]]></category><category><![CDATA[RAG ]]></category><category><![CDATA[NL2SQL]]></category><category><![CDATA[document parsing]]></category><dc:creator><![CDATA[Pradip Nichite]]></dc:creator><pubDate>Sat, 18 Jan 2025 08:58:46 GMT</pubDate><enclosure url="https://cdn.hashnode.com/res/hashnode/image/upload/v1737189830347/362850ad-f57f-407c-99fc-5aba7a5110e2.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h2 id="heading-introduction">Introduction</h2>
<p>AI adoption has shifted significantly in recent years. Earlier, businesses focused on classical machine learning models for tasks like classification and Named Entity Recognition (NER). Now, almost every project we build at <a target="_blank" href="https://www.futuresmart.ai/"><strong>FutureSmart AI</strong></a> involves Generative AI. While traditional models still work, companies prefer <strong>LLMs</strong> because they offer <strong>higher accuracy, require little to no labeled data, and enable faster deployment</strong>.</p>
<p>Through our experience building <a target="_blank" href="https://www.futuresmart.ai/services"><strong>custom AI solutions</strong></a> for clients, we have observed a clear trend: businesses are investing in AI for real-world applications that directly impact their workflows. The most in-demand solutions today include:</p>
<ul>
<li><p><strong>Retrieval-Augmented Generation (RAG) chatbots</strong></p>
</li>
<li><p><strong>Natural Language to SQL (NL2SQL) systems</strong></p>
</li>
<li><p><strong>Document parsing and structured data extraction</strong></p>
</li>
<li><p><strong>AI Agents that orchestrate multiple tools</strong></p>
</li>
</ul>
<p>In this blog, I’ll share insights from our work at FutureSmart AI, covering why these solutions are in demand and when businesses choose custom AI over off-the-shelf tools.</p>
<p>Want to see how AI solutions are making an impact? Check out our <a target="_blank" href="https://www.futuresmart.ai/case-studies"><strong>case studies</strong></a> where we showcase real-world AI implementations and their business outcomes.</p>
<hr />
<h2 id="heading-the-ai-shift-from-traditional-ml-to-generative-ai">The AI Shift: From Traditional ML to Generative AI</h2>
<p>A few years ago, classical machine learning models dominated AI development. Companies built custom models for classification, sentiment analysis, or Named Entity Recognition (NER), requiring <strong>large labeled datasets and extensive tuning</strong>. While these models were effective, they came with challenges—data collection, training costs, and maintaining model performance over time.</p>
<p>Now, businesses are turning to <strong>LLMs (Large Language Models)</strong> like GPT-4, which offer <strong>pre-trained knowledge, adaptability, and higher accuracy with minimal data requirements</strong>. Instead of spending months curating labeled datasets, teams can leverage <strong>few-shot learning or prompt engineering</strong> to achieve comparable, if not superior, results in much less time.</p>
<p>At <strong>FutureSmart AI</strong>, we've worked with companies that initially built traditional ML models but switched to LLMs due to the faster implementation and better generalization. However, using LLMs effectively still requires <strong>customization, retrieval optimization, and integration with enterprise workflows</strong>, which is where <strong>custom AI solutions</strong> add value.</p>
<p>For those looking to go a step further, <strong>fine-tuning LLMs</strong> can unlock even more <strong>domain-specific accuracy and performance</strong>. Check out this step-by-step guide on how to <strong>Fine-Tune GPT-4o Model</strong>:</p>
<p>➡️ <a target="_blank" href="https://youtu.be/jiYqbEDPw7A?si=sVxRCXln1cWAR0M0"><strong>Fine-Tune GPT-4o Model Step by Step</strong></a></p>
<p>This video walks through the <strong>process, best practices, and practical applications</strong> of fine-tuning GPT-4o for real-world business scenarios.</p>
<hr />
<h2 id="heading-the-most-in-demand-ai-solutions-businesses-are-adopting">The Most In-Demand AI Solutions Businesses Are Adopting</h2>
<p>Over the past year, we've observed a growing demand for AI solutions that provide tangible business value. While many AI advancements sound futuristic, companies are investing in <strong>practical AI applications</strong> that directly improve efficiency and decision-making.</p>
<h3 id="heading-1-retrieval-augmented-generation-rag-chatbots">1. <strong>Retrieval-Augmented Generation (RAG) Chatbots</strong></h3>
<p>Many businesses now require chatbots that can accurately retrieve information from internal documentation rather than relying solely on pre-trained models. <strong>RAG-based chatbots</strong> enable:</p>
<ul>
<li><p><strong>Real-time document retrieval</strong> to provide precise answers.</p>
</li>
<li><p><strong>Enhanced accuracy</strong> by leveraging structured and unstructured data.</p>
</li>
<li><p><strong>Custom integrations</strong> with enterprise knowledge bases and APIs.</p>
</li>
</ul>
<p>At <strong>FutureSmart AI</strong>, we've implemented RAG-based chatbots for companies needing scalable knowledge assistants. Unlike generic chatbots, these solutions provide <strong>business-specific insights</strong> and can be fine-tuned for <strong>domain-specific knowledge</strong>.</p>
<p>Want to learn more about RAG? Watch our <a target="_blank" href="https://youtu.be/38aMTXY2usU?si=6IHqX4jCRybvX911"><strong>YouTube video on building RAG solutions using LangChain</strong></a> for a hands-on guide, or read our <a target="_blank" href="https://blog.futuresmart.ai/langchain-rag-from-basics-to-production-ready-rag-chatbot"><strong>in-depth blog on RAG chatbots</strong></a> to explore how they work and why they are transforming business interactions.</p>
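<p>To make the retrieval step concrete, here is a deliberately simplified sketch of the RAG loop: retrieve the most relevant document, then ground the LLM prompt in it. The word-overlap scoring and sample documents below are illustrative stand-ins for a real embedding model and vector store.</p>
<pre><code class="lang-python">import re

def tokens(text):
    return set(re.findall(r"[a-z0-9]+", text.lower()))

def retrieve(query, documents):
    # Toy relevance score: number of shared words (real systems use embeddings)
    return max(documents, key=lambda d: len(tokens(query).intersection(tokens(d))))

def build_prompt(query, context):
    # Ground the model's answer in the retrieved context
    return f"Answer using only this context:\n{context}\n\nQuestion: {query}"

docs = [
    "Our refund policy allows returns within 30 days of purchase.",
    "Support is available Monday to Friday, 9am to 5pm.",
]
context = retrieve("What is the refund policy?", docs)
prompt = build_prompt("What is the refund policy?", context)
</code></pre>
<p>In production, <code>prompt</code> is sent to an LLM, and retrieval runs over embeddings stored in a vector database rather than word overlap.</p>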
<h3 id="heading-2-natural-language-to-sql-nl2sql-systems">2. <strong>Natural Language to SQL (NL2SQL) Systems</strong></h3>
<p>Many enterprises struggle with querying large databases efficiently. <strong>NL2SQL solutions</strong> bridge this gap by enabling users to ask questions in plain English and receive structured SQL queries. This has proven useful in:</p>
<ul>
<li><p><strong>Business intelligence &amp; analytics</strong> where non-technical users need insights.</p>
</li>
<li><p><strong>Customer support &amp; operations</strong> for quick data retrieval.</p>
</li>
<li><p><strong>Workflow automation</strong> by simplifying access to structured databases.</p>
</li>
</ul>
<p>However, generic NL2SQL models often fail in production environments due to <strong>complex database structures and domain-specific nuances</strong>. We’ve developed <strong>custom NL2SQL models</strong> that fine-tune responses based on real-world query patterns and database structures. Additionally, some businesses prefer <strong>Text-to-SQL models</strong>, which function similarly but have different optimization strategies depending on the use case.</p>
<p>Interested in how NL2SQL can enhance data accessibility? Watch our <a target="_blank" href="https://youtu.be/fss6CrmQU2Y?si=qQDTQBMp9ZW1HbVi"><strong>YouTube video: Mastering Natural Language to SQL with LangChain</strong></a> for a step-by-step guide, or check out our <a target="_blank" href="https://blog.futuresmart.ai/mastering-natural-language-to-sql-with-langchain-nl2sql"><strong>blog on NL2SQL solutions</strong></a> to see practical implementations and best practices.</p>
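<p>The core of most NL2SQL pipelines is a schema-aware prompt. Below is a minimal sketch of that step; the table and column names are illustrative placeholders, and the actual LLM call and SQL validation are left out:</p>
<pre><code class="lang-python"># Toy NL2SQL prompt builder: inject the live database schema into the prompt
# so the model writes SQL against real tables instead of guessing.
# The schema and question below are illustrative placeholders.

def build_nl2sql_prompt(schema, question):
    schema_text = "\n".join(
        f"TABLE {table} ({', '.join(columns)})" for table, columns in schema.items()
    )
    return (
        "Given this database schema:\n"
        f"{schema_text}\n\n"
        f"Write a single SQL query that answers: {question}\n"
        "Return only the SQL."
    )

schema = {
    "orders": ["id", "customer_id", "total", "created_at"],
    "customers": ["id", "name", "country"],
}
prompt = build_nl2sql_prompt(schema, "What was total revenue per country last month?")
# `prompt` would be sent to an LLM; the returned SQL should be validated
# (e.g. with EXPLAIN) before being executed against the database.
</code></pre>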
<hr />
<h3 id="heading-3-document-parsing-and-structured-data-extraction">3. <strong>Document Parsing and Structured Data Extraction</strong></h3>
<p>Many businesses deal with large volumes of unstructured documents—contracts, invoices, resumes, and more. Extracting structured information from these documents manually is time-consuming and error-prone. <strong>AI-powered document parsing</strong> helps automate this process by:</p>
<ul>
<li><p><strong>Extracting key details</strong> like names, dates, and financial figures.</p>
</li>
<li><p><strong>Improving accuracy</strong> over rule-based approaches using deep learning.</p>
</li>
<li><p><strong>Integrating seamlessly</strong> with business workflows and databases.</p>
</li>
</ul>
<p>At <strong>FutureSmart AI</strong>, we've built <strong>custom document parsing solutions</strong> tailored to various industries. Our models handle <strong>OCR-based text extraction, entity recognition, and format normalization</strong>, ensuring businesses get <strong>clean, structured data</strong> from PDFs and scanned images.</p>
<p>Many businesses also combine Document Parsing + NL2SQL, enabling natural language queries over extracted data (e.g., “Show me resumes of candidates with Python experience”).</p>
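<p>To make this concrete, here is a minimal rule-based extraction sketch. Real pipelines replace these regexes with OCR and learned entity recognition, and the field patterns and invoice text below are made up for illustration:</p>
<pre><code class="lang-python">import re

# Minimal rule-based extraction from invoice-like text. Production parsers add
# OCR for scanned pages and ML-based entity recognition for robustness.

def parse_invoice(text):
    invoice_no = re.search(r"Invoice\s*#\s*(\w+)", text)
    date = re.search(r"\b(\d{4}-\d{2}-\d{2})\b", text)
    total = re.search(r"Total:\s*\$?([\d,]+\.\d{2})", text)
    return {
        "invoice_no": invoice_no.group(1) if invoice_no else None,
        "date": date.group(1) if date else None,
        "total": float(total.group(1).replace(",", "")) if total else None,
    }

sample = "Invoice # INV42 issued 2024-03-01. Total: $1,250.00"
parsed = parse_invoice(sample)
</code></pre>
<p>The resulting structured dictionary can then be written to a database or fed into downstream workflows.</p>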
<h2 id="heading-the-role-of-ai-agents-in-business-automation">The Role of AI Agents in Business Automation</h2>
<p>AI is no longer just about isolated solutions—it’s about <strong>intelligent orchestration</strong>. AI Agents are transforming how businesses automate workflows, making <strong>real-time decisions</strong> on when and how to use different AI tools. If you're interested in exploring <strong>multi-agent systems</strong>, check out our video on <a target="_blank" href="https://youtu.be/g9E7VNeZItM?si=tbJupO4qiWPP7g5D"><strong>Building Multi-Agent Systems with OpenAI Swarm: Practical Example</strong></a>.</p>
<h3 id="heading-what-are-ai-agents"><strong>What Are AI Agents?</strong></h3>
<p>AI Agents are autonomous systems that can:</p>
<ul>
<li><p><strong>Decide dynamically</strong> which AI tool to use based on user input.</p>
</li>
<li><p><strong>Combine multiple AI solutions</strong> like RAG, NL2SQL, and document parsing.</p>
</li>
<li><p><strong>Integrate seamlessly</strong> with enterprise software like HubSpot, Odoo, and Gmail.</p>
</li>
</ul>
<p>For example, an AI Agent can:</p>
<ul>
<li><p><strong>Retrieve documents</strong> using RAG before responding to a query.</p>
</li>
<li><p><strong>Generate SQL queries</strong> using NL2SQL for structured database insights.</p>
</li>
<li><p><strong>Extract key details</strong> from documents before feeding them into a report.</p>
</li>
</ul>
<p>At <strong>FutureSmart AI</strong>, we specialize in designing <strong>custom AI Agents</strong> that fit specific business needs, ensuring they provide <strong>actionable, high-quality insights</strong> without human intervention.</p>
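<p>The tool-selection behaviour described above can be sketched as a toy router. In a real agent the LLM itself selects the tool (for example via function calling); the keyword rules and tool names below are illustrative placeholders:</p>
<pre><code class="lang-python"># Toy AI-Agent router: decide which tool should handle a request.
# In production the LLM picks the tool (e.g. via function calling);
# keyword rules here just make the dispatch logic visible.

def rag_tool(query):
    return f"[RAG] retrieving documents for: {query}"

def sql_tool(query):
    return f"[NL2SQL] generating SQL for: {query}"

def parse_tool(query):
    return f"[Parser] extracting fields for: {query}"

TOOL_RULES = [
    (("invoice", "resume", "extract"), parse_tool),
    (("how many", "revenue", "average"), sql_tool),
    (("policy", "manual", "documentation"), rag_tool),
]

def route(query):
    q = query.lower()
    for keywords, tool in TOOL_RULES:
        if any(k in q for k in keywords):
            return tool(query)
    return rag_tool(query)  # fall back to document retrieval

answer = route("How many orders did we ship last week?")
</code></pre>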
<p>Want to see AI Agents in action? Watch our <a target="_blank" href="https://youtu.be/qaEVUhoKS8M?si=DFXpZpLFLVHQLD1Z"><strong>YouTube video on AI Agents</strong></a> or read our <a target="_blank" href="https://blog.futuresmart.ai/series/langraph-tutorial"><strong>blog series on AI Agents</strong></a> for a deeper understanding of how they work and their business applications.</p>
<hr />
<h2 id="heading-choosing-the-right-ai-solution-for-your-business">Choosing the Right AI Solution for Your Business</h2>
<p>With so many AI technologies available, choosing the right solution for your business can be challenging. Companies must consider several factors before deciding between <strong>off-the-shelf AI solutions</strong> and <strong>custom AI development</strong>.</p>
<h3 id="heading-when-to-choose-off-the-shelf-ai-solutions"><strong>When to Choose Off-the-Shelf AI Solutions</strong></h3>
<p>Pre-built AI solutions can be useful when:</p>
<ul>
<li><p><strong>Your requirements are general</strong> (e.g., basic chatbots, automated transcription).</p>
</li>
<li><p><strong>Speed and cost are priorities</strong>—off-the-shelf solutions are ready to use.</p>
</li>
<li><p><strong>You need quick experimentation</strong> before committing to custom development.</p>
</li>
</ul>
<p>However, these solutions often <strong>lack flexibility</strong> and may not integrate well with existing enterprise workflows.</p>
<h3 id="heading-when-to-opt-for-custom-ai-development"><strong>When to Opt for Custom AI Development</strong></h3>
<p>Businesses choose custom AI solutions when:</p>
<ul>
<li><p><strong>They need domain-specific accuracy</strong> (e.g., legal, financial, or medical AI solutions).</p>
</li>
<li><p><strong>They require seamless integration</strong> with existing software and databases.</p>
</li>
<li><p><strong>Scalability and long-term ownership</strong> are critical factors.</p>
</li>
</ul>
<p>At <strong>FutureSmart AI</strong>, we help businesses evaluate whether an <strong>off-the-shelf solution meets their needs</strong> or if they <strong>require a tailored AI system</strong> that enhances efficiency and drives growth.</p>
<p>Want to discuss which AI solution fits your business? Contact us today at <a target="_blank" href="mailto:contact@futuresmart.ai"><strong>contact@futuresmart.ai</strong></a></p>
<hr />
<h2 id="heading-final-thoughts">Final Thoughts</h2>
<p>AI is rapidly transforming how businesses operate, but success depends on choosing the right solution for your needs. Whether it’s <strong>RAG-powered chatbots, NL2SQL systems, AI Agents, or document parsing</strong>, selecting the right approach can significantly impact efficiency and scalability.</p>
<p>At <strong>FutureSmart AI</strong>, we specialize in building <strong>custom AI solutions</strong> that align with business goals, ensuring AI is not just a tool but a competitive advantage.</p>
<p>If you're considering AI adoption or want to optimize your existing AI strategy, let's talk. <a target="_blank" href="https://www.linkedin.com/in/pradipnichite/"><strong>Get in touch with us today</strong></a> and explore how AI can revolutionize your business.</p>
]]></content:encoded></item><item><title><![CDATA[Integrating Google Authentication with FastAPI: A Step-by-Step Guide]]></title><description><![CDATA[In this blog, we’ll walk through the process of enabling Google Authentication for a backend API system built using FastAPI. FastAPI has emerged as a popular choice for developing advanced GenAI applications and we at FutureSmart AI use it regularly ...]]></description><link>https://blog.futuresmart.ai/integrating-google-authentication-with-fastapi-a-step-by-step-guide</link><guid isPermaLink="true">https://blog.futuresmart.ai/integrating-google-authentication-with-fastapi-a-step-by-step-guide</guid><category><![CDATA[oauth]]></category><category><![CDATA[FastAPI]]></category><category><![CDATA[Cookie based authentication]]></category><category><![CDATA[Google]]></category><category><![CDATA[google cloud]]></category><dc:creator><![CDATA[Sakalya Mitra]]></dc:creator><pubDate>Thu, 16 Jan 2025 15:40:46 GMT</pubDate><enclosure url="https://cdn.hashnode.com/res/hashnode/image/upload/v1735466696855/83ca88bb-574f-443b-9e72-d232f33d930c.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>In this blog, we’ll walk through the process of enabling Google Authentication for a backend API system built using FastAPI. FastAPI has emerged as a popular choice for developing advanced GenAI applications and we at <a target="_blank" href="https://www.futuresmart.ai/">FutureSmart AI</a> use it regularly to develop client applications. On top of FastAPI, Authentication is a critical component of any API-driven system as it ensures that only authorized users can access protected resources. By implementing authentication, you can safeguard sensitive data, prevent unauthorized access, and maintain the integrity of your services.</p>
<p>This implementation covers the authentication endpoints, the Google OAuth2 login flow itself, and securing API access with the resulting tokens. Such a setup not only enhances security but also simplifies the user experience by leveraging Google's robust authentication system. Additionally, we’ll add an endpoint that uses this authentication, demonstrating how authenticated APIs can provide tailored and secure responses.</p>
<h2 id="heading-prerequisites">Prerequisites</h2>
<p>Before we begin, ensure you have the following ready:</p>
<ul>
<li><p><strong>Google Cloud Console Account:</strong> To set up the OAuth client credentials.</p>
</li>
<li><p><strong>Python Environment:</strong> Installed and configured with the required libraries</p>
</li>
</ul>
<hr />
<h2 id="heading-step-1-setting-up-google-oauth2-client-on-google-cloud-console">Step 1: Setting Up Google OAuth2 Client on Google Cloud Console</h2>
<p>To enable Google authentication, the first step is setting up an OAuth2 client in the Google Cloud Console:</p>
<ol>
<li><p>Navigate to the <a target="_blank" href="https://console.cloud.google.com/">Google Cloud Console</a>.</p>
</li>
<li><p>Create a new project or select an existing one.</p>
<p> <img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1735413050051/914438bc-7a8b-461b-baa3-2759a5dd9745.png" alt class="image--center mx-auto" /></p>
</li>
<li><p>Once the project is opened/created, pull up the side bar and visit the <strong>APIs and Services</strong> section.</p>
</li>
<li><p>In that section select the “<strong>Credentials</strong>” options. The Credentials page will open.</p>
<p> <img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1735413225365/a5707458-1a47-4512-b3c3-618d6ec3eca2.png" alt class="image--center mx-auto" /></p>
</li>
<li><p>If this is a new project or your first project, you will be asked to configure the OAuth Consent Screen with your application details. This is a simple form asking what type of users will use your product (either External or Internal). Select whichever fits your requirement.</p>
<p> <img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1735413470900/f4a67414-9d33-4788-9599-16ab60baffeb.png" alt class="image--center mx-auto" /></p>
</li>
<li><p>Once the previous step is completed, you will be prompted to configure the consent screen of your application. This is what the consent screen looks like for the <a target="_blank" href="https://aidemos.com/ai-playground">AI Demos Playground</a> and <a target="_blank" href="https://aidemos.com/ai-tools">AI Demos Tools</a>.</p>
<p> <img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1735413700961/c439060a-7834-4ab6-bdcc-9757579861a9.png" alt class="image--center mx-auto" /></p>
<p> You can configure the consent screen however you want: the app name it displays, the logo it shows, and more. Simply add your details and continue. For this walkthrough we will name it “<em>sample-auth-app</em>” and set myself as the developer, keeping everything else at its defaults.</p>
<p> <img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1735413789751/df2e10fa-07f0-4809-9eb4-6e029864f5f9.png" alt class="image--center mx-auto" /></p>
<p> This is how your OAuth Consent Screen page will look after the setup:</p>
<p> <img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1735413949052/b4430a33-e4af-4c02-b689-b5bda6eec251.png" alt class="image--center mx-auto" /></p>
</li>
<li><p>Now that you have the Consent screen configured, head over to the Credentials page and click on <strong>“Create Credentials”.</strong> Choose the <strong>“OAuth Client ID”</strong> option</p>
<p> <img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1735414163625/591ee278-4932-4110-b83d-d04a38761f11.png" alt class="image--center mx-auto" /></p>
<p> This ensures that when someone tries to authenticate to your app, they are asked for consent and shown the consent screen with all the details needed to log in.</p>
</li>
<li><p>Once you are on the OAuth Client ID page, you will be prompted with two compulsory fields. One is the Application Type and other is the Name.</p>
<p> <img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1735414418697/ac2ee655-ff84-4e77-a454-336fbfeda18d.png" alt class="image--center mx-auto" /></p>
<p> You can select the Application Type as per your app requirement. For this blog, we will select Web Application as we will be using it to authenticate our APIs via FastAPI. Also, I set the Name the same as our app: <strong>sample-auth-app.</strong></p>
<p> The other two fields, Authorised JavaScript Origins and Authorised Redirect URIs, are also important, but we will configure them later in the blog once their purpose is clear. For now, keep them empty and click <strong>“Create”.</strong></p>
</li>
<li><p>Your credentials are now created. You can download them as a client secret JSON file from the Credentials page.</p>
<p> <img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1735414653718/82ef7a69-f022-4792-be04-eac1eda16b3d.png" alt class="image--center mx-auto" /></p>
<p> We have successfully set up our OAuth Client and completed the first step.</p>
</li>
</ol>
<hr />
<h2 id="heading-step-2-install-necessary-python-packages">Step 2: Install Necessary Python Packages</h2>
<p>We will install the packages required to create the FastAPI endpoints and secure them with Google auth. Add the following to <code>requirements.txt</code>, then run <code>pip install -r requirements.txt</code>:</p>
<pre><code class="lang-plaintext">python-dotenv
mysql-connector-python
requests
fastapi
uvicorn
python-jose[cryptography]
authlib
pyjwt
itsdangerous
google-auth
</code></pre>
<hr />
<h2 id="heading-step-3-set-up-the-project-structure">Step 3: Set up the Project Structure</h2>
<p>We will follow a proper project structure for building our authenticated API system. Below is the sample project structure we will follow.</p>
<p>The main chat endpoint will be present in the <code>chat.py</code> file. The authentication setup will be present in the <code>auth.py</code> file. These will be routers and accessed via the main FastAPI app in the <code>api.py</code> file.</p>
<p>If you want to know in detail about how FastAPI works, you can check this detailed blog on <a target="_blank" href="https://blog.futuresmart.ai/beginners-guide-to-fastapi-openai-chatgpt-integration">Beginner’s Guide to FastAPI and OpenAI Integration</a>.</p>
<pre><code class="lang-plaintext">fastapi_auth_demo/
├── apis/
│   ├── chat.py
│   ├── auth.py
├── utils/
├── venv/
├── .env
├── .gitignore
├── api.py
├── client_secret.json
├── README.md
└── requirements.txt
</code></pre>
<hr />
<h2 id="heading-step-4-configure-environment-variables">Step 4: Configure Environment Variables</h2>
<p>We will store sensitive credentials securely using environment variables. Create a <code>.env</code> file in the project directory and add:</p>
<pre><code class="lang-plaintext">GOOGLE_CLIENT_ID=&lt;your-google-client-id&gt;
GOOGLE_CLIENT_SECRET=&lt;your-google-client-secret&gt;
SECRET_KEY=&lt;your-secret-key&gt;
REDIRECT_URL=&lt;your-redirect-url&gt;
JWT_SECRET_KEY=&lt;your-secret-key&gt;
FRONTEND_URL=&lt;your-frontend-url&gt;
</code></pre>
<ul>
<li><p>You can directly get the <code>GOOGLE_CLIENT_ID</code> and <code>GOOGLE_CLIENT_SECRET</code> from the JSON file you downloaded in the first step.</p>
</li>
<li><p>Keep the <code>REDIRECT_URL</code> and <code>FRONTEND_URL</code> empty for now and we will fill it with a proper value in the upcoming steps.</p>
</li>
<li><p>The <code>SECRET_KEY</code> is a random string set by us that is used to set the <code>state</code> parameter in the OAuth flow.</p>
</li>
<li><p>The <code>JWT_SECRET_KEY</code> is a random string set by us that is used as the key to encode and decode the access tokens generated after authentication.</p>
</li>
</ul>
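<p>A practical note on the two secret keys: “random string” should mean cryptographically random. One common way to generate such values (a suggestion, not a requirement of this tutorial) is Python's standard <code>secrets</code> module:</p>
<pre><code class="lang-python"># Generate strong random values for SECRET_KEY / JWT_SECRET_KEY.
# The stdlib `secrets` module is designed for security-sensitive randomness,
# unlike the predictable `random` module.
import secrets

print(secrets.token_hex(32))      # 64 hex characters
print(secrets.token_urlsafe(32))  # URL-safe variant
</code></pre>
<p>Paste the generated values into your <code>.env</code> file and never commit them to version control.</p>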
<hr />
<h2 id="heading-step-5-set-up-google-oauth-integration-in-fastapi">Step 5: Set Up Google OAuth Integration in FastAPI</h2>
<p>The core functionality for Google authentication lies in integrating the OAuth2 flow into your FastAPI application. Below is the implementation:</p>
<h3 id="heading-import-dependencies">Import Dependencies</h3>
<p>Start by importing necessary modules and configuring OAuth:</p>
<pre><code class="lang-python"><span class="hljs-comment"># auth.py</span>
<span class="hljs-keyword">from</span> fastapi <span class="hljs-keyword">import</span> FastAPI, APIRouter, Depends, HTTPException, status, Request, Cookie
<span class="hljs-keyword">from</span> fastapi.responses <span class="hljs-keyword">import</span> JSONResponse, RedirectResponse
<span class="hljs-keyword">from</span> authlib.integrations.starlette_client <span class="hljs-keyword">import</span> OAuth
<span class="hljs-keyword">from</span> starlette.config <span class="hljs-keyword">import</span> Config
<span class="hljs-keyword">from</span> starlette.middleware.sessions <span class="hljs-keyword">import</span> SessionMiddleware
<span class="hljs-keyword">from</span> datetime <span class="hljs-keyword">import</span> datetime, timedelta
<span class="hljs-keyword">from</span> jose <span class="hljs-keyword">import</span> jwt, ExpiredSignatureError, JWTError
<span class="hljs-keyword">from</span> dotenv <span class="hljs-keyword">import</span> load_dotenv
<span class="hljs-keyword">import</span> os
<span class="hljs-keyword">import</span> uuid
<span class="hljs-keyword">import</span> traceback

<span class="hljs-comment"># Load environment variables</span>
load_dotenv(override=<span class="hljs-literal">True</span>)
config = Config()  <span class="hljs-comment"># reads the values load_dotenv placed in os.environ</span>

<span class="hljs-comment"># App Configuration</span>
app = FastAPI()
app.add_middleware(SessionMiddleware, secret_key=os.getenv(<span class="hljs-string">"SECRET_KEY"</span>))
router = APIRouter()

<span class="hljs-comment"># OAuth Setup</span>
oauth = OAuth()
oauth.register(
    name=<span class="hljs-string">"auth_demo"</span>,
    client_id=config(<span class="hljs-string">"GOOGLE_CLIENT_ID"</span>),
    client_secret=config(<span class="hljs-string">"GOOGLE_CLIENT_SECRET"</span>),
    authorize_url=<span class="hljs-string">"https://accounts.google.com/o/oauth2/auth"</span>,
    authorize_params=<span class="hljs-literal">None</span>,
    access_token_url=<span class="hljs-string">"https://accounts.google.com/o/oauth2/token"</span>,
    access_token_params=<span class="hljs-literal">None</span>,
    refresh_token_url=<span class="hljs-literal">None</span>,
    authorize_state=config(<span class="hljs-string">"SECRET_KEY"</span>),
    redirect_uri=<span class="hljs-string">"http://127.0.0.1:8000/auth"</span>,
    jwks_uri=<span class="hljs-string">"https://www.googleapis.com/oauth2/v3/certs"</span>,
    client_kwargs={<span class="hljs-string">"scope"</span>: <span class="hljs-string">"openid profile email"</span>},
)

<span class="hljs-comment"># JWT Configurations</span>
SECRET_KEY = os.getenv(<span class="hljs-string">"JWT_SECRET_KEY"</span>)
ALGORITHM = <span class="hljs-string">"HS256"</span>
</code></pre>
<p>This is what happens in this part of the code:</p>
<p>The <code>oauth = OAuth()</code> instance initializes the OAuth client. The <code>oauth.register()</code> method registers a new provider, in this case, Google. Here’s what each parameter does:</p>
<ul>
<li><p><code>name="auth_demo"</code></p>
<ul>
<li>This is a unique name assigned to this OAuth2 provider configuration. It can be referenced in the application when invoking authentication flows.</li>
</ul>
</li>
<li><p><code>client_id=config("GOOGLE_CLIENT_ID")</code></p>
<ul>
<li>The <code>client_id</code> is fetched from configuration (usually environment variables). It identifies your application to Google’s OAuth2 system.</li>
</ul>
</li>
<li><p><code>client_secret=config("GOOGLE_CLIENT_SECRET")</code></p>
<ul>
<li>The <code>client_secret</code>, also fetched from configuration, is a secret key provided by Google that verifies your application during the OAuth2 flow. It should remain confidential.</li>
</ul>
</li>
<li><p><code>authorize_url="</code><a target="_blank" href="https://accounts.google.com/o/oauth2/auth"><code>https://accounts.google.com/o/oauth2/auth</code></a><code>"</code></p>
<ul>
<li>This URL is the endpoint where users are redirected to log in with Google. It starts the OAuth2 authorization flow.</li>
</ul>
</li>
<li><p><code>authorize_params=None</code></p>
<ul>
<li>Any additional parameters for the authorization URL can be specified here. Since it’s <code>None</code>, no extra parameters are added.</li>
</ul>
</li>
<li><p><code>access_token_url="</code><a target="_blank" href="https://accounts.google.com/o/oauth2/token"><code>https://accounts.google.com/o/oauth2/token</code></a><code>"</code></p>
<ul>
<li>After the user logs in and grants permission, your application exchanges the authorization code for an access token at this endpoint.</li>
</ul>
</li>
<li><p><code>access_token_params=None</code></p>
<ul>
<li>Specifies any additional parameters for obtaining the access token. Since it’s <code>None</code>, no extra parameters are included.</li>
</ul>
</li>
<li><p><code>refresh_token_url=None</code></p>
<ul>
<li>If your application needs to refresh tokens, you would specify the refresh token endpoint. Here it’s <code>None</code>, indicating no token refresh functionality is configured.</li>
</ul>
</li>
<li><p><code>authorize_state=config("SECRET_KEY")</code></p>
<ul>
<li>This optional parameter ensures the state parameter (used to prevent CSRF attacks) is securely managed. The <code>SECRET_KEY</code> provides added security.</li>
</ul>
</li>
<li><p><code>redirect_uri="</code><a target="_blank" href="http://127.0.0.1:8000/auth"><code>http://127.0.0.1:8000/auth</code></a><code>"</code></p>
<ul>
<li>The <code>redirect_uri</code> is the URL to which Google redirects the user after a successful login. This must match the redirect URI configured in the Google Cloud Console. We will understand this in detail in the later part of the blog</li>
</ul>
</li>
<li><p><code>jwks_uri="</code><a target="_blank" href="https://www.googleapis.com/oauth2/v3/certs"><code>https://www.googleapis.com/oauth2/v3/certs</code></a><code>"</code></p>
<ul>
<li>The <strong>JWKS URI</strong> (JSON Web Key Set URI) provides the keys needed to validate the tokens issued by Google.</li>
</ul>
</li>
<li><p><code>client_kwargs={"scope": "openid profile email"}</code></p>
<ul>
<li><p>Defines the scopes (permissions) your application requests:</p>
<ul>
<li><p><code>openid</code>: Grants access to the user's identity.</p>
</li>
<li><p><code>profile</code>: Provides access to the user’s basic profile details, such as name and profile picture.</p>
</li>
<li><p><code>email</code>: Grants access to the user’s email address.</p>
</li>
</ul>
</li>
</ul>
</li>
</ul>
<p>The second part of the code configures JWT, a token standard for securely transmitting information between parties.</p>
<ol>
<li><p><code>SECRET_KEY = os.getenv("JWT_SECRET_KEY")</code>:</p>
<ul>
<li><p>Retrieves the secret key from the environment variables. This key is used to encode and decode JWT tokens.</p>
</li>
<li><p>It’s essential to keep this key secure because anyone with access to it can forge valid tokens.</p>
</li>
</ul>
</li>
<li><p><code>ALGORITHM = "HS256"</code>:</p>
<ul>
<li>Specifies the algorithm used for encoding and decoding JWT tokens. Here, <strong>HS256</strong> (HMAC using SHA-256) is chosen, which is widely used for its balance of security and performance.</li>
</ul>
</li>
</ol>
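<p>To demystify what <code>jwt.encode</code> with HS256 actually produces, here is a stdlib-only sketch of the signing and verification steps. This is for illustration only; keep using <code>python-jose</code> in the application, since this sketch skips claim validation such as expiry:</p>
<pre><code class="lang-python"># What HS256 JWT signing boils down to (stdlib only, illustrative):
# b64url(header) + "." + b64url(payload) + "." + b64url(HMAC-SHA256 signature)
import base64, hashlib, hmac, json

def b64url(data):
    # JWTs use unpadded URL-safe base64
    return base64.urlsafe_b64encode(data).rstrip(b"=").decode()

def sign_hs256(payload, secret):
    header = b64url(json.dumps({"alg": "HS256", "typ": "JWT"}).encode())
    body = b64url(json.dumps(payload).encode())
    signature = hmac.new(secret.encode(), f"{header}.{body}".encode(), hashlib.sha256).digest()
    return f"{header}.{body}.{b64url(signature)}"

def verify_hs256(token, secret):
    header, body, signature = token.split(".")
    expected = hmac.new(secret.encode(), f"{header}.{body}".encode(), hashlib.sha256).digest()
    return hmac.compare_digest(b64url(expected), signature)

token = sign_hs256({"sub": "user-1", "email": "user@example.com"}, "my-jwt-secret")
assert verify_hs256(token, "my-jwt-secret")
assert not verify_hs256(token, "wrong-secret")
</code></pre>
<p>This also shows why <code>JWT_SECRET_KEY</code> must stay secret: anyone holding it can produce signatures that verify.</p>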
<h3 id="heading-utility-functions-for-jwt">Utility Functions for JWT</h3>
<p>We’ll use JWT (JSON Web Tokens) for managing user sessions securely:</p>
<pre><code class="lang-python"><span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">create_access_token</span>(<span class="hljs-params">data: dict, expires_delta: timedelta = None</span>):</span>
    to_encode = data.copy()
    expire = datetime.utcnow() + (expires_delta <span class="hljs-keyword">or</span> timedelta(minutes=<span class="hljs-number">30</span>))
    to_encode.update({<span class="hljs-string">"exp"</span>: expire})
    <span class="hljs-keyword">return</span> jwt.encode(to_encode, SECRET_KEY, algorithm=ALGORITHM)

<span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">get_current_user</span>(<span class="hljs-params">token: str = Cookie(<span class="hljs-params">None</span>)</span>):</span>
    <span class="hljs-keyword">if</span> <span class="hljs-keyword">not</span> token:
        <span class="hljs-keyword">raise</span> HTTPException(status_code=<span class="hljs-number">401</span>, detail=<span class="hljs-string">"Not authenticated"</span>)

    <span class="hljs-keyword">try</span>:
        payload = jwt.decode(token, SECRET_KEY, algorithms=[ALGORITHM])
        <span class="hljs-keyword">return</span> {<span class="hljs-string">"user_id"</span>: payload.get(<span class="hljs-string">"sub"</span>), <span class="hljs-string">"email"</span>: payload.get(<span class="hljs-string">"email"</span>)}
    <span class="hljs-keyword">except</span> ExpiredSignatureError:
        <span class="hljs-keyword">raise</span> HTTPException(status_code=<span class="hljs-number">401</span>, detail=<span class="hljs-string">"Token expired"</span>)
    <span class="hljs-keyword">except</span> JWTError:
        <span class="hljs-keyword">raise</span> HTTPException(status_code=<span class="hljs-number">401</span>, detail=<span class="hljs-string">"Invalid token"</span>)
</code></pre>
<p>These two basic functions are crucial:</p>
<ol>
<li><p><code>create_access_token</code>: This function takes the user data and encodes it into a JWT using the <code>JWT_SECRET_KEY</code> we set in the .env file and the configured algorithm. It also accepts an <code>expires_delta</code> parameter, the duration for which the access token remains valid; after this period the token expires and can no longer authenticate a user.</p>
</li>
<li><p><code>get_current_user</code>: This function reads the access token from the cookie and decodes it to retrieve information such as the user_id and email. Whenever a user goes through the login procedure, the access token is set as a browser cookie and can be read directly from there. We will see in the next part how this happens.</p>
</li>
</ol>
<h3 id="heading-login-and-auth-endpoints">Login and Auth Endpoints</h3>
<p>Create endpoints for logging in and authenticating users:</p>
<pre><code class="lang-python"><span class="hljs-meta">@router.get("/login")</span>
<span class="hljs-keyword">async</span> <span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">login</span>(<span class="hljs-params">request: Request</span>):</span>
    request.session.clear()
    referer = request.headers.get(<span class="hljs-string">"referer"</span>)
    frontend_url = os.getenv(<span class="hljs-string">"FRONTEND_URL"</span>)
    redirect_url = os.getenv(<span class="hljs-string">"REDIRECT_URL"</span>)
    request.session[<span class="hljs-string">"login_redirect"</span>] = frontend_url 

    <span class="hljs-keyword">return</span> <span class="hljs-keyword">await</span> oauth.auth_demo.authorize_redirect(request, redirect_url, prompt=<span class="hljs-string">"consent"</span>)
</code></pre>
<p>When we hit the <code>/login</code> endpoint, we first fetch the referrer (the domain from which the request was received). We also fetch the <code>FRONTEND_URL</code> and <code>REDIRECT_URL</code> from the .env file. Why these are required, and what values to put in them, is what we will discuss next.</p>
<p>The google authentication process happens in the following way:</p>
<ol>
<li><p>We send the login request to Google using our registered OAuth client. This ensures that information about the user who is logging in is shared with us only with their consent.</p>
</li>
<li><p>Once the user consents to sharing their information, Google allows the login and attaches the user&rsquo;s details to an access token.</p>
</li>
<li><p>Control then returns to our end to fetch the access token, decode it, retrieve the user&rsquo;s information, and verify it against our database or system. The <code>REDIRECT_URL</code> decides which URL Google forwards control to for this verification. In our case, a second router endpoint called <code>/auth</code> handles these steps, so we set our auth endpoint as the <code>REDIRECT_URL</code> environment variable.</p>
</li>
<li><p>Once this is done, we can safely say the user is authenticated and allow them access to the APIs.</p>
</li>
<li><p>The <code>FRONTEND_URL</code> parameter decides where the user is redirected once they have logged in and been verified.</p>
</li>
</ol>
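<p>To make step 1 concrete, here is a stdlib-only sketch of roughly the consent-screen URL that <code>authorize_redirect</code> sends the user to. The helper name and parameter values are illustrative, not part of the project code; Authlib builds the real URL for us.</p>

```python
from urllib.parse import urlencode

def build_google_auth_url(client_id: str, redirect_uri: str, state: str) -> str:
    # Parameters Google's authorization endpoint expects; Authlib assembles
    # an equivalent URL internally when authorize_redirect() is called.
    params = {
        "response_type": "code",       # ask Google for an authorization code
        "client_id": client_id,
        "redirect_uri": redirect_uri,  # must match an Authorised redirect URI
        "scope": "openid email profile",
        "state": state,                # random value, checked later to block CSRF
        "prompt": "consent",
    }
    return "https://accounts.google.com/o/oauth2/v2/auth?" + urlencode(params)

url = build_google_auth_url("my-client-id", "http://127.0.0.1:8000/auth", "xyz")
```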
<p>Let&rsquo;s take the example of <a target="_blank" href="https://aidemos.com/ai-playground">AI Demos Playground</a> to understand the complete flow.</p>
<p>When you visit the AI Demos Playground, you will see the Login button at the top right.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1735417326030/4fb0d847-4b59-463e-acf7-0ed3e9066999.png" alt class="image--center mx-auto" /></p>
<p>When you click the Login button, you are redirected to the Google Auth consent page to approve sharing your information with this website. In the backend, clicking the Login button calls the <code>/login</code> API endpoint, which redirects you to the Google consent screen.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1735417520979/f6aa1240-1a21-41fc-9845-9958a338e0f3.png" alt class="image--center mx-auto" /></p>
<p>Once you complete the process and consent, control moves to the <code>/auth</code> endpoint, which fetches the access token, verifies it, retrieves the user&rsquo;s information, and stores it in the database. After all this, the user is redirected to the <code>FRONTEND_URL</code>, which in this case is the AI Demos Playground page from where you logged in.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1735417542281/7105e1e0-5ecc-4104-b3fd-3fd759b2c584.png" alt class="image--center mx-auto" /></p>
<p>Now that you have got the idea of the entire flow, let’s move to the auth endpoint.</p>
<pre><code class="lang-python"><span class="hljs-meta">@router.get("/auth")</span>
<span class="hljs-keyword">async</span> <span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">auth</span>(<span class="hljs-params">request: Request</span>):</span>
    <span class="hljs-keyword">try</span>:
        token = <span class="hljs-keyword">await</span> oauth.auth_demo.authorize_access_token(request)
    <span class="hljs-keyword">except</span> Exception <span class="hljs-keyword">as</span> e:
        <span class="hljs-keyword">raise</span> HTTPException(status_code=<span class="hljs-number">401</span>, detail=<span class="hljs-string">"Google authentication failed."</span>)

    <span class="hljs-keyword">try</span>:
        user_info_endpoint = <span class="hljs-string">"https://www.googleapis.com/oauth2/v2/userinfo"</span>
        headers = {<span class="hljs-string">"Authorization"</span>: <span class="hljs-string">f'Bearer <span class="hljs-subst">{token[<span class="hljs-string">"access_token"</span>]}</span>'</span>}
        google_response = requests.get(user_info_endpoint, headers=headers)
        user_info = google_response.json()
    <span class="hljs-keyword">except</span> Exception <span class="hljs-keyword">as</span> e:
        <span class="hljs-keyword">raise</span> HTTPException(status_code=<span class="hljs-number">401</span>, detail=<span class="hljs-string">"Google authentication failed."</span>)

    user = token.get(<span class="hljs-string">"userinfo"</span>)
    expires_in = token.get(<span class="hljs-string">"expires_in"</span>)
    user_id = user.get(<span class="hljs-string">"sub"</span>)
    iss = user.get(<span class="hljs-string">"iss"</span>)
    user_email = user.get(<span class="hljs-string">"email"</span>)
    first_logged_in = datetime.utcnow()
    last_accessed = datetime.utcnow()

    user_name = user_info.get(<span class="hljs-string">"name"</span>)
    user_pic = user_info.get(<span class="hljs-string">"picture"</span>)

    <span class="hljs-keyword">if</span> iss <span class="hljs-keyword">not</span> <span class="hljs-keyword">in</span> [<span class="hljs-string">"https://accounts.google.com"</span>, <span class="hljs-string">"accounts.google.com"</span>]:
        <span class="hljs-keyword">raise</span> HTTPException(status_code=<span class="hljs-number">401</span>, detail=<span class="hljs-string">"Google authentication failed."</span>)

    <span class="hljs-keyword">if</span> user_id <span class="hljs-keyword">is</span> <span class="hljs-literal">None</span>:
        <span class="hljs-keyword">raise</span> HTTPException(status_code=<span class="hljs-number">401</span>, detail=<span class="hljs-string">"Google authentication failed."</span>)

    <span class="hljs-comment"># Create JWT token</span>
    access_token_expires = timedelta(seconds=expires_in)
    access_token = create_access_token(data={<span class="hljs-string">"sub"</span>: user_id, <span class="hljs-string">"email"</span>: user_email}, expires_delta=access_token_expires)

    session_id = str(uuid.uuid4())
    log_user(user_id, user_email, user_name, user_pic, first_logged_in, last_accessed)
    log_token(access_token, user_email, session_id)

    redirect_url = request.session.pop(<span class="hljs-string">"login_redirect"</span>, <span class="hljs-string">""</span>)
    response = RedirectResponse(redirect_url)
    response.set_cookie(
        key=<span class="hljs-string">"access_token"</span>,
        value=access_token,
        httponly=<span class="hljs-literal">True</span>,
        secure=<span class="hljs-literal">True</span>,  <span class="hljs-comment"># Ensure you're using HTTPS</span>
        samesite=<span class="hljs-string">"strict"</span>,  <span class="hljs-comment"># Only send the cookie on same-site requests</span>
    )

    <span class="hljs-keyword">return</span> response
</code></pre>
<ol>
<li><strong>Initial Token Exchange:</strong></li>
</ol>
<pre><code class="lang-python">token = <span class="hljs-keyword">await</span> oauth.auth_demo.authorize_access_token(request)
</code></pre>
<ul>
<li><p>This exchanges the authorization code (received from Google after user consent) for an access token</p>
</li>
<li><p>It's the final step of the OAuth flow where Google confirms the user's authentication</p>
</li>
</ul>
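<p>For reference, the token dict returned by this exchange looks roughly like the sketch below. The values are made up; the field names follow Google&rsquo;s OAuth token response, and the <code>userinfo</code> key holds the decoded ID-token claims that Authlib adds.</p>

```python
token = {
    "access_token": "ya29.a0Af...",    # bearer token for calling Google APIs
    "expires_in": 3599,                # lifetime in seconds
    "token_type": "Bearer",
    "id_token": "eyJhbGciOi...",       # signed JWT carrying identity claims
    "userinfo": {                      # decoded id_token claims
        "iss": "https://accounts.google.com",
        "sub": "110169484474386276334",  # stable, unique Google user ID
        "email": "user@example.com",
    },
}
```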
<ol start="2">
<li><strong>Getting Additional User Info:</strong></li>
</ol>
<pre><code class="lang-python">user_info_endpoint = <span class="hljs-string">"https://www.googleapis.com/oauth2/v2/userinfo"</span>
headers = {<span class="hljs-string">"Authorization"</span>: <span class="hljs-string">f'Bearer <span class="hljs-subst">{token[<span class="hljs-string">"access_token"</span>]}</span>'</span>}
google_response = requests.get(user_info_endpoint, headers=headers)
user_info = google_response.json()
</code></pre>
<ul>
<li><p>Makes a separate API call to Google to get more detailed user information</p>
</li>
<li><p>Uses the access token to authorize this request</p>
</li>
<li><p>Returns additional details like user's name and profile picture</p>
</li>
</ul>
<ol start="3">
<li><strong>Extracting User Details:</strong></li>
</ol>
<pre><code class="lang-python">user = token.get(<span class="hljs-string">"userinfo"</span>)
expires_in = token.get(<span class="hljs-string">"expires_in"</span>)
user_id = user.get(<span class="hljs-string">"sub"</span>)
iss = user.get(<span class="hljs-string">"iss"</span>)
user_email = user.get(<span class="hljs-string">"email"</span>)
user_name = user_info.get(<span class="hljs-string">"name"</span>)
user_pic = user_info.get(<span class="hljs-string">"picture"</span>)
</code></pre>
<ul>
<li><p>Extracts various user details from both the token and user info response</p>
</li>
<li><p><code>sub</code> is the unique Google user ID</p>
</li>
<li><p><code>iss</code> is the issuer (should be Google)</p>
</li>
<li><p>Gets basic info like email, name, and profile picture</p>
</li>
</ul>
<ol start="4">
<li><strong>Security Validation:</strong></li>
</ol>
<pre><code class="lang-python"><span class="hljs-keyword">if</span> iss <span class="hljs-keyword">not</span> <span class="hljs-keyword">in</span> [<span class="hljs-string">"https://accounts.google.com"</span>, <span class="hljs-string">"accounts.google.com"</span>]:
    <span class="hljs-keyword">raise</span> HTTPException(status_code=<span class="hljs-number">401</span>, detail=<span class="hljs-string">"Google authentication failed."</span>)
</code></pre>
<ul>
<li><p>Verifies that the token was actually issued by Google</p>
</li>
<li><p>Important security check to prevent token forgery</p>
</li>
</ul>
<ol start="5">
<li><strong>Creating JWT Token:</strong></li>
</ol>
<pre><code class="lang-python">access_token_expires = timedelta(seconds=expires_in)
access_token = create_access_token(
    data={<span class="hljs-string">"sub"</span>: user_id, <span class="hljs-string">"email"</span>: user_email}, 
    expires_delta=access_token_expires
)
</code></pre>
<ul>
<li><p>Creates a JWT token containing the user's ID and email</p>
</li>
<li><p>Uses the same expiration time as the Google token</p>
</li>
<li><p>This JWT will be used for subsequent API calls</p>
</li>
</ul>
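<p>Under the hood, an HS256 JWT like the one <code>create_access_token</code> produces is just two base64url-encoded JSON segments plus an HMAC signature. Below is a stdlib-only illustration of that structure; the real code uses a JWT library (such as python-jose) rather than this hand-rolled version.</p>

```python
import base64, hashlib, hmac, json, time

def b64url(data: bytes) -> str:
    # JWT uses base64url encoding without padding
    return base64.urlsafe_b64encode(data).rstrip(b"=").decode()

def make_jwt(claims: dict, secret: str, expires_in: int) -> str:
    header = {"alg": "HS256", "typ": "JWT"}
    payload = {**claims, "exp": int(time.time()) + expires_in}
    signing_input = b64url(json.dumps(header).encode()) + "." + b64url(json.dumps(payload).encode())
    # The signature binds the header and payload to the secret key
    signature = hmac.new(secret.encode(), signing_input.encode(), hashlib.sha256).digest()
    return signing_input + "." + b64url(signature)

token = make_jwt({"sub": "110169...", "email": "user@example.com"}, "secret-key", 3599)
```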
<ol start="6">
<li><strong>Logging and Session Management:</strong></li>
</ol>
<pre><code class="lang-python">session_id = str(uuid.uuid4())
log_user(user_id, user_email, user_name, user_pic, first_logged_in, last_accessed)
log_token(access_token, user_email, session_id)
</code></pre>
<ul>
<li><p>Generates a unique session ID</p>
</li>
<li><p>Logs the user's information and token for tracking/auditing</p>
</li>
<li><p>Records when the user first logged in and last accessed the system</p>
</li>
</ul>
<p>Here are the <code>log_token</code> and <code>log_user</code> functions, which are simple database CRUD operations:</p>
<pre><code class="lang-python"><span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">log_user</span>(<span class="hljs-params">user_id, user_email, user_name, user_pic, first_logged_in, last_accessed</span>):</span>
    connection = <span class="hljs-literal">None</span>
    <span class="hljs-keyword">try</span>:
        connection = mysql.connector.connect(host=host, database=database, user=user, password=password)

        <span class="hljs-keyword">if</span> connection.is_connected():
            cursor = connection.cursor()
            sql_query = <span class="hljs-string">"""SELECT COUNT(*) from users WHERE email_id = %s"""</span>
            cursor.execute(sql_query, (user_email,))
            row_count = cursor.fetchone()[<span class="hljs-number">0</span>]

            <span class="hljs-keyword">if</span> row_count == <span class="hljs-number">0</span>:
                sql_query = <span class="hljs-string">"""INSERT INTO users (user_id, email_id, user_name, user_pic, first_logged_in, last_accessed) VALUES (%s, %s, %s, %s, %s, %s)"""</span>
                cursor.execute(sql_query, (user_id, user_email, user_name, user_pic, first_logged_in, last_accessed))

            <span class="hljs-comment"># Commit changes</span>
            connection.commit()

    <span class="hljs-keyword">except</span> Error <span class="hljs-keyword">as</span> e:
        print(<span class="hljs-string">"Error while connecting to MySQL"</span>, e)
    <span class="hljs-keyword">finally</span>:
        <span class="hljs-comment"># Guard against a failed connect, where `connection` is still None</span>
        <span class="hljs-keyword">if</span> connection <span class="hljs-keyword">is</span> <span class="hljs-keyword">not</span> <span class="hljs-literal">None</span> <span class="hljs-keyword">and</span> connection.is_connected():
            cursor.close()
            connection.close()

<span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">log_token</span>(<span class="hljs-params">access_token, user_email, session_id</span>):</span>
    connection = <span class="hljs-literal">None</span>
    <span class="hljs-keyword">try</span>:
        connection = mysql.connector.connect(host=host, database=database, user=user, password=password)

        <span class="hljs-keyword">if</span> connection.is_connected():
            cursor = connection.cursor()

            <span class="hljs-comment"># SQL query to insert data</span>
            sql_query = <span class="hljs-string">"""INSERT INTO issued_tokens (token, email_id, session_id) VALUES (%s, %s, %s)"""</span>
            <span class="hljs-comment"># Execute the SQL query</span>
            cursor.execute(sql_query, (access_token, user_email, session_id))

            <span class="hljs-comment"># Commit changes</span>
            connection.commit()

    <span class="hljs-keyword">except</span> Error <span class="hljs-keyword">as</span> e:
        print(<span class="hljs-string">"Error while connecting to MySQL"</span>, e)
    <span class="hljs-keyword">finally</span>:
        <span class="hljs-comment"># Guard against a failed connect, where `connection` is still None</span>
        <span class="hljs-keyword">if</span> connection <span class="hljs-keyword">is</span> <span class="hljs-keyword">not</span> <span class="hljs-literal">None</span> <span class="hljs-keyword">and</span> connection.is_connected():
            cursor.close()
            connection.close()
            logger.info(<span class="hljs-string">"MySQL connection is closed"</span>)
</code></pre>
<ol start="7">
<li><strong>Setting up Response:</strong></li>
</ol>
<pre><code class="lang-python">redirect_url = request.session.pop(<span class="hljs-string">"login_redirect"</span>, <span class="hljs-string">""</span>)
response = RedirectResponse(redirect_url)
response.set_cookie(
    key=<span class="hljs-string">"access_token"</span>,
    value=access_token,
    httponly=<span class="hljs-literal">True</span>,
    secure=<span class="hljs-literal">True</span>,
    samesite=<span class="hljs-string">"strict"</span>,
)
</code></pre>
<ul>
<li><p>Gets the redirect URL that was stored before starting the OAuth flow</p>
</li>
<li><p>Creates a response that will redirect the user back to the original page</p>
</li>
<li><p>Sets the JWT token as an HTTP-only cookie with security flags:</p>
<ul>
<li><p><code>httponly</code>: Prevents JavaScript access to the cookie</p>
</li>
<li><p><code>secure</code>: Cookie only sent over HTTPS</p>
</li>
<li><p><code>samesite="strict"</code>: Prevents CSRF attacks by only sending cookie to same site</p>
</li>
</ul>
</li>
</ul>
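<p>For the curious, the <code>set_cookie</code> call above ultimately produces a <code>Set-Cookie</code> response header. The stdlib <code>http.cookies</code> module can illustrate what that header contains (the token value here is made up):</p>

```python
from http.cookies import SimpleCookie

cookie = SimpleCookie()
cookie["access_token"] = "eyJhbGciOi..."
cookie["access_token"]["httponly"] = True      # hide from document.cookie
cookie["access_token"]["secure"] = True        # send over HTTPS only
cookie["access_token"]["samesite"] = "Strict"  # same-site requests only

# Render the header line a server would send
header = cookie.output(header="Set-Cookie:")
```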
<hr />
<p>Now that we have understood the entire login and authentication process, it is time to revisit some of the values we left empty earlier.</p>
<ul>
<li><p>While we were registering the OAuth client, we left the redirect_uri parameter empty, and we also left the <code>REDIRECT_URL</code> variable empty in the .env file. This will be assigned the URL of our auth endpoint: after a successful login with Google, control should shift to the auth endpoint so we can perform the actions discussed above. So it will be set to <code>http://127.0.0.1:8000/auth</code>.</p>
</li>
<li><p>We will be testing the login functionality via the FastAPI Swagger docs. If we run the server on port 8000, they can be accessed at <code>http://127.0.0.1:8000/docs</code>. Since we want to be redirected to this page after login and authentication, we will set <code>FRONTEND_URL</code> to <code>http://127.0.0.1:8000/docs</code>.</p>
<p>  This is how the final .env file looks:</p>
</li>
<li><pre><code class="lang-python">GOOGLE_CLIENT_ID=&lt;your-google-client-id&gt;
GOOGLE_CLIENT_SECRET=&lt;your-google-client-secret&gt;
REDIRECT_URL=http://<span class="hljs-number">127.0</span><span class="hljs-number">.0</span><span class="hljs-number">.1</span>:<span class="hljs-number">8000</span>/auth
JWT_SECRET_KEY=<span class="hljs-string">"secret-key"</span>
FRONTEND_URL=http://<span class="hljs-number">127.0</span><span class="hljs-number">.0</span><span class="hljs-number">.1</span>:<span class="hljs-number">8000</span>/docs
</code></pre>
</li>
<li><p>In the Google Cloud Console we have kept two fields empty while creating our OAuth credentials: <strong>Authorised JavaScript Origins</strong> and <strong>Authorised redirect URIs</strong>.</p>
<ul>
<li><p>The <strong>Authorised JavaScript Origins</strong> is the source from which you are trying to log in and authorize with Google. In our case, we request login from localhost, so it will be set to <code>http://127.0.0.1:8000</code></p>
</li>
<li><p>The <strong>Authorised redirect URIs</strong> are the URLs Google is allowed to hand control to after the login process completes. Since we want Google to shift control to the auth endpoint, we must explicitly list it as an authorised redirect URI so that Google allows the request instead of blocking it. If we don&rsquo;t specify this, the post-login request will be blocked, because Google hands the access token and sensitive information over to authorised URIs only.</p>
<p>  This is how your credentials configuration should look:</p>
<p>  <img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1735419247864/09d544f7-f600-412a-b610-c09ccd961261.png" alt class="image--center mx-auto" /></p>
</li>
</ul>
</li>
</ul>
<hr />
<h2 id="heading-step-6-create-the-api-endpoint-and-secure-it-with-google-auth-dependency">Step 6: Create the API endpoint and Secure it with Google Auth Dependency</h2>
<p>For securing the endpoint, we will be required to add a Dependency on the endpoint parameters. This dependency will be the access token which will only be available post login. Let’s create the dependency first. We will create this dependency in the <code>auth.py</code> file.</p>
<pre><code class="lang-python"><span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">get_current_user</span>(<span class="hljs-params">token: str = Cookie(<span class="hljs-params">None</span>, alias=<span class="hljs-string">"access_token"</span>)</span>):</span>  <span class="hljs-comment"># read the "access_token" cookie set at login</span>
    <span class="hljs-keyword">if</span> <span class="hljs-keyword">not</span> token:
        <span class="hljs-keyword">raise</span> HTTPException(status_code=<span class="hljs-number">401</span>, detail=<span class="hljs-string">"Not authenticated"</span>)

    credentials_exception = HTTPException(
        status_code=<span class="hljs-number">401</span>,
        detail=<span class="hljs-string">"Could not validate credentials"</span>,
        headers={<span class="hljs-string">"WWW-Authenticate"</span>: <span class="hljs-string">"Bearer"</span>},
    )
    <span class="hljs-keyword">try</span>:
        payload = jwt.decode(token, SECRET_KEY, algorithms=[ALGORITHM])

        user_id: str = payload.get(<span class="hljs-string">"sub"</span>)
        user_email: str = payload.get(<span class="hljs-string">"email"</span>)

        <span class="hljs-keyword">if</span> user_id <span class="hljs-keyword">is</span> <span class="hljs-literal">None</span> <span class="hljs-keyword">or</span> user_email <span class="hljs-keyword">is</span> <span class="hljs-literal">None</span>:
            <span class="hljs-keyword">raise</span> credentials_exception

        <span class="hljs-keyword">return</span> {<span class="hljs-string">"user_id"</span>: user_id, <span class="hljs-string">"user_email"</span>: user_email}

    <span class="hljs-keyword">except</span> ExpiredSignatureError:
        <span class="hljs-comment"># Specifically handle expired tokens</span>
        traceback.print_exc()
        <span class="hljs-keyword">raise</span> HTTPException(status_code=status.HTTP_401_UNAUTHORIZED, detail=<span class="hljs-string">"Session expired. Please login again."</span>)
    <span class="hljs-keyword">except</span> JWTError:
        <span class="hljs-comment"># Handle other JWT-related errors</span>
        traceback.print_exc()
        <span class="hljs-keyword">raise</span> credentials_exception
    <span class="hljs-keyword">except</span> Exception <span class="hljs-keyword">as</span> e:
        traceback.print_exc()
        <span class="hljs-keyword">raise</span> HTTPException(status_code=<span class="hljs-number">401</span>, detail=<span class="hljs-string">"Not Authenticated"</span>)

<span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">validate_user_request</span>(<span class="hljs-params">token: str = Cookie(<span class="hljs-params">None</span>, alias=<span class="hljs-string">"access_token"</span>)</span>):</span>
    session_details = get_current_user(token)

    <span class="hljs-keyword">return</span> session_details
</code></pre>
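<p>A quick way to see what this dependency works with: the JWT payload is plain base64url-encoded JSON, so its <code>exp</code> claim can be inspected directly. Signature verification is omitted in this sketch; the real dependency relies on <code>jwt.decode</code> with the <code>SECRET_KEY</code> for that.</p>

```python
import base64, json, time

def jwt_payload(token: str) -> dict:
    # Decode the middle segment of header.payload.signature,
    # restoring the base64 padding that JWTs strip
    seg = token.split(".")[1]
    return json.loads(base64.urlsafe_b64decode(seg + "=" * (-len(seg) % 4)))

def is_expired(token: str) -> bool:
    # Compare the exp claim (Unix seconds) against the current time
    return time.time() > jwt_payload(token).get("exp", 0)
```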
<p>We will now define the chat endpoint in the <code>chat.py</code> file.</p>
<pre><code class="lang-python"><span class="hljs-keyword">from</span> fastapi <span class="hljs-keyword">import</span> Depends, APIRouter
<span class="hljs-keyword">from</span> auth <span class="hljs-keyword">import</span> get_current_user

router = APIRouter()

<span class="hljs-meta">@router.get("/chat")</span>
<span class="hljs-keyword">async</span> <span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">get_response</span>(<span class="hljs-params">current_user: dict = Depends(<span class="hljs-params">get_current_user</span>)</span>):</span>
    <span class="hljs-keyword">return</span> {<span class="hljs-string">"message"</span>: <span class="hljs-string">"Welcome!"</span>, <span class="hljs-string">"user"</span>: current_user}
</code></pre>
<hr />
<h2 id="heading-step-7-building-the-fastapi-app">Step 7: Building the FastAPI App</h2>
<p>Now that we have the authentication endpoint and chat endpoint ready, we can now setup our FastAPI app in the root of the project, in the <code>api.py</code> file.</p>
<pre><code class="lang-python"><span class="hljs-keyword">import</span> os
<span class="hljs-keyword">from</span> fastapi <span class="hljs-keyword">import</span> FastAPI, Header, HTTPException, Depends, Request
<span class="hljs-keyword">from</span> starlette.config <span class="hljs-keyword">import</span> Config
<span class="hljs-keyword">from</span> fastapi.middleware.cors <span class="hljs-keyword">import</span> CORSMiddleware
<span class="hljs-keyword">from</span> starlette.middleware.sessions <span class="hljs-keyword">import</span> SessionMiddleware
<span class="hljs-keyword">from</span> apis <span class="hljs-keyword">import</span> auth, chat
<span class="hljs-keyword">import</span> time, logging, requests

<span class="hljs-keyword">from</span> dotenv <span class="hljs-keyword">import</span> load_dotenv
load_dotenv(override=<span class="hljs-literal">True</span>)

config = Config(<span class="hljs-string">".env"</span>)

<span class="hljs-comment"># Logger used by the timing middleware below</span>
logging.basicConfig(level=logging.INFO)
logger = logging.getLogger(__name__)

<span class="hljs-comment"># Create the app before attaching middleware and routers</span>
app = FastAPI()

app.add_middleware(
    CORSMiddleware,
    allow_origins=[<span class="hljs-string">"*"</span>],  
    allow_credentials=<span class="hljs-literal">True</span>,
    allow_methods=[<span class="hljs-string">"*"</span>],
    allow_headers=[<span class="hljs-string">"*"</span>],
    expose_headers=[<span class="hljs-string">"*"</span>]
)

<span class="hljs-comment"># Add Session middleware (reuses the JWT secret from the .env file)</span>
app.add_middleware(SessionMiddleware, secret_key=config(<span class="hljs-string">"JWT_SECRET_KEY"</span>))

<span class="hljs-comment"># Log the time taken for each API request</span>
<span class="hljs-meta">@app.middleware("http")</span>
<span class="hljs-keyword">async</span> <span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">log_response_time</span>(<span class="hljs-params">request: Request, call_next</span>):</span>
    start_time = time.time()
    response = <span class="hljs-keyword">await</span> call_next(request)
    process_time = time.time() - start_time
    logger.info(<span class="hljs-string">f"Request: <span class="hljs-subst">{request.url.path}</span> completed in <span class="hljs-subst">{process_time:<span class="hljs-number">.4</span>f}</span> seconds"</span>)
    <span class="hljs-keyword">return</span> response 

app.include_router(chat.router)
app.include_router(auth.router)

<span class="hljs-keyword">if</span> __name__ == <span class="hljs-string">"__main__"</span>:
    <span class="hljs-keyword">import</span> uvicorn
    <span class="hljs-keyword">import</span> nest_asyncio
    nest_asyncio.apply()
    uvicorn.run(app, host=<span class="hljs-string">"0.0.0.0"</span>, port=<span class="hljs-number">8000</span>)
</code></pre>
<hr />
<h2 id="heading-step-8-running-and-testing-the-application">Step 8: Running and Testing the Application</h2>
<p>Run your application using Uvicorn:</p>
<pre><code class="lang-bash">uvicorn api:app --reload
</code></pre>
<ol>
<li><p><strong>Login Endpoint:</strong> Visit <code>/login</code> to authenticate with Google.</p>
</li>
<li><p><strong>Chat Endpoint:</strong> Use <code>/chat</code> to check whether you can see your details after authentication.</p>
<p> If everything was done correctly, your FastAPI app will behave the same way as the demo below.</p>
</li>
</ol>
<div class="embed-wrapper"><div class="embed-loading"><div class="loadingRow"></div><div class="loadingRow"></div></div><a class="embed-card" href="https://youtu.be/630ZLpP6avA">https://youtu.be/630ZLpP6avA</a></div>
<p>If you were able to set up Google Auth, integrate it with FastAPI, and authenticate your endpoints, then kudos: you have understood the whole process and built a working FastAPI application backed by Google authentication!</p>
<hr />
<h2 id="heading-step-9-deploying-your-application">Step 9: Deploying your Application</h2>
<p>Now that we have successfully set up our FastAPI application locally and integrated Google OAuth with it, it&rsquo;s time to move one step ahead and deploy it on a VM. These APIs will be consumed by the frontend application, so they have to be accessible, which makes deployment a crucial step.</p>
<p>You can check out this beginner-friendly video, <a target="_blank" href="https://youtu.be/7FVPn25mmEQ?si=5p4IhP7u9A5nu2N0">Deploy FastAPI &amp; Open AI ChatGPT on AWS EC2</a>, to learn how to deploy your FastAPI application on an AWS EC2 instance and get it up and running.</p>
<p>Now that our FastAPI application is up and running on the VM (for this tutorial, assume the app runs on <strong>port 8000</strong>), it is time to configure our domain and Nginx, two crucial steps to complete this backend deployment. Since this API will be consumed by the frontend, we have to make sure the frontend can make requests to it and get responses.</p>
<h2 id="heading-domain-mapping-and-ssl-certificate-generation">Domain Mapping and SSL Certificate Generation</h2>
<p>The first step in this configuration is to map our deployed endpoint to a domain and have an SSL certificate assigned to it.</p>
<h3 id="heading-why-our-api-needs-domain-mapping-and-ssl">Why Our API Needs Domain Mapping and SSL</h3>
<p>Modern browsers block direct IP access to protect users from potential security threats.</p>
<ul>
<li><p>Domain names provide a verifiable identity for your service. SSL certificates are issued to domains, not IPs, because domains can be validated through a chain of trust (Domain Registrar → Certificate Authority → Your Server). IPs can change hands frequently, making them unreliable for identity verification.</p>
</li>
<li><p>Modern web browsers enforce HTTPS for security-critical features. They require:</p>
<ul>
<li><p>A valid domain name (not an IP address)</p>
</li>
<li><p>An SSL certificate from a trusted authority</p>
</li>
<li><p>A match between the domain name and SSL certificate</p>
</li>
</ul>
</li>
</ul>
<p>Without these, browsers display warning messages and may block access entirely, especially for features like secure cookies, service workers, or HTTP/2 connections.</p>
<p>Consider accessing a bank's website. We trust <a target="_blank" href="https://mybank.com"><code>https://mybank.com</code></a> because:</p>
<ul>
<li><p>The domain ownership is verified</p>
</li>
<li><p>The SSL certificate confirms you're connecting to the real bank</p>
</li>
<li><p>Your data is encrypted during transmission</p>
</li>
</ul>
<p>If the bank used just an IP (like <a target="_blank" href="https://193.168.1.1"><code>https://193.168.1.1</code></a>), we'd have no way to verify its authenticity, making it vulnerable to impersonation attacks.</p>
<p>Configuring this is straightforward. Follow these steps and we will be good to go:</p>
<ol>
<li><p>The first step is to obtain a domain to map your IP to. For example, we already have a domain named <code>sample-auth-app.futuresmartai.com</code>, and it is mapped to the IP of the VM where the FastAPI application is hosted.</p>
</li>
<li><p>Next, we want to ensure that any request reaching this domain name is forwarded to our FastAPI application. To make this happen, we add some Nginx configuration on the VM where the FastAPI app is hosted.</p>
<ul>
<li><p>Open your EC2 instance terminal, then update the package index and install Nginx by running these two commands in order: <code>sudo apt update</code>, <code>sudo apt install nginx</code>.</p>
</li>
<li><p>Move to the directory <code>/etc/nginx/sites-available</code></p>
<p>  <img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1736407597633/b11824a1-86de-47d1-8c0b-ec1156a82cb7.png" alt class="image--center mx-auto" /></p>
</li>
<li><p>Create a file for the Nginx configuration by running <code>sudo nano appconfig</code>. This creates a file named appconfig and opens it in the nano editor, where we can put our configuration and save it.</p>
<p>  <img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1736407887945/51e11448-ea7f-4ad1-a41c-c77c810f1da8.png" alt class="image--center mx-auto" /></p>
</li>
<li><p>Now we can directly put our configuration in this file. Below is the NGINX config we will be using</p>
<pre><code class="lang-nginx">  <span class="hljs-section">server</span> {
      <span class="hljs-attribute">listen</span> <span class="hljs-number">80</span>;
      <span class="hljs-attribute">server_name</span> sample-auth-app.futuresmartai.com;

      <span class="hljs-attribute">location</span> / {
          <span class="hljs-attribute">proxy_pass</span> http://127.0.0.1:8000; <span class="hljs-comment"># Forward requests to the backend server.</span>
          <span class="hljs-attribute">proxy_set_header</span> Host <span class="hljs-variable">$host</span>; <span class="hljs-comment"># Pass the original host header to the backend.</span>
          <span class="hljs-attribute">proxy_set_header</span> X-Real-IP <span class="hljs-variable">$remote_addr</span>; <span class="hljs-comment"># Send the actual client IP to the backend.</span>
          <span class="hljs-attribute">proxy_set_header</span> X-Forwarded-For <span class="hljs-variable">$proxy_add_x_forwarded_for</span>; <span class="hljs-comment"># Include client IPs in forwarded requests.</span>
          <span class="hljs-attribute">proxy_set_header</span> X-Forwarded-Proto <span class="hljs-variable">$scheme</span>; <span class="hljs-comment"># Indicate the original request protocol (HTTP/HTTPS).</span>
      }
  }
</code></pre>
  <div data-node-type="callout">
  <div data-node-type="callout-emoji">💡</div>
  <div data-node-type="callout-text">These headers (Host, X-Real-IP, X-Forwarded-For) are critical for passing client IP addresses and cookies to the backend. Without them, backend logs will show the Nginx IP instead of the client’s IP, and client cookies may not work correctly.</div>
  </div>
</li>
<li><p>Paste this into the appconfig file and save it with Ctrl + O followed by Enter. Finally, press Ctrl + X to exit the editor.</p>
</li>
<li><p>We can now verify the file's contents using <code>cat appconfig</code>:</p>
<p>  <img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1736408120715/fdfe2ea4-a6cb-4f65-a31a-f2b98fed5b5f.png" alt class="image--center mx-auto" /></p>
</li>
<li><p>Next, we have to create a symbolic link from the Nginx file in the sites-available directory using: <code>sudo ln -s /etc/nginx/sites-available/appconfig /etc/nginx/sites-enabled/</code>. This is a best practice in Nginx server configuration because:</p>
<ol>
<li><p>'sites-available' stores all our server configurations</p>
</li>
<li><p>'sites-enabled' only contains configurations that should be currently active</p>
</li>
<li><p>The symbolic link allows us to easily enable or disable sites without duplicating files</p>
</li>
<li><p>When Nginx starts, it only reads configurations from 'sites-enabled'</p>
</li>
</ol>
</li>
<li><p>Once this is done, verify that the Nginx configuration contains no errors by running: <code>sudo nginx -t</code></p>
</li>
<li><p>If the test succeeds, restart Nginx so the changes take effect: <code>sudo systemctl restart nginx</code></p>
</li>
</ul>
</li>
<li><p>That's it! We can now access our FastAPI application on the mapped domain: <code>sample-auth-app.futuresmart.ai/docs</code></p>
</li>
</ol>
<hr />
<h2 id="heading-common-mistakes">Common Mistakes</h2>
<p>We have successfully built a FastAPI backend application using Google OAuth, deployed it to a VM, and mapped it to a certified domain name. When developing the frontend application that consumes these APIs, there are some common mistakes that can be avoided if the frontend and backend adjust certain configurations:</p>
<h3 id="heading-cross-domain-cookie-access">Cross-Domain Cookie Access</h3>
<p>This is one of the most common mistakes. We have run into it across many of our applications and now treat it as a core issue to address in every project.</p>
<p>Our FastAPI application is hosted on an EC2 instance and mapped to a domain. The frontend application will similarly be deployed on an instance and mapped to its own domain. Because the frontend and backend domains are completely different, sharing the cookie becomes a problem:</p>
<ul>
<li><p>When we are accessing the /login endpoint, we have seen that it verifies the identity via Google, creates an access token and sets it as a Cookie in the browser (more specifically in the domain).</p>
</li>
<li><p>If the frontend sends a request to the /login endpoint, even after the entire authentication process, the token will be set as a cookie on the sample-auth-app.futuresmart.ai domain, not on the domain the frontend actually makes the request from.</p>
</li>
<li><p>As a result, the frontend has no access to the cookie, which is required both to access the /chat endpoint and to fetch user information.</p>
</li>
</ul>
<p>The solution is straightforward: use a proxy on the frontend so that the cookie is set on the same domain the request is sent from.</p>
<ul>
<li><p>The frontend developer has to configure their Nginx setup so that the login request from the frontend is not made directly to the backend APIs.</p>
</li>
<li><p>They have to set up a proxy so that the request from the frontend is sent to the frontend domain's /login endpoint, which then forwards it to the actual backend API.</p>
<ul>
<li><p>Let’s understand this with a simple example. Suppose our frontend is at <code>sample-auth-app-frontend.futuresmart.ai</code>. Any request from this domain is made via <code>sample-auth-app-frontend.futuresmart.ai/login</code>. The proxy configuration must redirect requests arriving at <code>sample-auth-app-frontend.futuresmart.ai/login</code> to the actual backend API, <code>sample-auth-app.futuresmart.ai/login</code>.</p>
</li>
<li><p>In the backend, the REDIRECT_URL in the .env file, as well as in the oauth object, also has to be changed to <code>sample-auth-app-frontend.futuresmart.ai/auth</code>. This ensures that when the login endpoint redirects its authenticated result, it again goes via the frontend and then reaches the backend <code>/auth</code> route.</p>
</li>
<li><p>This ensures that communication always happens via the frontend, and hence the cookie is set on the same domain the request is sent from: <code>sample-auth-app-frontend.futuresmart.ai</code>.</p>
</li>
</ul>
</li>
</ul>
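<p>As an illustration, the frontend-side proxy described above could look like the following Nginx snippet. This is a minimal sketch, assuming the example domains used in this section and a frontend server on port 3000 (a placeholder); the exact setup depends on how the frontend is hosted:</p>
<pre><code class="lang-nginx">server {
    listen 80;
    server_name sample-auth-app-frontend.futuresmart.ai;

    # Auth routes are proxied to the backend so the cookie is set on THIS domain
    location ~ ^/(login|auth) {
        proxy_pass https://sample-auth-app.futuresmart.ai;
        proxy_ssl_server_name on;  # Needed when proxying to an HTTPS upstream by name
        proxy_set_header Host sample-auth-app.futuresmart.ai;
        proxy_set_header X-Forwarded-For $proxy_add_x_forwarded_for;
    }

    # Everything else serves the frontend app itself
    location / {
        proxy_pass http://127.0.0.1:3000;  # Placeholder port for the frontend server
    }
}
</code></pre>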
<h3 id="heading-cross-origin-resource-sharing-cors-issue">Cross-Origin Resource Sharing (CORS) Issue</h3>
<p>This is another very common issue we have faced in our projects. During local testing, we tend to allow all origins to access the APIs, but in production we must explicitly allow only the origins/domains that should have access. In our case, only the frontend domain should be able to call our APIs, so we set up the CORS middleware as follows:</p>
<pre><code class="lang-python">app.add_middleware(
    CORSMiddleware,
    allow_origins=[<span class="hljs-string">"https://sample-auth-app-frontend.futuresmart.ai"</span>],
    allow_credentials=<span class="hljs-literal">True</span>,
    allow_methods=[<span class="hljs-string">"*"</span>],
    allow_headers=[<span class="hljs-string">"*"</span>],
    expose_headers=[<span class="hljs-string">"*"</span>]
)
</code></pre>
<p>This keeps our application secure and protected from unauthorized access.</p>
<hr />
<h2 id="heading-conclusion">Conclusion</h2>
<p>In this tutorial, we've built a FastAPI application and integrated it with Google Authentication. The system showcases advanced concepts like JWT in FastAPI, OAuth 2.0 implementation, and cookie-based session and user management, all of which we have already implemented and successfully delivered in numerous enterprise solutions.</p>
<p>This architecture provides several advantages:</p>
<ol>
<li><p><strong>Security</strong>: Combines industry-standard JWT tokens, secure cookies, and Google's robust authentication system to create a multi-layered security approach protecting user data and preventing common attack vectors.</p>
</li>
<li><p><strong>Maintainability</strong>: Implements a clean, middleware-based architecture that centralizes authentication logic and makes it easy to protect new routes while keeping the codebase organized and manageable.</p>
</li>
<li><p><strong>Scalability</strong>: Leverages stateless JWT tokens and Google's infrastructure to create an authentication system that can easily scale across multiple servers and services without additional complexity.</p>
</li>
</ol>
<p>If you found this guide helpful and want to explore more advanced backend development techniques, check out our other <a target="_blank" href="https://blog.futuresmart.ai/"><strong>tutorials</strong></a>. If you want to know what skills truly matter to become a successful software developer or AI engineer, check out our blog on <a target="_blank" href="https://blog.futuresmart.ai/must-have-skills-for-upcoming-software-developers-and-ai-engineers-in-2025"><em>Must-Have Skills for Upcoming Software Developers and AI Engineers in 2025</em></a><em>.</em></p>
<p>At <a target="_blank" href="https://www.futuresmart.ai/">FutureSmart AI</a>, we specialize in building state-of-the-art AI solutions backed by scalable authentication systems and backend infrastructure for modern web applications. We've successfully implemented diverse authentication architectures across industries, from SaaS platforms to enterprise applications, while also specializing in AI integration by building numerous applications that incorporate Langchain, Langgraph, and ChatGPT through FastAPI frameworks.</p>
<p>For custom implementation support or consultations, feel free to reach out to us at <a target="_blank" href="mailto:contact@futuresmart.ai"><strong>contact@futuresmart.ai</strong></a>. For real-world examples of our work, visit our <a target="_blank" href="https://www.futuresmart.ai/case-studies">case studies</a> where we showcase how our expertise in FastAPI, OAuth integration, and security best practices has helped businesses build robust authentication systems.</p>
<p>Stay tuned for our next tutorial in this series, where we'll dive into building the front-end for this system, accessing the API, and developing an end-to-end full-stack application!</p>
<h2 id="heading-resources"><strong>Resources</strong></h2>
<div data-node-type="callout">
<div data-node-type="callout-emoji">💡</div>
<div data-node-type="callout-text"><a target="_self" href="https://github.com/PradipNichite/FutureSmart-AI-Blog/tree/main/Google%20OAuth%20Integration%20with%20FastAPI"><strong>Get the Full Code in our GitHub</strong></a></div>
</div>]]></content:encoded></item><item><title><![CDATA[Cliniko API and Python: How We Built a Custom AI Solution for Clinics]]></title><description><![CDATA[Introduction
In today’s fast-paced medical environment, healthcare professionals juggle patient care with administrative tasks that often slow them down. While tools like Cliniko have streamlined practice management—handling scheduling, patient recor...]]></description><link>https://blog.futuresmart.ai/cliniko-api-and-python-how-we-built-a-custom-ai-solution-for-clinics</link><guid isPermaLink="true">https://blog.futuresmart.ai/cliniko-api-and-python-how-we-built-a-custom-ai-solution-for-clinics</guid><category><![CDATA[Cliniko]]></category><category><![CDATA[Python]]></category><category><![CDATA[APIs]]></category><category><![CDATA[AI-automation]]></category><category><![CDATA[Healthcare AI]]></category><dc:creator><![CDATA[Pruthviraj Mahalunge]]></dc:creator><pubDate>Tue, 14 Jan 2025 06:24:31 GMT</pubDate><enclosure url="https://cdn.hashnode.com/res/hashnode/image/upload/v1736835385972/83c1002d-cbef-4225-b10c-a44b24aab5f9.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h2 id="heading-introduction"><strong>Introduction</strong></h2>
<p>In today’s fast-paced medical environment, healthcare professionals juggle patient care with administrative tasks that often slow them down. While tools like <a target="_blank" href="https://www.cliniko.com/"><strong>Cliniko</strong></a> have streamlined practice management—handling scheduling, patient records, and billing—there’s still room to make workflows even more efficient with <a target="_blank" href="https://www.futuresmart.ai/case-studies"><strong>custom AI-powered solutions</strong></a><strong>.</strong></p>
<p>At <a target="_blank" href="https://www.futuresmart.ai/"><strong>FutureSmart AI</strong></a>, we specialize in building <strong>AI-driven automation, NLP-based assistants, and workflow optimization solutions</strong> that integrate seamlessly with existing platforms. In this blog, we not only explore how to <strong>leverage Cliniko APIs with Python</strong> but also showcase how we helped a client build a <a target="_blank" href="https://www.abby.clinic/"><strong>custom AI solution on top of Cliniko</strong></a>, automating key processes for clinics and improving efficiency.</p>
<h2 id="heading-1-understanding-cliniko-apis"><strong>1. Understanding Cliniko APIs</strong></h2>
<h3 id="heading-what-is-cliniko"><strong>What is Cliniko?</strong></h3>
<p>Cliniko is cloud-based software designed for comprehensive medical practice management. It combines flexibility and ease of use to streamline daily operations. The platform excels in the following areas:</p>
<ul>
<li><p>Appointment scheduling via a customizable calendar.</p>
</li>
<li><p>Patient management, including medical histories and communication records.</p>
</li>
<li><p>Billing and invoicing with integrated payment tracking.</p>
</li>
<li><p>Multi-location support for organizations operating across various sites.</p>
</li>
</ul>
<h3 id="heading-api-overview"><strong>API Overview</strong></h3>
<p>The Cliniko API enables developers to programmatically access and manipulate data. Here are some key capabilities:</p>
<ul>
<li><p>Retrieve, create, or update appointment records.</p>
</li>
<li><p>Access detailed patient data.</p>
</li>
<li><p>Manage practitioner and business information.</p>
</li>
</ul>
<p>The API’s flexibility allows for custom integrations that cater to unique business workflows, making it a powerful tool for healthcare organizations.</p>
<hr />
<h2 id="heading-2-setting-up-access"><strong>2. Setting Up Access</strong></h2>
<h3 id="heading-1-generate-api-key"><strong>1. Generate API Key</strong></h3>
<p>To generate a Cliniko API key, follow the steps below:</p>
<ul>
<li>Log into your Cliniko account.</li>
</ul>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1736833694345/9c67f4c8-7992-42a7-84aa-f810dce03b16.png" alt class="image--center mx-auto" /></p>
<ul>
<li>Navigate to <strong>My Info</strong> and then go to the <strong>Manage API Keys</strong> section.</li>
</ul>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1736833701280/91b6615a-e270-484c-a356-b51fe5222c49.png" alt class="image--center mx-auto" /></p>
<ul>
<li>Click <strong>Add an API key</strong> to create a new one, which will allow you to interact with Cliniko's API.</li>
</ul>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1736833712564/6e39d9ff-58d5-4de5-b999-a5c1eef5c86b.png" alt class="image--center mx-auto" /></p>
<h3 id="heading-2-manage-permissions"><strong>2. Manage Permissions</strong></h3>
<p>After generating your API key, assign appropriate <strong>scopes</strong> based on the specific data you need access to. Each scope allows access to different parts of Cliniko’s system (appointments, patients, etc.). Be mindful to select only the required permissions for your needs.</p>
<h3 id="heading-3-understand-shards"><strong>3. Understand Shards</strong></h3>
<ul>
<li><p><strong>Shards</strong> represent geographically isolated instances of Cliniko’s system. When you create a new account, you’ll select a region for hosting your data.</p>
</li>
<li><p>Each shard is identified by a short code at the end of the API key (e.g., <code>au1</code>). This shard ID is crucial when constructing API URLs as it ensures proper routing of your requests to the correct server.</p>
<p>  <strong>Example API key:</strong></p>
</li>
</ul>
<pre><code class="lang-plaintext">MS0xLWl4SzYYYYdtR3V2HNOT..............-au1
</code></pre>
<p>In the above example, the shard is <code>au1</code>.</p>
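<p>Since the shard ID appears after the final hyphen of the key (as in the example above), it can be extracted in Python like this. A small sketch; the key below is a made-up placeholder:</p>
<pre><code class="lang-python">API_KEY = "MS0xLWl4SzY-au1"  # Placeholder key; real keys are much longer
shard = API_KEY.rsplit("-", 1)[-1]  # Everything after the last hyphen, e.g. "au1"
base_url = f"https://api.{shard}.cliniko.com/v1"
print(base_url)  # https://api.au1.cliniko.com/v1
</code></pre>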
<h3 id="heading-4-identifying-your-application"><strong>4. Identifying Your Application</strong></h3>
<p>To identify your application, you need to send the <code>User-Agent</code> header. In case of an issue, this will allow Cliniko to track down your requests and contact you. The format for this header should be:</p>
<pre><code class="lang-plaintext">APP_VENDOR_NAME (APP_VENDOR_EMAIL)
</code></pre>
<p>Where:</p>
<ul>
<li><p>APP_VENDOR_NAME is the name of your application.</p>
</li>
<li><p>APP_VENDOR_EMAIL is the contact email for you or your company.</p>
</li>
</ul>
<p><strong>Example of a valid</strong> <code>User-Agent</code> <strong>header:</strong></p>
<pre><code class="lang-plaintext">Really helpful app (contact@futuresmart.ai)
</code></pre>
<h3 id="heading-5-consult-documentation"><strong>5. Consult Documentation</strong></h3>
<p>Cliniko’s <a target="_blank" href="https://docs.api.cliniko.com/">https://docs.api.cliniko.com/</a> provides detailed information on available endpoints, parameters, and request examples. Reviewing this documentation will ensure you understand how to interact with Cliniko’s data programmatically.</p>
<hr />
<h2 id="heading-3-getting-started-with-python"><strong>3. Getting Started with Python</strong></h2>
<h3 id="heading-setting-up-the-environment"><strong>Setting Up the Environment</strong></h3>
<p>Start by preparing a Python environment. Here’s what you’ll need:</p>
<ul>
<li><p><strong>Python 3.x</strong></p>
</li>
<li><p>IDE: PyCharm, VSCode, or Jupyter Notebook.</p>
</li>
<li><p>Libraries: Install the necessary packages using pip:</p>
<pre><code class="lang-plaintext">  pip install requests
</code></pre>
</li>
</ul>
<h3 id="heading-authentication-with-cliniko-api"><strong>Authentication with Cliniko API</strong></h3>
<p>Cliniko uses API Key authentication for secure access. Below is a simple way to set it up:</p>
<pre><code class="lang-python"><span class="hljs-keyword">import</span> requests

API_KEY = <span class="hljs-string">"your_api_key_here"</span>
shard = API_KEY[<span class="hljs-number">-3</span>:]  <span class="hljs-comment"># Shard ID from the end of the key, e.g. "au1"</span>
headers = {<span class="hljs-string">"Accept"</span>: <span class="hljs-string">"application/json"</span>, <span class="hljs-string">"User-Agent"</span>: <span class="hljs-string">""</span>}  <span class="hljs-comment"># Fill in your User-Agent, e.g. "Your App (you@example.com)"</span>
username = API_KEY  <span class="hljs-comment"># Use the API key as the username</span>
password = <span class="hljs-string">""</span>  <span class="hljs-comment"># Password is left blank</span>

url = <span class="hljs-string">f"https://api.<span class="hljs-subst">{shard}</span>.cliniko.com/v1/bookings/"</span>

response = requests.get(url, headers=headers, auth=(username,password))

<span class="hljs-keyword">if</span> response.status_code == <span class="hljs-number">200</span>:
    bookings = response.json()
    print(<span class="hljs-string">"Bookings fetched successfully:"</span>, bookings)
<span class="hljs-keyword">else</span>:
    print(<span class="hljs-string">f"Failed to fetch bookings. Status code: <span class="hljs-subst">{response.status_code}</span>, Response: <span class="hljs-subst">{response.text}</span>"</span>)
</code></pre>
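<p>Since every request in the following sections repeats the same URL and header boilerplate, it can help to wrap it in small helpers. This is a sketch under the assumptions above; the function names are our own convenience, not part of any Cliniko SDK:</p>
<pre><code class="lang-python">def cliniko_url(shard, path):
    """Build a Cliniko API URL for a given shard and resource path."""
    return f"https://api.{shard}.cliniko.com/v1/{path.lstrip('/')}"

def cliniko_headers(app_name, contact_email):
    """Headers Cliniko expects: JSON responses plus an identifying User-Agent."""
    return {
        "Accept": "application/json",
        "User-Agent": f"{app_name} ({contact_email})",
    }
</code></pre>
<p>For example, <code>cliniko_url("au1", "patients")</code> returns <code>https://api.au1.cliniko.com/v1/patients</code>, which can then be passed to <code>requests.get</code> together with <code>auth=(API_KEY, "")</code>.</p>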
<hr />
<h2 id="heading-4-making-basic-api-requests"><strong>4. Making Basic API Requests</strong></h2>
<h3 id="heading-fetching-appointment-data"><strong>Fetching Appointment Data</strong></h3>
<pre><code class="lang-python">url = <span class="hljs-string">f"https://api.<span class="hljs-subst">{shard}</span>.cliniko.com/v1/bookings"</span>
response = requests.get(url, headers=headers, auth=(username,password))
<span class="hljs-keyword">if</span> response.status_code == <span class="hljs-number">200</span>:
    appointments = response.json()
    print(<span class="hljs-string">"Appointments fetched successfully:"</span>, appointments)
<span class="hljs-keyword">else</span>:
    print(<span class="hljs-string">"Failed to fetch appointments. Status code:"</span>, response.status_code)
</code></pre>
<h3 id="heading-fetching-individual-appointment-data"><strong>Fetching Individual Appointment Data</strong></h3>
<pre><code class="lang-python">booking_id = <span class="hljs-string">"12345"</span>  <span class="hljs-comment"># Replace with a real booking ID</span>
url = <span class="hljs-string">f"https://api.<span class="hljs-subst">{shard}</span>.cliniko.com/v1/bookings/<span class="hljs-subst">{booking_id}</span>"</span>
response = requests.get(url, headers=headers, auth=(username,password))
<span class="hljs-keyword">if</span> response.status_code == <span class="hljs-number">200</span>:
    appointments = response.json()
    print(<span class="hljs-string">"Appointments fetched successfully:"</span>, appointments)
<span class="hljs-keyword">else</span>:
    print(<span class="hljs-string">"Failed to fetch appointments. Status code:"</span>, response.status_code)
</code></pre>
<h3 id="heading-retrieving-patient-records"><strong>Retrieving Patient Records</strong></h3>
<pre><code class="lang-python">url = <span class="hljs-string">f"https://api.<span class="hljs-subst">{shard}</span>.cliniko.com/v1/patients"</span>
response = requests.get(url, headers=headers, auth=(username,password))
<span class="hljs-keyword">if</span> response.status_code == <span class="hljs-number">200</span>:
    patients = response.json()
    print(<span class="hljs-string">"Patient records fetched successfully:"</span>, patients)
<span class="hljs-keyword">else</span>:
    print(<span class="hljs-string">"Failed to fetch patient records. Status code:"</span>, response.status_code)
</code></pre>
<hr />
<h2 id="heading-5-advanced-api-operations"><strong>5. Advanced API Operations</strong></h2>
<h3 id="heading-filtering-and-query-parameters"><strong>Filtering and Query Parameters</strong></h3>
<p>Cliniko allows the use of powerful query parameters to filter data based on specific conditions. By using the <code>q[]</code> parameter, you can fine-tune your queries for appointments, patients, and other resources. Here are some examples of how to leverage this functionality.</p>
<p><strong>1. Filtering Appointments by Date Range</strong></p>
<p>To fetch appointments within a specific date range, you can apply multiple filters using the <code>q[]</code> parameter for conditions like "greater than" or "less than." For example, retrieving appointments that start after a certain date:</p>
<pre><code class="lang-python">query = {
    <span class="hljs-string">"page"</span>: <span class="hljs-string">"0"</span>,
    <span class="hljs-string">"per_page"</span>: <span class="hljs-string">"1"</span>,
    <span class="hljs-string">"sort"</span>: <span class="hljs-string">"created_at:desc"</span>,
    <span class="hljs-string">"q[]"</span>: <span class="hljs-string">"starts_at:&gt;2014-03-04T20:37:17Z"</span>,  <span class="hljs-comment"># Start date filter</span>
    <span class="hljs-string">"order"</span>: <span class="hljs-string">"asc"</span>
}
url = <span class="hljs-string">f"https://api.<span class="hljs-subst">{shard}</span>.cliniko.com/v1/individual_appointments"</span>
response = requests.get(url, params=query, headers=headers, auth=(username,password))

<span class="hljs-keyword">if</span> response.status_code == <span class="hljs-number">200</span>:
    appointments = response.json()
    print(<span class="hljs-string">"Filtered appointments:"</span>, appointments)
<span class="hljs-keyword">else</span>:
    print(<span class="hljs-string">"Failed to fetch filtered appointments. Status code:"</span>, response.status_code)
</code></pre>
<p><strong>2. Using Wildcards for Flexible Search</strong></p>
<p>You can perform more complex searches using wildcards by employing the <code>~~</code> symbol. For instance, to search for patients whose last names match the pattern <code>ja%on%</code> (where <code>%</code> matches any sequence of characters, e.g. "Jason" or "Jameson"):</p>
<pre><code class="lang-python">query = {
    <span class="hljs-string">"page"</span>: <span class="hljs-string">"0"</span>,
    <span class="hljs-string">"per_page"</span>: <span class="hljs-string">"10"</span>,
    <span class="hljs-string">"q[]"</span>: <span class="hljs-string">"last_name:~~ja%on%"</span>,
    <span class="hljs-string">"order"</span>: <span class="hljs-string">"asc"</span>
}
url = <span class="hljs-string">f"https://api.<span class="hljs-subst">{shard}</span>.cliniko.com/v1/patients"</span>
response = requests.get(url, params=query, headers=headers, auth=(username,password))

<span class="hljs-keyword">if</span> response.status_code == <span class="hljs-number">200</span>:
    patients = response.json()
    print(<span class="hljs-string">"Filtered patients:"</span>, patients)
<span class="hljs-keyword">else</span>:
    print(<span class="hljs-string">"Failed to fetch filtered patients. Status code:"</span>, response.status_code)
</code></pre>
<p><strong>3. Filtering Archived Records</strong></p>
<p>To fetch archived records, you can filter with <code>archived_at</code> like this:</p>
<pre><code class="lang-python">query = {
    <span class="hljs-string">"page"</span>: <span class="hljs-string">"0"</span>,
    <span class="hljs-string">"per_page"</span>: <span class="hljs-string">"10"</span>,
    <span class="hljs-string">"q[]"</span>: <span class="hljs-string">"archived_at:*"</span>,  <span class="hljs-comment"># Matches records whose archived_at is set, i.e. archived records</span>
    <span class="hljs-string">"order"</span>: <span class="hljs-string">"asc"</span>
}
url = <span class="hljs-string">f"https://api.<span class="hljs-subst">{shard}</span>.cliniko.com/v1/individual_appointments"</span>
response = requests.get(url, params=query, headers=headers, auth=(username,password))

<span class="hljs-keyword">if</span> response.status_code == <span class="hljs-number">200</span>:
    appointments = response.json()
    print(<span class="hljs-string">"Archived appointments:"</span>, appointments)
<span class="hljs-keyword">else</span>:
    print(<span class="hljs-string">"Failed to fetch archived appointments. Status code:"</span>, response.status_code)
</code></pre>
<p><strong>4. Sorting and Ordering Results</strong></p>
<p>By default, results are ordered by the <code>created_at</code> field in ascending order. If you want to sort based on a different field, such as <code>starts_at</code>, and specify the order, you can use the <code>sort</code> and <code>order</code> parameters:</p>
<pre><code class="lang-python">query = {
    <span class="hljs-string">"page"</span>: <span class="hljs-string">"0"</span>,
    <span class="hljs-string">"per_page"</span>: <span class="hljs-string">"10"</span>,
    <span class="hljs-string">"sort"</span>: <span class="hljs-string">"starts_at,created_at:desc"</span>,  <span class="hljs-comment"># Sort by start time, then creation time</span>
    <span class="hljs-string">"q[]"</span>: <span class="hljs-string">"starts_at:&gt;2014-03-04T20:37:17Z"</span>,  <span class="hljs-comment"># Example filter</span>
    <span class="hljs-string">"order"</span>: <span class="hljs-string">"desc"</span>
}
url = <span class="hljs-string">f"https://api.<span class="hljs-subst">{shard}</span>.cliniko.com/v1/individual_appointments"</span>
response = requests.get(url, params=query, headers=headers, auth=(username,password))

<span class="hljs-keyword">if</span> response.status_code == <span class="hljs-number">200</span>:
    appointments = response.json()
    print(<span class="hljs-string">"Sorted appointments:"</span>, appointments)
<span class="hljs-keyword">else</span>:
    print(<span class="hljs-string">"Failed to fetch sorted appointments. Status code:"</span>, response.status_code)
</code></pre>
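<p>The filter examples above all share the same shape, so a tiny helper can assemble them. A minimal sketch (our own convenience function, not part of the Cliniko API); note that <code>requests</code> encodes a list value for <code>q[]</code> as repeated query parameters, which is how Cliniko combines multiple filters:</p>
<pre><code class="lang-python">def build_query(filters, page=0, per_page=10, sort=None, order="asc"):
    """Assemble Cliniko query parameters from a list of q[] filter strings."""
    query = {
        "page": str(page),
        "per_page": str(per_page),
        "order": order,
        "q[]": filters,  # A list value becomes repeated q[] parameters
    }
    if sort:
        query["sort"] = sort
    return query
</code></pre>
<p>For example, <code>build_query(["starts_at:&gt;2014-03-04T20:37:17Z", "archived_at:*"], sort="created_at:desc")</code> produces the same parameters as the dictionaries written out above.</p>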
<h3 id="heading-pagination"><strong>Pagination</strong></h3>
<p>Large datasets are paginated. Use the <code>next</code> and <code>prev</code> links to navigate:</p>
<pre><code class="lang-python">url = <span class="hljs-string">f"https://api.<span class="hljs-subst">{shard}</span>.cliniko.com/v1/bookings"</span>
<span class="hljs-keyword">while</span> url:
    response = requests.get(url, headers=headers, auth=(username, password))
    <span class="hljs-keyword">if</span> response.status_code == <span class="hljs-number">200</span>:
        data = response.json()
        <span class="hljs-keyword">for</span> record <span class="hljs-keyword">in</span> data[<span class="hljs-string">"bookings"</span>]:  <span class="hljs-comment"># Records are keyed by the resource name</span>
            print(record)
        url = data.get(<span class="hljs-string">"links"</span>, {}).get(<span class="hljs-string">"next"</span>)  <span class="hljs-comment"># Follow the "next" link until no pages remain</span>
    <span class="hljs-keyword">else</span>:
        print(<span class="hljs-string">"Failed to fetch data. Status code:"</span>, response.status_code)
        <span class="hljs-keyword">break</span>
</code></pre>
<h3 id="heading-error-handling-in-api-requests"><strong>Error Handling in API Requests</strong></h3>
<p>To ensure your application handles API errors smoothly, follow this simple structure:</p>
<pre><code class="lang-python"><span class="hljs-keyword">try</span>:
    response = requests.get(url, headers=headers, auth=(username, password))
    response.raise_for_status()  <span class="hljs-comment"># Checks for any HTTP error</span>
    data = response.json()
<span class="hljs-keyword">except</span> requests.exceptions.HTTPError <span class="hljs-keyword">as</span> err:
    print(<span class="hljs-string">f"HTTP error occurred: <span class="hljs-subst">{err}</span>"</span>)
<span class="hljs-keyword">except</span> Exception <span class="hljs-keyword">as</span> e:
    print(<span class="hljs-string">f"An error occurred: <span class="hljs-subst">{e}</span>"</span>)
</code></pre>
<p><strong>HTTP Response Codes:</strong></p>
<ul>
<li><p><strong>2xx</strong>: Success – Everything worked as expected.</p>
</li>
<li><p><strong>4xx</strong>: Client error – There was an issue with the request (e.g., missing required parameters).</p>
</li>
<li><p><strong>5xx</strong>: Server error – Something went wrong on the Cliniko server's side.</p>
</li>
</ul>
<hr />
<h2 id="heading-6-challenges-you-can-face-when-using-cliniko-apis-in-python">6. Challenges You Can Face When Using Cliniko APIs in Python</h2>
<h3 id="heading-1-authentication-and-authorization-issues">1. Authentication and Authorization Issues</h3>
<p>Managing API keys securely is essential to avoid access issues or security vulnerabilities. Each API key is associated with a specific <strong>shard</strong>—the geographical server where your data is hosted. When making requests, ensure that the API key includes the correct shard information in the request, such as appending the shard ID to the API endpoint to direct the request to the correct server. Expired or invalid keys can lead to authentication failures, so regularly refresh and monitor your API keys.</p>
<h3 id="heading-2-api-deprecation-or-changes"><strong>2. API Deprecation or Changes</strong></h3>
<p>Cliniko may update or discontinue API endpoints, introduce new versions, or change response formats, causing compatibility issues. Regularly monitoring the API documentation and release notes helps ensure your integration stays up to date.</p>
<hr />
<h2 id="heading-7-custom-solutions-for-our-client-abbyhttpswwwabbyclinic"><strong>7. Custom Solutions for our client</strong> <a target="_blank" href="https://www.abby.clinic/"><strong>Abby</strong></a></h2>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1736753958064/d14e6591-6e58-4561-85e6-3b943d6248e6.png" alt /></p>
<h3 id="heading-fine-tuning-intent-classification-models-for-abbyhttpswwwabbyclinic"><strong>Fine-Tuning Intent Classification Models for</strong> <a target="_blank" href="https://www.abby.clinic/"><strong>Abby</strong></a></h3>
<p><a target="_blank" href="https://www.abby.clinic/">Abby</a> is an AI-powered solution designed to streamline appointment confirmations in healthcare operations through seamless integration with the Cliniko API. The client required a secure, stand-alone system that does not rely on broad-based language models, ensuring data privacy and adherence to stringent security standards. Available as a Chrome extension for Cliniko users, Abby uses a proprietary AI language model to analyze inbound responses, updating appointment statuses directly within the Cliniko calendar. With over 98% accuracy, it reliably handles most message intents, including emojis, while ongoing refinements address edge cases like conflicting responses.</p>
<p><strong>Key Features:</strong></p>
<ul>
<li><p><strong>AI-Powered SMS Analysis:</strong> Automatically interpret patient responses and instantly update appointment statuses, focusing on what requires action.</p>
</li>
<li><p><strong>Enhanced Visuals for Cliniko:</strong> Temporary color-coded status markers in your Cliniko calendar make it easy to spot unconfirmed appointments.</p>
</li>
<li><p><strong>Integrated Appointment Notes:</strong> Abby posts SMS responses directly to appointment notes, keeping all communication in one place.</p>
</li>
<li><p><strong>No Learning Curve:</strong> A single click provides access to real-time appointment status without disrupting your workflow.</p>
</li>
</ul>
<p>By automating routine tasks, Abby optimizes healthcare operations, improving patient experience while saving valuable administrative time—without compromising on security or patient confidentiality.</p>
<p><strong><em>Check out Abby here</em></strong> : <a target="_blank" href="https://www.abby.clinic/">https://www.abby.clinic/</a>.</p>
<hr />
<h2 id="heading-8-ai-powered-innovations-for-healthcare">8. AI-Powered Innovations for Healthcare</h2>
<p>We specialize in developing <a target="_blank" href="https://www.futuresmart.ai/case-studies"><strong>custom AI solutions</strong></a> that seamlessly integrate with existing platforms like Cliniko, enabling clinics and healthcare businesses to <strong>automate processes, enhance patient interactions, and extract actionable insights from data.</strong></p>
<p>Here’s how our AI expertise can benefit your practice:</p>
<p>✅ <strong>RAG-Based AI Chatbots for Clinics</strong> – AI-powered chatbots capable of retrieving patient information, answering FAQs, and assisting with appointment scheduling based on real-time Cliniko data.</p>
<p>✅ <strong>NL2SQL for Healthcare Analytics</strong> – Query your Cliniko database using natural language and extract structured insights without writing complex SQL queries.</p>
<p>✅ <strong>Document Parsing with Vision LLMs</strong> – Process invoices, prescriptions, medical records, and insurance forms using advanced AI models that can accurately extract key details from scanned documents.</p>
<p>✅ <strong>Automated Patient Reports &amp; Summaries</strong> – Use AI to generate structured patient reports, summarizing appointment details, lab results, and doctor notes, reducing manual effort.</p>
<p>✅ <strong>AI Agents for Administrative Tasks</strong> – Automate routine clinic operations, such as appointment confirmations, follow-ups, and billing inquiries, reducing workload and improving patient experience.</p>
<p>By combining <strong>Generative AI, NLP, and advanced data processing techniques</strong>, we create solutions that improve efficiency, reduce administrative burden, and optimize patient management.</p>
<hr />
<h2 id="heading-9-why-work-with-us">9. Why Work With Us?</h2>
<p>At <a target="_blank" href="https://www.futuresmart.ai/"><strong>FutureSmart AI</strong></a>, we bring a wealth of experience in <strong>Cliniko APIs, Python development, and cutting-edge AI solutions</strong> tailored for the healthcare industry.</p>
<p>🚀 <strong>Cliniko Expertise</strong> – Deep understanding of Cliniko APIs, enabling seamless integrations and automation.<br />🔍 <strong>AI-Driven Efficiency</strong> – We build secure, privacy-compliant AI solutions that enhance decision-making and reduce operational overhead.<br />📈 <strong>Custom-Tailored Solutions</strong> – Whether you need chatbot automation, document parsing, or AI-driven analytics, we create <strong>bespoke AI applications</strong> to fit your unique needs.<br />🔒 <strong>Security &amp; Compliance</strong> – Our solutions adhere to industry best practices, ensuring patient data confidentiality and regulatory compliance.</p>
<p>Want to <strong>automate your clinic operations</strong> or build a <strong>custom AI-powered healthcare solution</strong>?<br />📩 <strong>Contact us at</strong> <a target="_blank" href="mailto:contact@futuresmart.ai">contact@futuresmart.ai</a> to discuss how we can help!</p>
<hr />
<h2 id="heading-10-conclusion">10. Conclusion</h2>
<p>Cliniko’s API ecosystem, combined with <strong>Python and AI</strong>, unlocks powerful opportunities for <strong>automating clinic workflows, streamlining data processing, and enhancing patient engagement.</strong> Whether it's <strong>AI-powered chatbots, document automation, or analytics-driven insights</strong>, FutureSmart AI can help you <strong>build scalable, intelligent solutions</strong> tailored to your needs.</p>
<p>Let's transform healthcare operations with AI. 🚀</p>
]]></content:encoded></item><item><title><![CDATA[Must-Have Skills for Upcoming Software Developers and AI Engineers in 2025]]></title><description><![CDATA[Over my 8+ years of industry experience—ranging from backend development and data engineering to machine learning and the latest generative AI applications—I’ve seen firsthand what it really takes to add value in a team. As the Founder of FutureSmart...]]></description><link>https://blog.futuresmart.ai/must-have-skills-for-upcoming-software-developers-and-ai-engineers-in-2025</link><guid isPermaLink="true">https://blog.futuresmart.ai/must-have-skills-for-upcoming-software-developers-and-ai-engineers-in-2025</guid><category><![CDATA[internships]]></category><category><![CDATA[General Advice]]></category><category><![CDATA[generative ai]]></category><category><![CDATA[Developer]]></category><category><![CDATA[mentorship]]></category><dc:creator><![CDATA[Pradip Nichite]]></dc:creator><pubDate>Sun, 12 Jan 2025 18:48:55 GMT</pubDate><enclosure url="https://cdn.hashnode.com/res/hashnode/image/upload/v1736707657620/ddb7d450-532a-42a7-8401-cc9838b65903.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Over my 8+ years of industry experience—ranging from backend development and data engineering to machine learning and the latest generative AI applications—I’ve seen firsthand what it really takes to add value in a team. As the Founder of <a target="_blank" href="https://www.futuresmart.ai/">FutureSmart AI</a>, I’ve mentored numerous students and interviewed candidates, and one clear pattern emerges:</p>
<p><strong>You need solid fundamental skills in coding, APIs, databases, and communication to thrive in the rapidly evolving tech landscape.</strong> This holds true whether you aim to become a traditional software developer or you’re intrigued by the cutting edge of Generative AI. If you’re focused on learning the “shiny” new <a target="_blank" href="https://aidemos.com/">AI tools</a> without first mastering the core foundations, you’ll struggle to deliver real impact.</p>
<p><a target="_blank" href="https://www.linkedin.com/feed/update/urn:li:activity:7242065082073927681/">Being an Expert-Vetted (Top 1%) on Upwork with over $300K+ earned and a 100% Job Success rate</a> has further reinforced my understanding of what skills truly matter in the industry.</p>
<p>Below, I’ll walk through the essential skills that, in my experience, every aspiring developer or AI engineer should prioritize.</p>
<h2 id="heading-1-why-practical-skills-matter">1. Why Practical Skills Matter</h2>
<p>Many newcomers are drawn to the excitement of <a target="_blank" href="https://aidemos.com/ai-playground">advanced AI concepts</a>—like Generative AI, LLMs, RAG (Retrieval Augmented Generation), or vector databases. While these innovations are undoubtedly game-changing, they can overshadow the day-to-day coding, debugging, and deployment tasks that keep a project running smoothly.</p>
<p><strong>Think about it:</strong> Even the most cutting-edge AI system is useless if it can’t be integrated into a functional application. That requires reliable code, proper database connections, and well-structured APIs. It’s why companies—when hiring interns or fresh grads—emphasize practical software development skills just as much as familiarity with new AI trends.</p>
<p><strong>Key point:</strong> Whether your goal is traditional software development or specialized AI engineering, you’ll stand out by showing you can handle the basics and be an asset to the team rather than a liability. Mastering these fundamentals will make it far easier to adopt advanced <a target="_blank" href="https://aidemos.com/">AI tools</a> effectively.</p>
<h2 id="heading-2-embracing-self-learning-amp-adaptability">2. Embracing Self-Learning &amp; Adaptability</h2>
<p>In a world where <a target="_blank" href="https://youtu.be/RALmm6flXII?si=w_gJy6bOginm9IiP"><strong>ChatGPT</strong></a> and <a target="_blank" href="https://www.youtube.com/@FutureSmartAI"><strong>YouTube tutorials</strong></a> are just a click away, memorizing syntax is less important than knowing <strong>how</strong> to discover answers and adapt them to your specific needs. When you face a new challenge—be it <strong>connecting a database</strong> or <strong>troubleshooting an API route</strong>—your ability to learn on the fly is what truly sets you apart.</p>
<ul>
<li><p><strong>Resourcefulness Over Rote Memorization</strong>: Whether you’re copying a snippet from Stack Overflow or ChatGPT, your real value is in customizing and debugging that code for your application.</p>
</li>
<li><p><strong>Hands-On Experience</strong>: Reading documentation is great, but actually <strong>breaking things</strong> and fixing them is how you learn effectively.</p>
</li>
<li><p><strong>Continuous Upskilling</strong>: The AI landscape evolves quickly. If you want to keep up, you need to be comfortable <strong>teaching yourself</strong> new frameworks and libraries.</p>
</li>
</ul>
<h2 id="heading-3-mastering-practical-python">3. Mastering Practical Python</h2>
<p>While you don’t need to be a “Python guru” or a competitive programming champion, you do need to write clear, functional code that gets the job done. Here’s what matters most:</p>
<ul>
<li><p><strong>Use the Right Data Structures:</strong> Know when to use lists for ordered data, dictionaries for key-value pairs, and tuples for lightweight groupings.</p>
</li>
<li><p><strong>Organize Your Code with Functions and Files:</strong> This makes your code more readable, reusable, and easier to debug. Keep files small instead of writing long scripts.</p>
</li>
<li><p><strong>Handle Files and Data:</strong> Most real-world software and AI projects involve ingesting or cleaning data. Be comfortable working with CSV, JSON, or other common formats.</p>
</li>
</ul>
<h3 id="heading-common-pitfalls-to-avoid"><strong>Common Pitfalls to Avoid:</strong></h3>
<p>❌ Writing overly complex, hard-to-read code.</p>
<p>❌ Ignoring error handling—always use <code>try-except</code> blocks.</p>
<p>❌ Copy-pasting from Stack Overflow or ChatGPT without understanding the solution.</p>
<p>❌ Writing long, unstructured Python files instead of modular, function-based code split into small files.</p>
<h3 id="heading-hands-on-challenge"><strong>Hands-on Challenge:</strong></h3>
<blockquote>
<p>Write a Python script that reads a JSON file, processes it, and writes an updated file. Keep the script modular by splitting different functionalities into separate functions and files.</p>
</blockquote>
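<p>A minimal sketch of one way to approach this challenge, using only the standard library (the file names and record fields are illustrative placeholders):</p>

```python
import json
from pathlib import Path


def load_records(path: str) -> list:
    """Read a JSON file containing a list of records."""
    return json.loads(Path(path).read_text(encoding="utf-8"))


def add_full_name(records: list) -> list:
    """Example transformation: derive a full_name field from first/last names."""
    return [
        {**r, "full_name": f"{r.get('first', '')} {r.get('last', '')}".strip()}
        for r in records
    ]


def save_records(records: list, path: str) -> None:
    """Write the updated records back to disk as pretty-printed JSON."""
    Path(path).write_text(json.dumps(records, indent=2), encoding="utf-8")


# Usage: save_records(add_full_name(load_records("users.json")), "users_updated.json")
```

Keeping each step in its own small function makes the script easy to test and reuse.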
<p>In the professional world, clarity and reliability are often more valuable than clever but cryptic code. Once your Python basics are solid, you’ll be able to pick up new libraries or advanced frameworks with minimal fuss.</p>
<h2 id="heading-4-working-with-apis">4. Working with APIs</h2>
<p>The ability to consume and provide APIs is crucial in modern software development—whether you’re building e-commerce backends, <a target="_blank" href="https://youtu.be/W3FadhIpSmU?si=QQLZyacFcHo70M4v">AI-driven chatbots</a>, or anything in between.</p>
<h3 id="heading-api-consumption"><strong>API Consumption</strong></h3>
<p>Most applications communicate via RESTful APIs. Know how to:</p>
<p>✅ Send requests (<strong>GET</strong>, <strong>POST</strong>, <strong>PUT</strong>, <strong>DELETE</strong>).</p>
<p>✅ Handle authentication (e.g., bearer tokens, API keys).</p>
<p>✅ Parse JSON responses (often deeply nested).</p>
<h3 id="heading-handling-authentication-amp-security"><strong>Handling Authentication &amp; Security</strong></h3>
<p>Expect to encounter JWT (JSON Web Tokens), cookies, and CORS (Cross-Origin Resource Sharing). Understanding these ensures your app communicates securely and efficiently.</p>
<h3 id="heading-working-with-json"><strong>Working with JSON</strong></h3>
<p>APIs typically return JSON, so you’ll deal with Python dictionaries. Knowing how to navigate nested structures and handle edge cases will save you hours of troubleshooting.</p>
<h3 id="heading-practical-use-cases"><strong>Practical Use Cases</strong></h3>
<ul>
<li>Fetch data from a public API (e.g., weather, currency exchange), transform the response, and store it for analytics.</li>
</ul>
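<p>A sketch of what these pieces look like together. The endpoint, token, and response shape below are invented for illustration; substitute a real public API:</p>

```python
from typing import Optional

import requests

# Hypothetical endpoint -- replace with a real public API.
API_URL = "https://api.example.com/v1/weather"


def fetch_weather(city: str, token: str) -> dict:
    """Send an authenticated GET request and return the parsed JSON body."""
    resp = requests.get(
        API_URL,
        params={"city": city},
        headers={"Authorization": f"Bearer {token}"},  # bearer-token auth
        timeout=10,
    )
    resp.raise_for_status()  # fail loudly on 4xx/5xx responses
    return resp.json()


def extract_temperature(payload: dict) -> Optional[float]:
    """Walk a nested JSON structure defensively with .get() fallbacks."""
    return payload.get("data", {}).get("current", {}).get("temp_c")
```

The defensive `.get()` chain means a missing or malformed field yields `None` instead of a `KeyError` deep inside your pipeline.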
<h2 id="heading-5-essential-sql-and-database-operations">5. Essential SQL and Database Operations</h2>
<p>No matter what you build, you need a reliable way to store and retrieve data. SQL databases like <strong>MySQL</strong> and <strong>PostgreSQL</strong> are still core technologies in production environments.</p>
<ul>
<li><p><strong>Basic SQL Queries:</strong> Learn how to <code>SELECT</code>, <code>INSERT</code>, <code>UPDATE</code>, and <code>DELETE</code>. These cover most interactions with a relational database.</p>
</li>
<li><p><strong>Connecting with Python:</strong> Use libraries like <code>psycopg2</code> or <code>mysql-connector-python</code> to integrate your Python application with the database.</p>
</li>
</ul>
<h3 id="heading-real-world-examplehttpsyoutube38amtxy2ususi1ryzzvkt95zcq64oampt4569"><a target="_blank" href="https://youtu.be/38aMTXY2usU?si=1RyzZVkT95Zcq64O&amp;t=4569"><strong>Real-World Example</strong></a></h3>
<p>Many LLMs don’t persist previous user inputs once an API call finishes. To maintain conversation history (for a chatbot or a Generative AI assistant):</p>
<ol>
<li><p>Assign a <strong>session ID</strong> for each conversation.</p>
</li>
<li><p>Store all messages (both user and AI) in a SQL database.</p>
</li>
<li><p>For subsequent queries, retrieve the conversation history using that session ID.</p>
</li>
<li><p>Pass the history to the LLM for context, then store the new response.</p>
</li>
</ol>
<p>This simulates “memory” and vastly improves user experience.</p>
<h3 id="heading-practice-task"><strong>Practice Task:</strong></h3>
<blockquote>
<p>Create a simple database that stores user messages with timestamps and session IDs. Design a query to retrieve past messages for a specific user session.</p>
</blockquote>
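<p>Here is a minimal sketch of that workflow using Python's built-in <code>sqlite3</code>. The table and column names are my own choices; in production you would swap in MySQL or PostgreSQL with the connector libraries mentioned above:</p>

```python
import sqlite3
from datetime import datetime, timezone

# In-memory database for the demo; use a file path or a server DB in production.
conn = sqlite3.connect(":memory:")
conn.execute("""
    CREATE TABLE IF NOT EXISTS messages (
        id         INTEGER PRIMARY KEY AUTOINCREMENT,
        session_id TEXT NOT NULL,
        role       TEXT NOT NULL,   -- 'user' or 'ai'
        content    TEXT NOT NULL,
        created_at TEXT NOT NULL
    )
""")


def save_message(session_id: str, role: str, content: str) -> None:
    """Append one message to the conversation history."""
    conn.execute(
        "INSERT INTO messages (session_id, role, content, created_at) VALUES (?, ?, ?, ?)",
        (session_id, role, content, datetime.now(timezone.utc).isoformat()),
    )
    conn.commit()


def get_history(session_id: str) -> list:
    """Retrieve a session's messages in chronological order, ready to pass to an LLM."""
    cur = conn.execute(
        "SELECT role, content FROM messages WHERE session_id = ? ORDER BY id",
        (session_id,),
    )
    return cur.fetchall()
```

Before each LLM call, fetch the history for the session ID, prepend it to the prompt, and store the new user message and AI response afterward.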
<h2 id="heading-6-creating-your-own-apis-with-fastapihttpsyoutubekvdp4spwcc4siijxk83spo3qlrsim">6. <a target="_blank" href="https://youtu.be/KVdP4SpWcc4?si=IjXk83spo3qlRSim">Creating Your Own APIs with FastAPI</a></h2>
<p>Even if you’re not building complex AI services, knowing how to serve any functionality via an API is invaluable. FastAPI is a popular choice in Python circles due to its simplicity and asynchronous capabilities.</p>
<h3 id="heading-why-fastapi"><strong>Why FastAPI?</strong></h3>
<ul>
<li><p><strong>Speed and Simplicity:</strong> An asynchronous framework that makes handling concurrent requests more efficient.</p>
</li>
<li><p><strong>Auto-Generated Documentation:</strong> Teammates and clients can easily understand and test your endpoints.</p>
</li>
<li><p><strong>Widely Used in AI &amp; ML:</strong> FastAPI has become a go-to for deploying machine learning models quickly.</p>
</li>
</ul>
<h3 id="heading-key-concepts"><strong>Key Concepts</strong></h3>
<ul>
<li><p><strong>Endpoints (Routes):</strong> Clearly define the URLs (e.g., <code>/predict</code>) for specific functionalities.</p>
</li>
<li><p><strong>Sync vs. Async:</strong> Leverage asynchronous functions for better performance when scaling out.</p>
</li>
<li><p><strong>Data Validation:</strong> Validate incoming JSON data before processing to prevent runtime errors.</p>
</li>
</ul>
<h3 id="heading-mini-project"><strong>Mini Project:</strong></h3>
<blockquote>
<p>Build a FastAPI service that accepts text input and returns a sentiment score using an NLP model.</p>
</blockquote>
<h2 id="heading-6-getting-comfortable-with-basic-deploymenthttpsyoutube7fvpn25mmeqsijaygnuikbhizbre3">6. <a target="_blank" href="https://youtu.be/7FVPn25mmEQ?si=JAYGnuiKbHIZbrE3">Getting Comfortable with Basic Deployment</a></h2>
<p>Whether you’re a back-end engineer or a budding AI Engineer, your application needs to be accessible to others to deliver real value. That’s where deployment comes in.</p>
<h3 id="heading-why-deployment-matters"><strong>Why Deployment Matters</strong></h3>
<ul>
<li><p><strong>Accessibility:</strong> A deployed app or API can be accessed by anyone with the right permissions.</p>
</li>
<li><p><strong>Real-World Feedback:</strong> Live usage data and metrics guide iterative improvements.</p>
</li>
<li><p><strong>Collaboration:</strong> Demonstrates you understand the full development lifecycle, not just coding.</p>
</li>
</ul>
<h3 id="heading-common-deployment-options"><strong>Common Deployment Options</strong></h3>
<ol>
<li><p><strong>Cloud VMs:</strong> AWS, Azure, or Google Cloud let you rent servers and install your software stack.</p>
</li>
<li><p><strong>PaaS (Platform as a Service):</strong> Services like Heroku or Render manage much of the infrastructure for you.</p>
</li>
<li><p><strong>Company Infrastructure:</strong> Internships often provide access to enterprise-grade tools, a great way to learn hands-on without personal costs.</p>
</li>
</ol>
<h3 id="heading-hands-on-task"><strong>Hands-on Task:</strong></h3>
<blockquote>
<p>Deploy your FastAPI app to Render or a cloud VM and access it via a public URL.</p>
</blockquote>
<h2 id="heading-7-where-advanced-ai-concepts-fit-in">7. Where Advanced AI Concepts Fit In</h2>
<p>Once you’ve nailed down the basics, exploring advanced AI topics becomes far more rewarding. Whether it’s Generative AI, RAG, vector databases, or frameworks like LangChain, these cutting-edge tools are best leveraged when you can integrate them seamlessly into real applications.</p>
<h3 id="heading-integration-over-isolation"><strong>Integration Over Isolation</strong></h3>
<ul>
<li><p><a target="_blank" href="https://youtu.be/38aMTXY2usU?si=AqdP50LzuuWcPokO"><strong>RAG (Retrieval Augmented Generation)</strong></a><strong>:</strong> Fetch relevant context from a vector database, then pass it to an LLM.</p>
</li>
<li><p><a target="_blank" href="https://youtu.be/5NG8mefEsCU?si=y9AKNx7KsRo2RgnD"><strong>Vector Databases</strong></a><strong>:</strong> Specialized for embedding-based searches, often used in semantic search or question-answering.</p>
</li>
<li><p><a target="_blank" href="https://youtu.be/NQWfvhw7OcI?si=dv_VfrRQYQKwUw2S"><strong>LangChain (and Similar)</strong></a><strong>:</strong> Frameworks that streamline building LLM-powered applications.</p>
</li>
</ul>
<h2 id="heading-real-world-example"><strong>Real-World Example:</strong></h2>
<blockquote>
<p>Suppose you're developing a document search tool for legal professionals. Using RAG with a vector database enables efficient retrieval of relevant case law and legal precedents based on user queries: RAG fetches the right references, and an LLM summarizes them, improving productivity compared to traditional keyword-based search.</p>
</blockquote>
<h3 id="heading-applied-ai-use-case"><strong>Applied AI Use Case:</strong></h3>
<p>Imagine you're building an AI-powered FAQ chatbot. Instead of just <strong>generating random responses</strong>, you:</p>
<p>1️⃣ <strong>Retrieve relevant FAQs</strong> from a vector database.</p>
<p>2️⃣ <strong>Pass that context to an LLM.</strong></p>
<p>3️⃣ <strong>Generate a response using the AI model.</strong></p>
<p>4️⃣ <strong>Deliver the output via an API.</strong></p>
<p>The entire system <strong>depends</strong> on strong coding, API, and database skills.</p>
<blockquote>
<p>Tip: Instead of jumping straight to LangChain, try manually implementing a RAG pipeline to understand how everything connects.</p>
</blockquote>
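<p>To make that tip concrete, here is a toy end-to-end RAG pipeline with no external dependencies. The bag-of-words "embeddings" and hard-coded documents are stand-ins for a real embedding model and vector database:</p>

```python
import math
from collections import Counter

# A tiny "knowledge base" standing in for a vector database.
DOCS = [
    "Our return window is 30 days from the delivery date.",
    "Standard shipping takes 3-5 business days.",
    "We accept credit cards and PayPal.",
]


def embed(text: str) -> Counter:
    """Toy 'embedding': a bag-of-words vector. Real pipelines use model embeddings."""
    return Counter(text.lower().split())


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(a[w] * b[w] for w in a)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def retrieve(query: str, k: int = 1) -> list:
    """Return the k documents most similar to the query."""
    q = embed(query)
    return sorted(DOCS, key=lambda d: cosine(q, embed(d)), reverse=True)[:k]


def build_prompt(query: str) -> str:
    """Augment the user query with retrieved context before calling an LLM."""
    context = "\n".join(retrieve(query))
    return f"Answer using only this context:\n{context}\n\nQuestion: {query}"
```

The final prompt would then be sent to an LLM; swapping the toy pieces for real embeddings and a vector store gives you the production version, which is exactly what frameworks like LangChain wrap for you.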
<div class="embed-wrapper"><div class="embed-loading"><div class="loadingRow"></div><div class="loadingRow"></div></div><a class="embed-card" href="https://youtu.be/8sSHg1034r0?si=-q_Q35vXNo6R_FcL">https://youtu.be/8sSHg1034r0?si=-q_Q35vXNo6R_FcL</a></div>
<h2 id="heading-8-conclusion-amp-next-steps">8. Conclusion &amp; Next Steps</h2>
<p>After years of working in this industry, I can confidently say these foundational skills are what truly enable success—whether you’re a software developer, a data scientist, or an AI specialist. I’ve interviewed and mentored many candidates, and the ones who excel are those who can <strong>both</strong> understand advanced AI concepts <strong>and</strong> handle the essential coding and deployment work.</p>
<h3 id="heading-key-takeaways"><strong>Key Takeaways</strong></h3>
<ul>
<li><p><strong>Practice &amp; Portfolio:</strong> Build small, complete projects that demonstrate your ability to write clean Python code, work with APIs, interact with a database, and (ideally) deploy your application.</p>
</li>
<li><p><strong>Seek Internships &amp; Mentorship:</strong> Real-world experience accelerates learning. You’ll gain exposure to production environments and valuable feedback from peers or senior developers.</p>
</li>
<li><p><strong>Explore Advanced AI Topics:</strong> Once you have the basics locked in, frameworks like LangChain or vector databases can add significant value to your skill set.</p>
</li>
<li><p><strong>Stay Curious &amp; Adaptable:</strong> Technology moves fast—those who keep learning and adapting will always stay relevant.</p>
</li>
</ul>
<p><strong>Final Thought:</strong></p>
<p>The world of tech—especially AI—is growing at an unprecedented pace. Make sure you have the fundamentals in place so you can ride that wave rather than getting washed away by it. With a strong foundation, you’ll be ready to contribute meaningfully to any team, whether you’re a software developer, data engineer, or the next generative AI whiz.</p>
<p>For real-world examples of our work, take a look at our <a target="_blank" href="https://www.futuresmart.ai/case-studies"><strong>case studies</strong></a>.</p>
]]></content:encoded></item><item><title><![CDATA[Freshdesk Ticketing Automation with AI Agents: A Complete Guide]]></title><description><![CDATA[Introduction
At FutureSmart AI, we've helped many clients improve their support systems with AI customer service solutions. Now, we're sharing our proven approach to creating a custom AI assistant for customer service using Freshdesk. This tutorial s...]]></description><link>https://blog.futuresmart.ai/freshdesk-automation-with-ai-agents</link><guid isPermaLink="true">https://blog.futuresmart.ai/freshdesk-automation-with-ai-agents</guid><category><![CDATA[AI]]></category><category><![CDATA[#agent]]></category><category><![CDATA[automation]]></category><category><![CDATA[RAG ]]></category><category><![CDATA[langchain]]></category><category><![CDATA[freshdesk]]></category><dc:creator><![CDATA[Shreyas Dhaware]]></dc:creator><pubDate>Thu, 09 Jan 2025 11:55:31 GMT</pubDate><enclosure url="https://cdn.hashnode.com/res/hashnode/image/upload/v1736414395312/24a22eb8-e0a7-41e0-b80b-82bdcc58aafd.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h2 id="heading-introduction">Introduction</h2>
<p>At <a target="_blank" href="https://www.futuresmart.ai/">FutureSmart AI</a>, we've helped many clients improve their support systems with <strong>AI customer service</strong> solutions. Now, we're sharing our proven approach to creating a custom <strong>AI assistant for customer service</strong> using Freshdesk. This tutorial shows you how to build an <strong>AI customer service chatbot</strong> that delivers real results – from translating customer queries into SQL commands to generating smart responses. Having deployed similar solutions across various industries, we know exactly what works. By the end of this guide, you'll have a fully functional <strong>AI customer support agent</strong> ready to streamline your support operations.</p>
<p>Building on the foundational concepts covered in our previous <a target="_blank" href="https://blog.futuresmart.ai/series/langraph-tutorial"><strong>LangGraph tutorial series</strong></a>, where we explored creating simple AI tools for customer service using nodes and edges, this guide takes it a step further. You'll learn how to develop an agent that seamlessly:</p>
<ul>
<li><p>Translates natural language into precise SQL queries</p>
</li>
<li><p>Interacts with <strong>Freshdesk APIs</strong> to manage tickets</p>
</li>
<li><p>Taps into knowledge bases for accurate responses</p>
</li>
<li><p>Retrieves real-time order status and product details</p>
</li>
<li><p>Handles shipping and return policy inquiries</p>
</li>
<li><p>Generates context-aware responses to customer queries</p>
</li>
</ul>
<p>Drawing from our years of implementing AI solutions, we've distilled the essentials of building a robust <strong>Freshdesk chatbot</strong> that truly delivers results. This tutorial combines real-world applications with step-by-step guidance to help you create an AI-powered chatbot capable of retrieving ticket details, executing database queries, and enhancing customer service automation.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1735384007948/7c4fad54-92f0-4876-9317-21890434e0d2.jpeg" alt class="image--center mx-auto" /></p>
<h2 id="heading-setting-up-the-environment">Setting up the Environment</h2>
<pre><code class="lang-bash">!pip install -U langchain langchain-chroma pypdf sentence-transformers langgraph langchain_openai langchain_community requests os
</code></pre>
<h2 id="heading-setting-up-api-keys">Setting up API Keys</h2>
<p>Before diving into building your Freshdesk AI integration, it’s crucial to set up your API keys. These keys allow your agent to interact securely with external services like <strong>Freshdesk</strong> and OpenAI’s GPT models. Without them, the tools cannot function.</p>
<pre><code class="lang-python"><span class="hljs-keyword">import</span> getpass
<span class="hljs-keyword">import</span> os

<span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">_set_env</span>(<span class="hljs-params">var: str</span>):</span>
    <span class="hljs-keyword">if</span> <span class="hljs-keyword">not</span> os.environ.get(var):
        os.environ[var] = getpass.getpass(<span class="hljs-string">f"<span class="hljs-subst">{var}</span>: "</span>)

_set_env(<span class="hljs-string">"FRESHDESK_API_KEY"</span>)
_set_env(<span class="hljs-string">"OPENAI_API_KEY"</span>)
_set_env(<span class="hljs-string">"FESHDESK_URL"</span>)
</code></pre>
<h2 id="heading-creating-the-llm-object">Creating the LLM Object</h2>
<p>Here’s how to initialize the LLM using LangChain <code>ChatOpenAI</code></p>
<pre><code class="lang-python"><span class="hljs-keyword">from</span> langchain_openai <span class="hljs-keyword">import</span> ChatOpenAI

llm = ChatOpenAI(model_name=<span class="hljs-string">"gpt-4o"</span>)
</code></pre>
<h2 id="heading-summary-tool-integration">Summary Tool Integration</h2>
<p><strong>Automated customer support</strong> often requires summarizing ticket messages quickly. This tool fetches and processes data via <strong>Freshdesk APIs</strong> to generate concise summaries.</p>
<ol>
<li><p><strong>Get Ticket Message</strong>: Function that gets Ticket Data through Freshdesk API’s given <code>ticket_id</code></p>
<pre><code class="lang-python"> <span class="hljs-keyword">import</span> requests
 <span class="hljs-keyword">import</span> os
 FRESHDESK_API_KEY = os.getenv(<span class="hljs-string">"FRESHDESK_API_KEY"</span>)
 FESHDESK_URL = os.getenv(<span class="hljs-string">"FESHDESK_URL"</span>)

 <span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">get_messages</span>(<span class="hljs-params">ticket_id</span>):</span>
     url = <span class="hljs-string">f'https://<span class="hljs-subst">{FESHDESK_URL}</span>.freshdesk.com/api/v2/tickets/<span class="hljs-subst">{ticket_id}</span>?include=conversations'</span>
     headers = {
         <span class="hljs-string">'Content-Type'</span>: <span class="hljs-string">'application/json'</span>
     }
     auth = (FRESHDESK_API_KEY, <span class="hljs-string">'X'</span>)  <span class="hljs-comment"># Replace 'X' with your actual API key</span>
     response = requests.get(url, headers=headers, auth=auth)
     <span class="hljs-keyword">return</span> response.json()
</code></pre>
</li>
<li><p><strong>Process Message:</strong></p>
<pre><code class="lang-python"> <span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">process_messages</span>(<span class="hljs-params">messages</span>):</span>
     subject = messages[<span class="hljs-string">'subject'</span>]
     description = messages[<span class="hljs-string">'description_text'</span>]
     conversation = messages[<span class="hljs-string">'conversations'</span>]
     messages = <span class="hljs-string">""</span>
     messages += <span class="hljs-string">"Subject : "</span> + subject + <span class="hljs-string">"\n"</span>
     messages += <span class="hljs-string">"User query : \n"</span> + description + <span class="hljs-string">"\n"</span>
     <span class="hljs-keyword">for</span> message <span class="hljs-keyword">in</span> conversation:
         <span class="hljs-keyword">if</span> message[<span class="hljs-string">'incoming'</span>]:
             messages += <span class="hljs-string">"User Query : \n"</span> + message[<span class="hljs-string">'body_text'</span>] + <span class="hljs-string">"\n"</span>
         <span class="hljs-keyword">else</span>:
             messages += <span class="hljs-string">"Agent Response : \n"</span> + message[<span class="hljs-string">'body_text'</span>] + <span class="hljs-string">"\n"</span>
     <span class="hljs-keyword">return</span> messages
</code></pre>
</li>
<li><p><strong>Summary Chain:</strong> This method consolidates all the input Ticket Messages into a single prompt, which the LLM then processes.</p>
<pre><code class="lang-python"> prompt = ChatPromptTemplate.from_messages(
      [(<span class="hljs-string">"system"</span>, <span class="hljs-string">"""FutureSmart TechStore is committed to being the leading destination for technology enthusiasts worldwide by combining quality, value, and exceptional service. At FutureSmart TechStore, we don’t just sell products—we provide solutions that enhance your lifestyle and work environment. We aim to offer great value to our customers through a range of promotions and discounts, ensuring you get the best deals on the latest technology products.

     Our collection includes a wide range of tech products designed to meet the needs of various customers, Our collection includes a wide range of tech products such as Smart Home Solutions (intelligent lighting systems, advanced security devices), Wearable Technology (fitness trackers, smartwatches), Mobile Accessories (chargers, cases, screen protectors), and Computing Peripherals (keyboards, mice, storage devices), designed to meet the needs of various customers, from tech enthusiasts to everyday users.

     If the query is not in English, please identify the language and generate a summary of the conversation in English. The response should be generated in the same language as the query.Also mention order id and customer name if present.\\n\\n{context}. Summarize:"""</span>)]
     )

 <span class="hljs-comment"># Instantiate chain</span>
 chain = prompt | llm
 result = chain.invoke({<span class="hljs-string">"context"</span>: context})  <span class="hljs-comment"># `context` comes from process_messages()</span>
</code></pre>
</li>
<li><p><strong>Build a Summary Tool:</strong> Define a tool that generates ticket summary</p>
<pre><code class="lang-python"> <span class="hljs-keyword">from</span> langchain.tools <span class="hljs-keyword">import</span> tool
 <span class="hljs-keyword">from</span> pydantic <span class="hljs-keyword">import</span> BaseModel

 <span class="hljs-class"><span class="hljs-keyword">class</span> <span class="hljs-title">SummarizeToolSchema</span>(<span class="hljs-params">BaseModel</span>):</span>
     ticket_id: int

<span class="hljs-meta"> @tool(args_schema=SummarizeToolSchema)</span>
 <span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">summarize_tool</span>(<span class="hljs-params">ticket_id: int</span>):</span>
     <span class="hljs-string">"""Tool to Summarize Ticket information based on ticket_id provided
     `ticket_id`: int"""</span>
     print(<span class="hljs-string">"INSIDE SUMMARIZE NODE"</span>, ticket_id)
     <span class="hljs-comment"># Invoke chain</span>
     data = get_messages(ticket_id)
     context = process_messages(data)
     print(context)
     result = chain.invoke({<span class="hljs-string">"context"</span>: context})
     <span class="hljs-keyword">return</span> <span class="hljs-string">f"ticket_id: <span class="hljs-subst">{ticket_id}</span>"</span>, <span class="hljs-string">f"Ticket Summary: <span class="hljs-subst">{result.content}</span>"</span>
</code></pre>
</li>
<li><p><strong>Testing the Tool</strong></p>
<pre><code class="lang-python"> summarize_tool.invoke({<span class="hljs-string">"ticket_id"</span>:<span class="hljs-number">130</span>})
</code></pre>
<pre><code class="lang-python"> <span class="hljs-comment"># output</span>
 INSIDE SUMMARIZE NODE <span class="hljs-number">130</span>
 <span class="hljs-string">'The customer is inquiring about the status of their order #12345, as UPS lost the package. They understood that a new package would be sent immediately, but nearly a month has passed with no update. They are asking for information on what is happening.'</span>
</code></pre>
</li>
</ol>
<h2 id="heading-agentic-rag-tool-integration">Agentic RAG Tool Integration</h2>
<p><a target="_blank" href="https://blog.futuresmart.ai/langchain-rag-from-basics-to-production-ready-rag-chatbot"><strong>Retrieval-Augmented Generation (RAG)</strong></a> enhances your AI agent by enabling it to fetch relevant documents and deliver accurate, context-rich responses. Whether you're developing a <strong>RAG chatbot</strong>, building <strong>RAG pipelines</strong>, or exploring <a target="_blank" href="https://www.youtube.com/watch?v=38aMTXY2usU&amp;t=6256s"><strong>RAG for enterprise</strong></a>, this guide will walk you through the integration process step-by-step.</p>
<h3 id="heading-why-choose-rag">Why Choose RAG?</h3>
<ol>
<li><p><strong>Local Knowledge Integration</strong>: RAG uses local knowledge to customize responses based on private datasets.</p>
</li>
<li><p><strong>Post-Generation Customization</strong>: Tweak generated outputs or enforce a <strong>customized reply output</strong> format.</p>
</li>
</ol>
<h3 id="heading-steps-to-integrate-the-rag-tool"><strong>Steps to Integrate the RAG Tool</strong></h3>
<ol>
<li><strong>Load Documents</strong>: Use the <code>PyPDFLoader</code> and <code>Docx2txtLoader</code> to load documents for your <strong>RAG app</strong>.</li>
</ol>
<pre><code class="lang-python"> <span class="hljs-keyword">from</span> langchain_community.document_loaders <span class="hljs-keyword">import</span> PyPDFLoader, Docx2txtLoader

 <span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">load_documents</span>(<span class="hljs-params">folder_path: str</span>) -&gt; List[Document]:</span>
     documents = []
     <span class="hljs-keyword">for</span> filename <span class="hljs-keyword">in</span> os.listdir(folder_path):
         file_path = os.path.join(folder_path, filename)
         <span class="hljs-keyword">if</span> filename.endswith(<span class="hljs-string">'.pdf'</span>):
             loader = PyPDFLoader(file_path)
         <span class="hljs-keyword">elif</span> filename.endswith(<span class="hljs-string">'.docx'</span>):
             loader = Docx2txtLoader(file_path)
         <span class="hljs-keyword">else</span>:
             print(<span class="hljs-string">f"Unsupported file type: <span class="hljs-subst">{filename}</span>"</span>)
             <span class="hljs-keyword">continue</span>
         documents.extend(loader.load())
     <span class="hljs-keyword">return</span> documents

 folder_path = <span class="hljs-string">"/docs"</span>
 documents = load_documents(folder_path)
 print(<span class="hljs-string">f"Loaded <span class="hljs-subst">{len(documents)}</span> documents from the folder."</span>)
</code></pre>
<ol start="2">
<li><strong>Split Text into Chunks</strong>: Prepare documents for vectorization by splitting them into manageable chunks, which is essential for RAG.</li>
</ol>
<pre><code class="lang-python"><span class="hljs-keyword">from</span> langchain_text_splitters <span class="hljs-keyword">import</span> RecursiveCharacterTextSplitter

text_splitter = RecursiveCharacterTextSplitter(
     chunk_size=<span class="hljs-number">1000</span>,
     chunk_overlap=<span class="hljs-number">200</span>,
     length_function=len
)

 splits = text_splitter.split_documents(documents)
 print(<span class="hljs-string">f"Split the documents into <span class="hljs-subst">{len(splits)}</span> chunks."</span>)
</code></pre>
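<p>To build intuition for <code>chunk_size</code> and <code>chunk_overlap</code>, here is a simplified character-window sketch. This is not the actual <code>RecursiveCharacterTextSplitter</code> (which also splits on separators such as paragraphs and sentences), just an illustration of how overlapping windows preserve context across chunk boundaries:</p>
<pre><code class="lang-python">def naive_split(text: str, chunk_size: int, chunk_overlap: int) -> list:
    # Slide a window of chunk_size characters, stepping by
    # chunk_size - chunk_overlap so consecutive chunks share context.
    step = chunk_size - chunk_overlap
    return [text[i:i + chunk_size] for i in range(0, len(text), step)]

text = "".join(str(i % 10) for i in range(2500))
chunks = naive_split(text, chunk_size=1000, chunk_overlap=200)
print(len(chunks))                          # 4 chunks for 2500 characters
print(chunks[0][-200:] == chunks[1][:200])  # True: the 200-char overlap
</code></pre>
<p>Because each chunk repeats the last 200 characters of the previous one, a sentence that straddles a boundary still appears whole in at least one chunk.</p>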
<ol start="3">
<li><strong>Generate Embeddings</strong>: Use <a target="_blank" href="https://blog.futuresmart.ai/i-never-knew-sentence-transformers-could-be-so-useful"><strong>SentenceTransformers</strong></a> to create embeddings for efficient similarity searches in your RAG pipeline.</li>
</ol>
<pre><code class="lang-python"><span class="hljs-keyword">from</span> langchain_community.embeddings <span class="hljs-keyword">import</span> HuggingFaceEmbeddings

model_name = <span class="hljs-string">"sentence-transformers/all-MiniLM-L6-v2"</span>
model_kwargs = {<span class="hljs-string">'device'</span>: <span class="hljs-string">'cpu'</span>}
encode_kwargs = {<span class="hljs-string">'normalize_embeddings'</span>: <span class="hljs-literal">False</span>}
embedding_function = HuggingFaceEmbeddings(
    model_name=model_name,
    model_kwargs=model_kwargs,
    encode_kwargs=encode_kwargs
)
</code></pre>
<ol start="4">
<li><p><strong>Create and Persist a Vector Store</strong>:</p>
<p> Store embeddings for future retrieval with <strong>Chroma</strong>.</p>
<pre><code class="lang-python"> <span class="hljs-keyword">from</span> langchain_chroma <span class="hljs-keyword">import</span> Chroma

 collection_name = <span class="hljs-string">"my_collection"</span>
 vectorstore = Chroma.from_documents(
      collection_name=collection_name,
      documents=splits,
      embedding=embedding_function,
      persist_directory=<span class="hljs-string">"./chroma_db"</span>
 )
</code></pre>
</li>
<li><p><strong>Build the Retriever Tool</strong>:</p>
<p> Implement a retriever to fetch semantically similar documents for user queries. This tool powers <strong>RAG with action reply in specific format</strong> or customized responses.</p>
<pre><code class="lang-python">
 <span class="hljs-keyword">from</span> langchain.tools <span class="hljs-keyword">import</span> tool
 <span class="hljs-keyword">from</span> pydantic <span class="hljs-keyword">import</span> BaseModel
 <span class="hljs-keyword">from</span> langchain_chroma <span class="hljs-keyword">import</span> Chroma

 <span class="hljs-class"><span class="hljs-keyword">class</span> <span class="hljs-title">RagToolSchema</span>(<span class="hljs-params">BaseModel</span>):</span>
     question: str

<span class="hljs-meta"> @tool(args_schema=RagToolSchema)</span>
 <span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">retriever_tool</span>(<span class="hljs-params">question: str</span>):</span>
     <span class="hljs-string">"""Tool to Retrieve Semantically Similar documents to answer User Questions
     related to FutureSmart Tech Store and its Processes.

     `question`: str
     """</span>
     print(<span class="hljs-string">"INSIDE RETRIEVER NODE"</span>, question)

     vectorstore = Chroma(
         collection_name=<span class="hljs-string">"my_collection"</span>,
         embedding_function=embedding_function,
         persist_directory=<span class="hljs-string">"./chroma_db"</span>
     )

     retriever = vectorstore.as_retriever(search_kwargs={<span class="hljs-string">"k"</span>: <span class="hljs-number">5</span>})
     retriever_result = retriever.invoke(question)

     context = <span class="hljs-string">"\n\n"</span>.join(doc.page_content <span class="hljs-keyword">for</span> doc <span class="hljs-keyword">in</span> retriever_result)
     <span class="hljs-keyword">return</span> <span class="hljs-string">f"(context: <span class="hljs-subst">{context}</span>)"</span>
</code></pre>
<p> This tool allows your AI agent to retrieve relevant chunks of information from your document database, making it highly effective for knowledge-based tasks.</p>
</li>
</ol>
<p>You can test <code>retriever_tool</code> right now to see how it performs:</p>
<pre><code class="lang-python"><span class="hljs-comment"># pass a question to the tool</span>
retriever_results = retriever_tool.invoke({<span class="hljs-string">"question"</span>: <span class="hljs-string">"what are different payment methods supported"</span>})
print(retriever_results)
</code></pre>
<pre><code class="lang-python">INSIDE RETRIEVER NODE what are different payment methods supported
<span class="hljs-string">'At FutureSmart TechStore, weaimtomakeyourshoppingexperienceasseamlessandsecureaspossible. Wesupport avarietyof payment methodstosuit yourconvenience.\nSupportedPaymentMethods\n1. CreditandDebitCards○ Visa○ Mastercard○ AmericanExpress○ RuPay2. DigitalWallets○ Paytm○ PhonePe○ GooglePay○ AmazonPay3. NetBanking○ Supportedforall majorIndianbanksincludingHDFC, ICICI, SBI, AxisBank, andmore.4. UPIPayments○ SeamlesspaymentsusingUnifiedPaymentsInterface(UPI).5. CashonDelivery(COD)○ Availableforselect locationsinIndia.○ MaximumCODlimit: INR10,000.6. EMIOptions○ No-cost EMI andstandardEMI optionsareavailableformajorcredit cards.○ EMI optionsdependonthebankandcardtype.7. BankTransfers○ Direct banktransfersaresupportedforbulkorbusinesspurchases.\nPaymentSecurity\n● All onlinetransactionsareprocessedthroughsecureandencryptedgatewaystoensurethesafetyof yourinformation.● Wecomplywiththelatest PCI DSS(Payment CardIndustryDataSecurityStandard)guidelines.\nImportantNotes\n\nAt FutureSmart TechStore, weaimtomakeyourshoppingexperienceasseamlessandsecureaspossible. Wesupport avarietyof payment methodstosuit yourconvenience.\nSupportedPaymentMethods\n1. CreditandDebitCards○ Visa○ Mastercard○ AmericanExpress○ RuPay2. DigitalWallets○ Paytm○ PhonePe○ GooglePay○ AmazonPay3. NetBanking○ Supportedforall majorIndianbanksincludingHDFC, ICICI, SBI, AxisBank, andmore.4. UPIPayments○ SeamlesspaymentsusingUnifiedPaymentsInterface(UPI).5. CashonDelivery(COD)○ Availableforselect locationsinIndia.○ MaximumCODlimit: INR10,000.6. EMIOptions○ No-cost EMI andstandardEMI optionsareavailableformajorcredit cards.○ EMI optionsdependonthebankandcardtype.7. BankTransfers○ Direct banktransfersaresupportedforbulkorbusinesspurchases.\nPaymentSecurity\n● All onlinetransactionsareprocessedthroughsecureandencryptedgatewaystoensurethesafetyof yourinformation.● Wecomplywiththelatest PCI DSS(Payment CardIndustryDataSecurityStandard)guidelines.\nImportantNotes'</span>
</code></pre>
<div data-node-type="callout">
<div data-node-type="callout-emoji">💡</div>
<div data-node-type="callout-text">Check out how we <a target="_self" href="https://youtu.be/8sSHg1034r0?si=KWNLKnIQ-AxF7Y0_">built a RAG application from scratch without using LangChain or LlamaIndex</a></div>
</div>

<h2 id="heading-nl2sql-tool-integration">NL2SQL Tool Integration</h2>
<p>We now have the Summarization and RAG tools ready for the AI Agent, leaving only the NL2SQL tool to be built.</p>
<p>The SQL Agent serves as a bridge between natural language and SQL databases by generating and executing SQL queries from user questions. This allows the AI Agent to efficiently handle and answer database-related queries.</p>
<h2 id="heading-steps-to-integrate-the-nl2sql-tool"><strong>Steps to Integrate the NL2SQL Tool</strong></h2>
<p>We're using a MySQL database containing an e-commerce dataset to test SQL queries. The dataset includes Product and Order tables.</p>
<ol>
<li><p><strong>Initialize the Database Connection</strong><br /> First, establish a connection with the <strong>LangChain SQL Database</strong> utility.</p>
<pre><code class="lang-python"> <span class="hljs-keyword">from</span> langchain_community.utilities <span class="hljs-keyword">import</span> SQLDatabase
 <span class="hljs-keyword">import</span> os
 db_user = os.getenv(<span class="hljs-string">"DB_USER"</span>)
 db_password = os.getenv(<span class="hljs-string">"DB_PASSWORD"</span>)
 db_host = os.getenv(<span class="hljs-string">"DB_HOST"</span>)
 db_name = os.getenv(<span class="hljs-string">"DB_NAME"</span>)

 db = SQLDatabase.from_uri(<span class="hljs-string">f"mysql+pymysql://<span class="hljs-subst">{db_user}</span>:<span class="hljs-subst">{db_password}</span>@<span class="hljs-subst">{db_host}</span>/<span class="hljs-subst">{db_name}</span>"</span>,sample_rows_in_table_info=<span class="hljs-number">1</span>)
</code></pre>
</li>
<li><p><strong>Clean SQL Queries</strong><br /> This function is essential. In many client projects, we have seen that the SQL query generated by the LLM contains unnecessary symbols, stray text, or backticks, which cause errors at execution time. We therefore need a cleaning function for error-free <strong>text-to-SQL conversion</strong>.</p>
<pre><code class="lang-python"> <span class="hljs-keyword">import</span> re        

 <span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">clean_sql_query</span>(<span class="hljs-params">text: str</span>) -&gt; str:</span>

     <span class="hljs-comment"># Step 1: Remove code block syntax and any SQL-related tags</span>
     <span class="hljs-comment"># This handles variations like ```sql, ```SQL, ```SQLQuery, etc.</span>
     block_pattern = <span class="hljs-string">r"```(?:sql|SQL|SQLQuery|mysql|postgresql)?\s*(.*?)\s*```"</span>
     text = re.sub(block_pattern, <span class="hljs-string">r"\1"</span>, text, flags=re.DOTALL)

     <span class="hljs-comment"># Step 2: Handle "SQLQuery:" prefix and similar variations</span>
     <span class="hljs-comment"># This will match patterns like "SQLQuery:", "SQL Query:", "MySQL:", etc.</span>
     prefix_pattern = <span class="hljs-string">r"^(?:SQL\s*Query|SQLQuery|MySQL|PostgreSQL|SQL)\s*:\s*"</span>
     text = re.sub(prefix_pattern, <span class="hljs-string">""</span>, text, flags=re.IGNORECASE)

     <span class="hljs-comment"># Step 3: Extract the first SQL statement if there's random text after it</span>
     <span class="hljs-comment"># Look for a complete SQL statement ending with semicolon</span>
     sql_statement_pattern = <span class="hljs-string">r"(SELECT.*?;)"</span>
     sql_match = re.search(sql_statement_pattern, text, flags=re.IGNORECASE | re.DOTALL)
     <span class="hljs-keyword">if</span> sql_match:
         text = sql_match.group(<span class="hljs-number">1</span>)

     <span class="hljs-comment"># Step 4: Remove backticks around identifiers</span>
     text = re.sub(<span class="hljs-string">r'`([^`]*)`'</span>, <span class="hljs-string">r'\1'</span>, text)

     <span class="hljs-comment"># Step 5: Normalize whitespace</span>
     <span class="hljs-comment"># Replace multiple spaces with single space</span>
     text = re.sub(<span class="hljs-string">r'\s+'</span>, <span class="hljs-string">' '</span>, text)

     <span class="hljs-comment"># Step 6: Preserve newlines for main SQL keywords to maintain readability</span>
     keywords = [<span class="hljs-string">'SELECT'</span>, <span class="hljs-string">'FROM'</span>, <span class="hljs-string">'WHERE'</span>, <span class="hljs-string">'GROUP BY'</span>, <span class="hljs-string">'HAVING'</span>, <span class="hljs-string">'ORDER BY'</span>,
                <span class="hljs-string">'LIMIT'</span>, <span class="hljs-string">'JOIN'</span>, <span class="hljs-string">'LEFT JOIN'</span>, <span class="hljs-string">'RIGHT JOIN'</span>, <span class="hljs-string">'INNER JOIN'</span>,
                <span class="hljs-string">'OUTER JOIN'</span>, <span class="hljs-string">'UNION'</span>, <span class="hljs-string">'VALUES'</span>, <span class="hljs-string">'INSERT'</span>, <span class="hljs-string">'UPDATE'</span>, <span class="hljs-string">'DELETE'</span>]

     <span class="hljs-comment"># Case-insensitive replacement for keywords</span>
     pattern = <span class="hljs-string">'|'</span>.join(<span class="hljs-string">r'\b{}\b'</span>.format(k) <span class="hljs-keyword">for</span> k <span class="hljs-keyword">in</span> keywords)
     text = re.sub(<span class="hljs-string">f'(<span class="hljs-subst">{pattern}</span>)'</span>, <span class="hljs-string">r'\n\1'</span>, text, flags=re.IGNORECASE)

     <span class="hljs-comment"># Step 7: Final cleanup</span>
     <span class="hljs-comment"># Remove leading/trailing whitespace and extra newlines</span>
     text = text.strip()
     text = re.sub(<span class="hljs-string">r'\n\s*\n'</span>, <span class="hljs-string">'\n'</span>, text)

     <span class="hljs-keyword">return</span> text
</code></pre>
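<p> To see the cleaner in action, feed it a typical fenced LLM response: the code block, the "SQLQuery:" prefix, and the backticks are all stripped away. The block below repeats a condensed version of the same cleaning steps so the example runs standalone:</p>
<pre><code class="lang-python"> import re

 def clean_sql_query(text: str) -> str:
     # Condensed version of the cleaner above: strip code fences,
     # "SQLQuery:"-style prefixes, and backticks, then normalize spaces.
     text = re.sub(r"```(?:sql|SQL|SQLQuery|mysql|postgresql)?\s*(.*?)\s*```",
                   r"\1", text, flags=re.DOTALL)
     text = re.sub(r"^(?:SQL\s*Query|SQLQuery|MySQL|PostgreSQL|SQL)\s*:\s*",
                   "", text, flags=re.IGNORECASE)
     text = re.sub(r"`([^`]*)`", r"\1", text)
     return re.sub(r"\s+", " ", text).strip()

 raw = "SQLQuery: ```sql\nSELECT `order_status` FROM `Orders` WHERE order_id = 8;\n```"
 print(clean_sql_query(raw))  # SELECT order_status FROM Orders WHERE order_id = 8;
</code></pre>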
</li>
<li><p><strong>Construct a SQL chain</strong></p>
<pre><code class="lang-python"> <span class="hljs-keyword">from</span> langchain_community.tools <span class="hljs-keyword">import</span> QuerySQLDatabaseTool

 <span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">get_sql_chain</span>():</span>
     execute_query = QuerySQLDatabaseTool(db=db)

     prompt = ChatPromptTemplate.from_messages([
         (<span class="hljs-string">"system"</span>, <span class="hljs-string">"""You are a MySQL expert. Given an input question, 
         create a syntactically correct MySQL query to run. 
         Use the LIKE operator for all string matching queries, 
         ensuring proper use of wildcards (% or _) for partial matches. 
         Only use the = operator when querying the order_id or product_id column. 
         Here is the relevant table information: {table_info}"""</span>),
         (<span class="hljs-string">"human"</span>, <span class="hljs-string">"{human_message}"</span>)
     ])

     write_query = prompt | llm | StrOutputParser()

     chain = (
         RunnablePassthrough.assign(
             query=write_query | RunnableLambda(clean_sql_query)
         ).assign(
             result=itemgetter(<span class="hljs-string">"query"</span>) | execute_query
         )
     )

     <span class="hljs-keyword">return</span> chain
</code></pre>
</li>
<li><p><strong>Create the NL2SQL Tool</strong><br /> Here’s how to build and integrate the <strong>NL2SQL tool</strong>:</p>
<pre><code class="lang-python"> <span class="hljs-keyword">from</span> langchain_community.tools <span class="hljs-keyword">import</span> QuerySQLDatabaseTool
 <span class="hljs-keyword">from</span> operator <span class="hljs-keyword">import</span> itemgetter
 <span class="hljs-keyword">import</span> re
 <span class="hljs-keyword">from</span> langchain_core.output_parsers <span class="hljs-keyword">import</span> StrOutputParser
 <span class="hljs-keyword">from</span> langchain_core.prompts <span class="hljs-keyword">import</span> ChatPromptTemplate
 <span class="hljs-keyword">from</span> langchain_core.runnables <span class="hljs-keyword">import</span> RunnablePassthrough, RunnableLambda

 <span class="hljs-keyword">from</span> typing <span class="hljs-keyword">import</span> Optional

 <span class="hljs-class"><span class="hljs-keyword">class</span> <span class="hljs-title">SQLToolSchema</span>(<span class="hljs-params">BaseModel</span>):</span>
     question: str
     order_id: str
     product_id: Optional[str] = <span class="hljs-literal">None</span>

<span class="hljs-meta"> @tool(args_schema=SQLToolSchema)</span>
 <span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">nl2sql_tool</span>(<span class="hljs-params">question: str, order_id: str, product_id: str</span>):</span>
   <span class="hljs-string">"""Tool to get Product detail or Order status
   `question`: str, `order_id`: str, `product_id`: str"""</span>

   print(<span class="hljs-string">"INSIDE NL2SQL TOOL"</span>, question)
   chain = get_sql_chain()
   table_info = db.get_table_info() 
   human_message = <span class="hljs-string">f"question: <span class="hljs-subst">{question}</span>, order id: <span class="hljs-subst">{order_id}</span>, product_id: <span class="hljs-subst">{product_id}</span>, sql_query:"</span> 
   print(<span class="hljs-string">"human_message:"</span>, human_message)
   response = chain.invoke({<span class="hljs-string">"human_message"</span>: human_message, <span class="hljs-string">"table_info"</span>:table_info})
   <span class="hljs-keyword">return</span> <span class="hljs-string">f"Product detail and Order status: <span class="hljs-subst">{response[<span class="hljs-string">'result'</span>]}</span>, SQL Query used:<span class="hljs-subst">{response[<span class="hljs-string">'query'</span>]}</span>"</span>
</code></pre>
</li>
<li><p><strong>Test the Tool</strong><br /> Use a sample query to verify functionality:</p>
<pre><code class="lang-python"> question = <span class="hljs-string">"Check Order status"</span>
 order_id = <span class="hljs-string">"8"</span>
 product_id = <span class="hljs-string">""</span>
 result = nl2sql_tool.invoke({<span class="hljs-string">"question"</span>: question,<span class="hljs-string">"order_id"</span>: order_id, <span class="hljs-string">"product_id"</span>:product_id})
 print(<span class="hljs-string">f"Question: <span class="hljs-subst">{question}</span>"</span>)
 print(<span class="hljs-string">f"Answer: <span class="hljs-subst">{result}</span>"</span>)
</code></pre>
<pre><code class="lang-python">  <span class="hljs-comment"># output</span>
 INSIDE NL2SQL TOOL Check Order status
 human_message: question: Check Order status, order id: <span class="hljs-number">8</span>, product_id: , sql_query:
 Question: Check Order status
 Answer: Product detail <span class="hljs-keyword">and</span> Order status: [(<span class="hljs-string">'Processing'</span>,)], SQL Query used:SELECT order_status 
 FROM Orders 
 WHERE order_id = <span class="hljs-number">8</span>;
</code></pre>
</li>
</ol>
<blockquote>
<p>To master NL2SQL (Natural Language to SQL) with advanced techniques like few-shot example prompts (dynamic and static), dynamic table selection, and enhanced SQL query generation, here's a <a target="_blank" href="https://blog.futuresmart.ai/mastering-natural-language-to-sql-with-langchain-nl2sql">detailed guide</a>.</p>
</blockquote>
<h2 id="heading-reply-tool">Reply Tool</h2>
<p>Let's build a Reply Tool for your <strong>AI-powered automated answering service</strong>. This tool drafts a professional, customized reply for each customer query and posts it back to the Freshdesk ticket. It tracks every response with a unique reply ID, making follow-ups simple and organized, so your support team can deliver consistent, high-quality answers every time.</p>
<ol>
<li><p><strong>Create Reply Function</strong>: This function hits the POST endpoint of the specific ticket. It uses the ticket's ID and the reply content (provided as input)</p>
<pre><code class="lang-python"> <span class="hljs-keyword">import</span> requests

 <span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">create_reply</span>(<span class="hljs-params">ticket_id, body</span>):</span>
     <span class="hljs-keyword">try</span>:
         url = <span class="hljs-string">f'https://<span class="hljs-subst">{FRESHDESK_URL}</span>.freshdesk.com/api/v2/tickets/<span class="hljs-subst">{ticket_id}</span>/reply'</span>
         headers = {
             <span class="hljs-string">'Content-Type'</span>: <span class="hljs-string">'application/json'</span>
         }
         auth = (FRESHDESK_API_KEY, <span class="hljs-string">'X'</span>)  <span class="hljs-comment"># Freshdesk expects the API key as the username and 'X' as a placeholder password</span>
         data = {
             <span class="hljs-string">"body"</span>: body
         }
         response = requests.post(url, headers=headers, auth=auth, json=data)
         <span class="hljs-keyword">return</span> response.json()
     <span class="hljs-keyword">except</span> Exception <span class="hljs-keyword">as</span> e:
         print(<span class="hljs-string">"Error in create_reply : "</span>, e)
         <span class="hljs-keyword">return</span> <span class="hljs-string">""</span>
</code></pre>
</li>
<li><p><strong>Reply Tool</strong>: This code creates a tool that generates professional replies for customer tickets using AI. It takes ticket details and generates a customized reply, which is then sent to the customer via the Freshdesk system.</p>
<pre><code class="lang-python"> <span class="hljs-keyword">from</span> langchain.tools <span class="hljs-keyword">import</span> tool
 <span class="hljs-keyword">from</span> pydantic <span class="hljs-keyword">import</span> BaseModel

 <span class="hljs-class"><span class="hljs-keyword">class</span> <span class="hljs-title">ReplyToTicket</span>(<span class="hljs-params">BaseModel</span>):</span>
     information: str
     ticket_id: str

<span class="hljs-meta"> @tool(args_schema=ReplyToTicket)</span>
 <span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">reply_to_ticket_tool</span>(<span class="hljs-params">information: str, ticket_id: str</span>):</span>
     <span class="hljs-string">"""Tool that will send a reply to customer with a ticket id
     `information`: str, 
     `ticket_id`: str"""</span>

     print(<span class="hljs-string">"INSIDE REPLY TOOL"</span>, ticket_id)
     prompt = ChatPromptTemplate.from_messages([
         (<span class="hljs-string">"system"</span>, <span class="hljs-string">"""
 **Introduction to FutureSmart TechStore:**
 FutureSmart TechStore is a leading retailer specializing in cutting-edge technology products, ensuring you have access to the latest innovations to meet your needs. We pride ourselves on our customer service and offer support in multiple languages to cater to our diverse clientele. Our inventory includes a wide range of electronics, smart home devices, and accessories, all curated to enhance your tech lifestyle.

 **Guidelines for Responding to Customer Queries:**

 **Identify Language and Respond in English:**
    Analyze the language of the customer's query to ensure accurate understanding. While responding, provide all replies in English to maintain consistency.

 **Order Information and Query Specifics:**
    Utilize provided order details to address specific customer queries accurately. Ensure responses are tailored to address only the aspects of the order that the customer has inquired about. Do not include unnecessary order details unless requested.

 By adhering to these guidelines, ensure that your responses are informative, accurate, and aligned with FutureSmart TechStore's commitment to customer satisfaction. Remember to always maintain a friendly and professional tone in all communications.

 You are an AI assistant tasked with crafting a professional customer reply in Gmail format. Mention FutureSmart TechStore Support, 9876543210 Phone number. Avoid placeholders or incomplete information, and provide clear, actionable steps to resolve the issue"""</span>),
         (<span class="hljs-string">"human"</span>, <span class="hljs-string">"context: {information}"</span> )
     ])
     chain = prompt | llm | StrOutputParser()
     body = chain.invoke({<span class="hljs-string">"information"</span>: information})
     res = create_reply(ticket_id, body) <span class="hljs-keyword">or</span> {}  <span class="hljs-comment"># create_reply returns an empty string on error</span>
     <span class="hljs-keyword">return</span> body, <span class="hljs-string">f"reply id: <span class="hljs-subst">{res.get(<span class="hljs-string">'id'</span>)}</span>"</span>
</code></pre>
</li>
<li><p><strong>Test the tool</strong></p>
<pre><code class="lang-python"> information = <span class="hljs-string">"""
 Customer Name: John Doe
 Order Number: FS123456789
 Issue: The customer received a damaged smart home device (a smart speaker) and wants a replacement. 
 Additional Information: The order was delivered on 28th December 2024, and the damage was noted upon unboxing. The customer has attached photos of the damaged item for reference and is requesting a quick resolution.
 """</span>
 ticket_id = <span class="hljs-string">"12345"</span>

 reply_to_ticket_tool.invoke({<span class="hljs-string">"information"</span>: information, <span class="hljs-string">"ticket_id"</span>: ticket_id})
</code></pre>
</li>
</ol>
<h2 id="heading-combining-the-tools">Combining the Tools</h2>
<pre><code class="lang-python">tools = [summarize_tool, retriever_tool, nl2sql_tool, reply_to_ticket_tool]
llm_with_tools = llm.bind_tools(tools)
</code></pre>
<h2 id="heading-building-the-langgraph">Building the LangGraph</h2>
<p>LangGraph enables you to define a stateful workflow for your AI agent. By structuring nodes and edges, you control how the agent processes user inputs and transitions between tools.</p>
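<p>The control flow can be sketched in plain Python before we build it: a chatbot node runs, a condition checks whether the model requested a tool, and execution either loops through the tool node or ends. This is only a toy state machine to illustrate the routing; the names <code>chatbot</code> and the <code>tools_condition</code> analogue mirror the real nodes used below, but the implementation here is hypothetical:</p>
<pre><code class="lang-python"># Toy state machine: chatbot -> (tools -> chatbot)* -> end
def chatbot(messages):
    # Pretend the model requests a tool once, then answers.
    if any(role == "tool" for role, _ in messages):
        return ("assistant", "final answer")
    return ("tool_call", "retriever_tool")

def run_graph(user_input: str) -> list:
    messages = [("user", user_input)]
    while True:
        reply = chatbot(messages)
        messages.append(reply)
        if reply[0] != "tool_call":    # tools_condition analogue
            return messages
        # the "tools" node executes the requested tool; its result feeds back in
        messages.append(("tool", "retrieved context"))

print(run_graph("What payment methods are supported?")[-1])
</code></pre>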
<h3 id="heading-steps-to-build-the-langgraph"><strong>Steps to Build the LangGraph</strong></h3>
<ol>
<li><p><strong>Define the State</strong>: Create a <code>State</code> dictionary to manage the agent’s inputs and outputs.</p>
<pre><code class="lang-python"> <span class="hljs-keyword">from</span> typing <span class="hljs-keyword">import</span> Annotated
 <span class="hljs-keyword">from</span> typing_extensions <span class="hljs-keyword">import</span> TypedDict
 <span class="hljs-keyword">from</span> langgraph.graph.message <span class="hljs-keyword">import</span> add_messages

 <span class="hljs-comment"># Setting up the graph state</span>
 <span class="hljs-class"><span class="hljs-keyword">class</span> <span class="hljs-title">State</span>(<span class="hljs-params">TypedDict</span>):</span>
     messages: Annotated[list, add_messages]
</code></pre>
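<p> Conceptually, <code>add_messages</code> is a reducer: each node returns a partial state, and the reducer merges the new messages into the existing list instead of overwriting it. A minimal plain-Python stand-in (not LangGraph's actual implementation) makes the idea concrete:</p>
<pre><code class="lang-python">from typing import Annotated, TypedDict

def add_messages(existing: list, new: list) -> list:
    # Stand-in reducer: append new messages rather than replace the list.
    return existing + new

class State(TypedDict):
    messages: Annotated[list, add_messages]

state: State = {"messages": [("user", "hi")]}
update = {"messages": [("assistant", "hello!")]}
state["messages"] = add_messages(state["messages"], update["messages"])
print(state["messages"])  # both messages are kept
</code></pre>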
</li>
<li><p><strong>Define Checkpointer:</strong> <code>MemorySaver</code> allow LangGraph agents to persist their state within and across multiple interactions.</p>
<pre><code class="lang-python"> <span class="hljs-keyword">from</span> langgraph.checkpoint.memory <span class="hljs-keyword">import</span> MemorySaver

 memory = MemorySaver()
</code></pre>
</li>
<li><p><strong>Add Nodes</strong>: Add nodes for the chatbot and tools to handle user queries and invoke the tools.</p>
<pre><code class="lang-python"> <span class="hljs-keyword">from</span> langgraph.graph <span class="hljs-keyword">import</span> StateGraph
 <span class="hljs-keyword">from</span> langgraph.prebuilt <span class="hljs-keyword">import</span> ToolNode

 <span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">chatbot</span>(<span class="hljs-params">state: State</span>):</span>
      <span class="hljs-keyword">return</span> {<span class="hljs-string">"messages"</span>: [llm_with_tools.invoke(state[<span class="hljs-string">"messages"</span>])]}

  graph_builder = StateGraph(State)
  graph_builder.add_node(<span class="hljs-string">"chatbot"</span>, chatbot)

  tool_node = ToolNode(tools=tools)
  graph_builder.add_node(<span class="hljs-string">"tools"</span>, tool_node)
</code></pre>
</li>
<li><p><strong>Define Edges</strong>: Use conditional edges to determine when the agent should switch between nodes.</p>
<pre><code class="lang-python"> <span class="hljs-keyword">from</span> langgraph.prebuilt <span class="hljs-keyword">import</span> tools_condition

  graph_builder.add_conditional_edges(<span class="hljs-string">"chatbot"</span>, tools_condition)
  graph_builder.add_edge(<span class="hljs-string">"tools"</span>, <span class="hljs-string">"chatbot"</span>)
  graph_builder.set_entry_point(<span class="hljs-string">"chatbot"</span>)
</code></pre>
</li>
<li><p><strong>Compile the Graph</strong>: Finalize the graph for execution.</p>
<pre><code class="lang-python"> graph = graph_builder.compile(checkpointer=memory)
</code></pre>
</li>
</ol>
<h2 id="heading-testing-the-ai-agent">Testing the AI Agent</h2>
<p>Once the LangGraph is set up, you can test the agent by simulating user inputs. This ensures the tools and workflows are functioning as expected.</p>
<h3 id="heading-interactive-testing"><strong>Interactive Testing</strong></h3>
<p>Run the following code to test your AI agent interactively:</p>
<pre><code class="lang-python">config = {<span class="hljs-string">"configurable"</span>: {<span class="hljs-string">"thread_id"</span>: <span class="hljs-string">"1"</span>}}

<span class="hljs-keyword">while</span> <span class="hljs-literal">True</span>:
    user_input = input(<span class="hljs-string">"User: "</span>)
    <span class="hljs-keyword">if</span> user_input.lower() <span class="hljs-keyword">in</span> [<span class="hljs-string">"quit"</span>, <span class="hljs-string">"exit"</span>, <span class="hljs-string">"q"</span>]:
        print(<span class="hljs-string">"Goodbye!"</span>)
        <span class="hljs-keyword">break</span>

    <span class="hljs-keyword">for</span> event <span class="hljs-keyword">in</span> graph.stream({<span class="hljs-string">"messages"</span>: [(<span class="hljs-string">"user"</span>, user_input)]}, config):
        <span class="hljs-keyword">for</span> value <span class="hljs-keyword">in</span> event.values():
            print(<span class="hljs-string">"Assistant:"</span>, value[<span class="hljs-string">"messages"</span>][<span class="hljs-number">-1</span>].content)
</code></pre>
<p>You can provide queries like:</p>
<ul>
<li><p>"Does FutureSmart TechStore support RuPay credit cards?" (Trigger Retriever or RAG tool). You can ask any question based on your knowledge base.</p>
</li>
<li><p>"Brief me about the ticket with ticket id 130?" (Trigger Summary Tool)</p>
</li>
<li><p>"Tell me the delivery status of order id 10" (Trigger NL2SQL tool)</p>
</li>
</ul>
<p>The AI agent will invoke the appropriate tool to generate responses.</p>
<p><strong>A Conversation between Agent and User:</strong></p>
<pre><code class="lang-yaml"><span class="hljs-attr">User:</span>  <span class="hljs-string">reply</span> <span class="hljs-string">to</span> <span class="hljs-string">ticket</span> <span class="hljs-number">100</span>
<span class="hljs-attr">Assistant:</span> 
<span class="hljs-string">INSIDE</span> <span class="hljs-string">SUMMARIZE</span> <span class="hljs-string">NODE</span> <span class="hljs-number">100</span>
<span class="hljs-attr">Subject :</span> <span class="hljs-string">Order</span> <span class="hljs-number">8</span>
<span class="hljs-attr">User query :</span> 
<span class="hljs-string">Dear</span> <span class="hljs-string">Support</span> <span class="hljs-string">Team,</span>  <span class="hljs-string">I</span> <span class="hljs-string">am</span> <span class="hljs-string">deeply</span> <span class="hljs-string">disappointed</span> <span class="hljs-string">that</span> <span class="hljs-string">my</span> <span class="hljs-string">USB-C</span> <span class="hljs-string">Charger</span> <span class="hljs-string">(Order</span> <span class="hljs-comment">#8), expected by Dec 23, did not arrive in time for an important occasion. To make matters worse, I was informed I cannot cancel the order. Please provide an update on when I will receive the item.  I hope for a prompt resolution.  Thank you.</span>

<span class="hljs-attr">Assistant:</span> [<span class="hljs-string">"ticket_id: 100"</span>, <span class="hljs-string">"Ticket Summary: The customer, deeply disappointed, reports that their USB-C Charger (Order #8) did not arrive by the expected date of December 23 for an important occasion. Additionally, they were informed that they cannot cancel the order. They are requesting an update on when they will receive the item and hope for a prompt resolution. Customer name is not provided."</span>]
<span class="hljs-attr">Assistant:</span> 
<span class="hljs-string">INSIDE</span> <span class="hljs-string">NL2SQL</span> <span class="hljs-string">TOOL</span> <span class="hljs-string">Update</span> <span class="hljs-string">on</span> <span class="hljs-string">delivery</span> <span class="hljs-string">status</span> <span class="hljs-string">for</span> <span class="hljs-string">Order</span> <span class="hljs-comment">#8</span>
<span class="hljs-attr">Assistant: Product detail and Order status:</span> [<span class="hljs-string">('Processing'</span>,<span class="hljs-string">)</span>]<span class="hljs-string">,</span> <span class="hljs-string">SQL</span> <span class="hljs-string">Query</span> <span class="hljs-string">used:SELECT</span> <span class="hljs-string">order_status</span> 
<span class="hljs-string">FROM</span> <span class="hljs-string">Orders</span> 
<span class="hljs-string">WHERE</span> <span class="hljs-string">order_id</span> <span class="hljs-string">=</span> <span class="hljs-number">8</span><span class="hljs-string">;</span>
<span class="hljs-attr">Assistant:</span> 
<span class="hljs-string">INSIDE</span> <span class="hljs-string">REPLY</span> <span class="hljs-string">TOOL</span> <span class="hljs-number">100</span>
<span class="hljs-attr">Assistant:</span> [<span class="hljs-string">"Subject: Update on Your USB-C Charger Order\n\nDear [Customer's Name],\n\nThank you for contacting FutureSmart TechStore and sharing your concerns about the delay in your USB-C Charger delivery. We understand how crucial this item is for your upcoming occasion, and we sincerely apologize for any inconvenience caused by the delay.\n\nAs of now, your order status is still marked as 'Processing'. Please rest assured that we are actively working to expedite the process and get your charger to you as soon as possible. We are committed to keeping you updated with any changes to your order status.\n\nIn the meantime, if you have any further questions or if there's anything else we can do to assist you, please do not hesitate to reach out to us. You can contact FutureSmart TechStore Support at 9876543210.\n\nThank you for your patience and understanding. We appreciate your trust in FutureSmart TechStore and are dedicated to resolving this promptly.\n\nBest regards,\n\n[Your Name]  \nFutureSmart TechStore Support  \n9876543210"</span>, <span class="hljs-string">"reply id: 11000010366579"</span>]
<span class="hljs-attr">Assistant:</span> <span class="hljs-string">I</span> <span class="hljs-string">have</span> <span class="hljs-string">sent</span> <span class="hljs-string">a</span> <span class="hljs-string">reply</span> <span class="hljs-string">to</span> <span class="hljs-string">the</span> <span class="hljs-string">customer</span> <span class="hljs-string">regarding</span> <span class="hljs-string">ticket</span> <span class="hljs-comment">#100. The customer has been informed about the current status of their order and that it is still marked as "Processing". We have assured them that we are actively working to expedite the process and apologize for any inconvenience caused. The reply includes our contact information for further assistance.</span>
<span class="hljs-attr">User:</span>  <span class="hljs-string">is</span> <span class="hljs-string">visa</span> <span class="hljs-string">credit</span> <span class="hljs-string">card</span> <span class="hljs-string">accepted</span> <span class="hljs-string">by</span> <span class="hljs-string">futuresmart</span> <span class="hljs-string">techstore</span>
<span class="hljs-attr">Assistant:</span> 
<span class="hljs-string">INSIDE</span> <span class="hljs-string">RETRIEVER</span> <span class="hljs-string">NODE</span> <span class="hljs-string">Is</span> <span class="hljs-string">Visa</span> <span class="hljs-string">credit</span> <span class="hljs-string">card</span> <span class="hljs-string">accepted</span> <span class="hljs-string">by</span> <span class="hljs-string">FutureSmart</span> <span class="hljs-string">TechStore?</span>
Assistant: (context: At FutureSmart TechStore, we aim to make your shopping experience as seamless and secure as possible. We support a variety of payment methods to suit your convenience.
            Supported Payment Methods
            1. Credit and Debit Cards: Visa, Mastercard, American Express, RuPay
            2. Digital Wallets: Paytm, PhonePe, Google Pay, Amazon Pay
            3. Net Banking: Supported for all major Indian banks including HDFC, ICICI, SBI, Axis Bank, and more.
            4. UPI Payments: Seamless payments using Unified Payments Interface (UPI).
            5. Cash on Delivery (COD): Available for select locations in India. Maximum COD limit: INR 10,000.
            6. EMI Options: No-cost EMI and standard EMI options are available for major credit cards. EMI options depend on the bank and card type.
            7. Bank Transfers: Direct bank transfers are supported for bulk or business purchases.
            Payment Security
            - All online transactions are processed through secure and encrypted gateways to ensure the safety of your information.
            - We comply with the latest PCI DSS (Payment Card Industry Data Security Standard) guidelines.
            Important Notes
            - Software, CDs, or DVDs that have been opened.
            - Items marked as "Final Sale" or "Non-Returnable" at the time of purchase.
            - Gift cards or promotional items.
            How to Initiate a Return
            1. Contact Us: Reach out to our customer service team via email at support@futuresmarttechstore.com or call us at +91-XXXXXXXXXX with your order details.
            2. Approval: Once your return request is approved, you will receive a Return Merchandise Authorization (RMA) number and detailed instructions.
            3. Ship the Item: Pack the item securely, include the RMA number, and send it to our return address: FutureSmart TechStore Returns Department, Mumbai, India, PIN: 400001
            Refund Process
            - Refunds will be processed once the returned item is inspected and approved.
            - Refunds will be issued to the original payment method within 7-10 business days after approval.
            - Shipping fees are non-refundable unless the return is due to a mistake on our part (e.g., wrong or defective item).
            Exchange Policy
            For any payment-related queries, feel free to contact us at support@futuresmarttechstore.com or call us at +91-XXXXXXXXXX.
            At FutureSmart TechStore, we are committed to delivering your orders swiftly and securely. Below are the details of our shipping policies to ensure a smooth and transparent shopping experience.
            Shipping Locations
            - We currently ship across India.
            - International shipping is not available at this time.
            Shipping Charges
            - Standard Shipping: Free for orders above INR 1,000. Orders below INR 1,000 will incur a shipping fee of INR 50.
            - Express Shipping: Available at INR 150 for eligible locations.
            Delivery Timeframes
            - Standard Shipping: 3-7 business days.
            - Express Shipping: 1-3 business days.
            - Delivery times may vary based on location and availability.
            Order Processing
            - Orders are processed within 24-48 hours of payment confirmation (excluding weekends and holidays).
            - You will receive a confirmation email with tracking details once your order is shipped.
            Tracking Your Order
            - All orders are shipped with a tracking number. You can monitor the progress of your delivery via the link provided in your confirmation email.
            Undeliverable Packages
            At FutureSmart TechStore, we strive to ensure customer satisfaction with every purchase. If you are not completely satisfied with your order, we offer a hassle-free return policy to make the process as smooth as possible.
            Eligibility for Returns
            - Products can be returned within 30 days of receipt.
            - Items must be in their original condition, unused, and in the original packaging with all tags, manuals, and accessories included.
            - Returns are applicable for items that are damaged, defective, or not as described.
            Non-Returnable Items
            The following items are not eligible for return:
            - Software, CDs, or DVDs that have been opened.
            - Items marked as "Final Sale" or "Non-Returnable" at the time of purchase.
            - Gift cards or promotional items.
            How to Initiate a Return)
<span class="hljs-attr">Assistant:</span> <span class="hljs-literal">Yes</span><span class="hljs-string">,</span> <span class="hljs-string">FutureSmart</span> <span class="hljs-string">TechStore</span> <span class="hljs-string">accepts</span> <span class="hljs-string">Visa</span> <span class="hljs-string">credit</span> <span class="hljs-string">cards</span> <span class="hljs-string">as</span> <span class="hljs-string">a</span> <span class="hljs-string">payment</span> <span class="hljs-string">method.</span> <span class="hljs-string">They</span> <span class="hljs-string">also</span> <span class="hljs-string">support</span> <span class="hljs-string">Mastercard,</span> <span class="hljs-string">American</span> <span class="hljs-string">Express,</span> <span class="hljs-string">and</span> <span class="hljs-string">RuPay.</span>
</code></pre>
<p>Let's break down how our AI agent handled this support request – it's a perfect example of multiple tools working together:</p>
<p><strong>Summary Tool in Action:</strong> From Ticket #100, it quickly identified the critical details:</p>
<ul>
<li><p>Order #8 for a USB-C Charger</p>
</li>
<li><p>Expected delivery: Dec 23</p>
</li>
<li><p>Customer pain points: Delivery delay and cancellation concerns</p>
</li>
</ul>
<p><strong>NL2SQL Tool at Work:</strong></p>
<ul>
<li><p>Translated the customer's query into precise SQL</p>
</li>
<li><p>Retrieved real-time status: "Processing"</p>
</li>
<li><p>No manual database handling needed</p>
</li>
</ul>
<p><strong>Reply Tool Magic:</strong> Created a professional response that:</p>
<ul>
<li><p>Acknowledged the delay with empathy</p>
</li>
<li><p>Provided current order status</p>
</li>
<li><p>Outlined next steps</p>
</li>
<li><p>Added support contact details</p>
</li>
</ul>
<p><strong>RAG Tool's Smart Retrieval:</strong></p>
<ul>
<li><p>Instantly pulled payment policy details</p>
</li>
<li><p>Confirmed Visa acceptance</p>
</li>
<li><p>Added value by mentioning other payment options (Mastercard, Amex)</p>
</li>
</ul>
<p>See how each tool contributes to creating a complete, professional support experience. This is exactly how your AI agent will handle real customer queries – with precision and context awareness.</p>
<h3 id="heading-visualizing-the-langgraph">Visualizing the LangGraph</h3>
<p>Visualization helps you understand the workflow of your AI agent and how it transitions between nodes and tools.</p>
<pre><code class="lang-python"><span class="hljs-keyword">from</span> IPython.display <span class="hljs-keyword">import</span> Image, display

<span class="hljs-keyword">try</span>:
    display(Image(graph.get_graph().draw_mermaid_png()))
<span class="hljs-keyword">except</span> Exception:
    print(<span class="hljs-string">"Error generating graph visualization."</span>)
</code></pre>
<p>The generated diagram will showcase nodes (<code>chatbot</code>, <code>tools</code>) and the transitions between them, providing a clear overview of your AI agent’s workflow.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1732629457981/85bce952-5850-45c4-86cf-20964c6ab41a.png?auto=compress,format&amp;format=webp" alt class="image--center mx-auto" /></p>
<h2 id="heading-improvements">Improvements</h2>
<ol>
<li><p>At FutureSmart AI, we have observed across many projects that implementing methods such as Contextual Retrieval and reranking measurably improves the performance of an Agentic RAG system.</p>
</li>
<li><p>Set up <strong>Workflow Automation</strong> to handle routine tasks and use AI-driven analytics for actionable insights. Having deployed these solutions for clients, we know which workflows deliver the best ROI.</p>
</li>
<li><p>Empower agents to resolve tickets with AI-suggested replies drawn from knowledge sources, ensuring accuracy with a human in the loop.</p>
</li>
</ol>
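<p>The reranking mentioned in point 1 can be sketched in a few lines. This is a hypothetical illustration rather than code from this post: <code>scorer</code> stands in for a cross-encoder (for example, <code>CrossEncoder(...).predict</code> from <code>sentence-transformers</code>), and the chunk texts are made up.</p>
<pre><code class="lang-python">def rerank(query, chunks, scorer, top_k=3):
    # Score each (query, chunk) pair, then keep only the highest-scoring
    # chunks for the agent's context window.
    pairs = [(query, chunk) for chunk in chunks]
    scores = scorer(pairs)
    ranked = sorted(zip(chunks, scores), key=lambda pair: pair[1], reverse=True)
    return [chunk for chunk, _ in ranked[:top_k]]

# Toy lexical-overlap scorer for demonstration; in production you would
# swap in a trained cross-encoder's predict function here.
def overlap_scorer(pairs):
    return [len(set(q.lower().split()).intersection(c.lower().split())) for q, c in pairs]
</code></pre>
<p>Running <code>rerank("is visa credit card accepted", chunks, overlap_scorer, top_k=1)</code> would surface the payment-policy chunk ahead of unrelated shipping text, which is the effect reranking has on retrieval quality.</p>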
<h2 id="heading-conclusion">Conclusion</h2>
<p>Congratulations! You've just built a powerful Freshdesk AI agent that combines LangGraph, RAG, and NL2SQL capabilities. Your agent can now handle everything from ticket summaries to product queries, making customer support smoother and more efficient.</p>
<p>But this is just the beginning. At FutureSmart AI, we've helped companies transform their customer support from reactive to proactive using these exact techniques. Our solutions have cut response times and boosted customer satisfaction. Want to see similar results for your business? Check out how we've helped companies like yours in our <a target="_blank" href="https://www.futuresmart.ai/case-studies">case studies</a>.</p>
<p>We have another <a target="_blank" href="https://blog.futuresmart.ai/hubspot-ai-agent">AI agent for HubSpot</a> that revolutionizes CRM management by interpreting user intent, automating tasks, and integrating tools for seamless operations. Whether you need support automation or CRM enhancement, we're here to guide you through your AI journey. Reach out at <a target="_blank" href="mailto:contact@futuresmart.ai">contact@futuresmart.ai</a> to learn more.</p>
<p>Stay tuned for more <a target="_blank" href="https://blog.futuresmart.ai/">tutorials</a> and insights as we continue to explore cutting-edge AI advancements, helping you stay ahead in the rapidly evolving tech landscape!</p>
<div data-node-type="callout">
<div data-node-type="callout-emoji">💡</div>
<div data-node-type="callout-text"><a target="_self" href="https://github.com/PradipNichite/FutureSmart-AI-Blog/tree/main/Freshdesk%20Ticketing%20Automation%20with%20AI%20Agents"><strong>Get the Full Code in our GitHub</strong></a></div>
</div>]]></content:encoded></item><item><title><![CDATA[Build a Multi-Agent System with LangGraph: A Complete Tutorial on Agent Orchestration]]></title><description><![CDATA[Introduction
In the last three blogs in our Ultimate Langraph Tutorial Series, we highlighted different components of LangGraph for beginners, Long-term Memory Support, and building an AI agent with custom tools support. After implementing these syst...]]></description><link>https://blog.futuresmart.ai/multi-agent-system-with-langgraph</link><guid isPermaLink="true">https://blog.futuresmart.ai/multi-agent-system-with-langgraph</guid><category><![CDATA[AI]]></category><category><![CDATA[langchain]]></category><category><![CDATA[Multi-Agent Systems (MAS)]]></category><category><![CDATA[RAG ]]></category><category><![CDATA[#agent]]></category><dc:creator><![CDATA[Rounak Show]]></dc:creator><pubDate>Sun, 05 Jan 2025 05:58:13 GMT</pubDate><enclosure url="https://cdn.hashnode.com/res/hashnode/image/upload/v1735370930881/f6664a0d-fbb5-487a-9793-331955273157.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h2 id="heading-introduction">Introduction</h2>
<p>In the last three blogs in our <a target="_blank" href="https://blog.futuresmart.ai/series/langraph-tutorial">Ultimate LangGraph Tutorial</a> series, we covered the core components of LangGraph for beginners, long-term memory support, and building an AI agent with custom tool support. After implementing these systems for various enterprise clients, we at <a target="_blank" href="https://www.futuresmart.ai/">FutureSmart AI</a> have observed that as these systems grow, they can become complex and hard to manage.</p>
<p>To solve this, we break down the application into smaller, <strong>independent agents</strong> - a pattern we've successfully deployed in production environments for multiple clients. This approach offers modularity, specialization, and control. We can work on individual agents separately, create expert agents, and control how they communicate. Having built numerous such systems at FutureSmart AI, we've seen firsthand how this architecture significantly improves <strong>maintainability</strong> and <strong>scalability</strong>.</p>
<p>In this blog post, we will learn to create a complete <strong>Multi-Agent System</strong> from scratch using <a target="_blank" href="https://blog.futuresmart.ai/langgraph-tutorial-for-beginners">LangGraph</a>. Here, a single <strong>Supervisor Agent</strong> communicates with other specialized agents. Each agent has its own set of tools, mirroring how we at FutureSmart AI structure enterprise-grade AI solutions.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1735243479772/e2c48b8a-4992-42af-b2ec-3d045082d794.png" alt="multi agent system with LangGraph" class="image--center mx-auto" /></p>
<h2 id="heading-setting-up-the-environment">Setting up the Environment</h2>
<p>To get started, install the dependencies below:</p>
<pre><code class="lang-python">%%capture --no-stderr
%pip install -U langgraph langchain_community langchain_openai langchain_experimental langchain-chroma pypdf sentence-transformers
</code></pre>
<h2 id="heading-setting-up-api-keys"><strong>Setting up API Keys</strong></h2>
<p>Before diving into building your LangGraph AI agents, it's crucial to set up your API keys. These keys allow your agent to interact with external tools like <a target="_blank" href="https://tavily.com/"><strong>Tavily Search</strong></a> and <a target="_blank" href="https://openai.com/index/openai-api/"><strong>OpenAI GPT</strong></a> models securely. Without them, the tools cannot function effectively.</p>
<p>Here we are using the OpenAI model, but you can use any LLM of your choice.</p>
<pre><code class="lang-python"><span class="hljs-keyword">import</span> getpass
<span class="hljs-keyword">import</span> os

<span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">_set_env</span>(<span class="hljs-params">var: str</span>):</span>
    <span class="hljs-keyword">if</span> <span class="hljs-keyword">not</span> os.environ.get(var):
        os.environ[var] = getpass.getpass(<span class="hljs-string">f"<span class="hljs-subst">{var}</span>: "</span>)

_set_env(<span class="hljs-string">"TAVILY_API_KEY"</span>)
_set_env(<span class="hljs-string">"OPENAI_API_KEY"</span>)
</code></pre>
<h2 id="heading-creating-the-llm-object"><strong>Creating the LLM Object</strong></h2>
<p>Here’s how to initialize the LLM using LangChain <code>ChatOpenAI</code></p>
<pre><code class="lang-python"><span class="hljs-keyword">from</span> langchain_openai <span class="hljs-keyword">import</span> ChatOpenAI

llm = ChatOpenAI(model_name=<span class="hljs-string">"gpt-4o"</span>)
</code></pre>
<h2 id="heading-create-tools">Create Tools</h2>
<p>In this blog, we will build three agents: one performs web research with a search engine tool, one retrieves documents and answers questions based on them, and one queries the SQL database.</p>
<p>In our previous LangGraph blog post, we already showed how to create a Web Search Tool, a Retriever Tool, and an NL2SQL Tool. You can follow those steps to create the tools for this tutorial.</p>
<ul>
<li><p><a target="_blank" href="https://blog.futuresmart.ai/langgraph-agent-with-rag-and-nl2sql#heading-websearch-tool-integration">Create Web Search Tool</a></p>
</li>
<li><p><a target="_blank" href="https://blog.futuresmart.ai/langgraph-agent-with-rag-and-nl2sql#heading-agentic-rag-tool-integration">Create Retriever Tool</a> (For RAG)</p>
</li>
<li><p><a target="_blank" href="https://blog.futuresmart.ai/langgraph-agent-with-rag-and-nl2sql#heading-nl2sql-tool-integration">Create NL2SQL Tool</a></p>
</li>
</ul>
<p>Now, once you have the tools ready, we are ready to proceed with creating the AI Agents.</p>
<h2 id="heading-creating-the-supervisor-agent">Creating the Supervisor Agent</h2>
<p>The supervisor agent is responsible for managing the conversation flow between different specialized agents. It decides which agent should handle the current request and when the task is complete. Let's see how to implement this:</p>
<pre><code class="lang-python"><span class="hljs-keyword">from</span> typing <span class="hljs-keyword">import</span> Literal
<span class="hljs-keyword">from</span> typing_extensions <span class="hljs-keyword">import</span> TypedDict
<span class="hljs-keyword">from</span> langgraph.graph <span class="hljs-keyword">import</span> MessagesState, START, END
<span class="hljs-keyword">from</span> langgraph.types <span class="hljs-keyword">import</span> Command

<span class="hljs-comment"># Define available agents</span>
members = [<span class="hljs-string">"web_researcher"</span>, <span class="hljs-string">"rag"</span>, <span class="hljs-string">"nl2sql"</span>]
<span class="hljs-comment"># Add FINISH as an option for task completion</span>
options = members + [<span class="hljs-string">"FINISH"</span>]

<span class="hljs-comment"># Create system prompt for supervisor</span>
system_prompt = (
    <span class="hljs-string">"You are a supervisor tasked with managing a conversation between the"</span>
    <span class="hljs-string">f" following workers: <span class="hljs-subst">{members}</span>. Given the following user request,"</span>
    <span class="hljs-string">" respond with the worker to act next. Each worker will perform a"</span>
    <span class="hljs-string">" task and respond with their results and status. When finished,"</span>
    <span class="hljs-string">" respond with FINISH."</span>
)

<span class="hljs-comment"># Define router type for structured output</span>
<span class="hljs-class"><span class="hljs-keyword">class</span> <span class="hljs-title">Router</span>(<span class="hljs-params">TypedDict</span>):</span>
    <span class="hljs-string">"""Worker to route to next. If no workers needed, route to FINISH."""</span>
    next: Literal[<span class="hljs-string">"web_researcher"</span>, <span class="hljs-string">"rag"</span>, <span class="hljs-string">"nl2sql"</span>, <span class="hljs-string">"FINISH"</span>]

<span class="hljs-comment"># Create supervisor node function</span>
<span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">supervisor_node</span>(<span class="hljs-params">state: MessagesState</span>) -&gt; Command[Literal["web_researcher", "rag", "nl2sql", "__end__"]]:</span>
    messages = [
        {<span class="hljs-string">"role"</span>: <span class="hljs-string">"system"</span>, <span class="hljs-string">"content"</span>: system_prompt},
    ] + state[<span class="hljs-string">"messages"</span>]
    response = llm.with_structured_output(Router).invoke(messages)
    goto = response[<span class="hljs-string">"next"</span>]
    print(<span class="hljs-string">f"Next Worker: <span class="hljs-subst">{goto}</span>"</span>)
    <span class="hljs-keyword">if</span> goto == <span class="hljs-string">"FINISH"</span>:
        goto = END
    <span class="hljs-keyword">return</span> Command(goto=goto)
</code></pre>
<p>The supervisor agent works by:</p>
<ol>
<li><p>Taking the current conversation state as input</p>
</li>
<li><p>Using the system prompt to understand its role</p>
</li>
<li><p>Making a decision about which agent should act next</p>
</li>
<li><p>Returning a command that directs the flow to the chosen agent</p>
</li>
</ol>
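<p>The routing decision itself can be sketched without calling a model. In the stub below, a plain function stands in for <code>llm.with_structured_output(Router)</code> and picks a worker from keywords; the keyword rules are purely illustrative and not part of the original code:</p>
<pre><code class="lang-python"># Minimal sketch of the supervisor's routing logic, with a stub in place
# of the structured-output LLM call. The keyword rules are illustrative only.
END = "__end__"  # sentinel, mirroring langgraph.graph.END

def stub_llm_route(messages):
    """Pretend structured-output call: returns {'next': worker_name}."""
    text = messages[-1]["content"].lower()
    if "search the web" in text:
        return {"next": "web_researcher"}
    if "database" in text:
        return {"next": "nl2sql"}
    if "document" in text:
        return {"next": "rag"}
    return {"next": "FINISH"}

def supervisor_route(state):
    goto = stub_llm_route(state["messages"])["next"]
    # FINISH is translated to the graph's END sentinel, as in supervisor_node
    return END if goto == "FINISH" else goto

state = {"messages": [{"role": "user", "content": "Query the sales database"}]}
print(supervisor_route(state))  # nl2sql
</code></pre>
<p>The real <code>supervisor_node</code> does exactly this translation: the model's <code>"FINISH"</code> answer becomes the graph's <code>END</code> sentinel before being wrapped in a <code>Command</code>.</p>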
<h2 id="heading-implementing-individual-agents">Implementing Individual Agents</h2>
<p>Now, let's create our specialized agents using a custom function that creates individual LangGraph graphs for each agent. This approach allows each agent to have its own workflow while maintaining consistency in how they process requests and use tools.</p>
<p>First, let's define our custom <code>create_agent</code> function:</p>
<pre><code class="lang-python"><span class="hljs-keyword">from</span> typing <span class="hljs-keyword">import</span> Annotated, Sequence
<span class="hljs-keyword">from</span> langchain_core.messages <span class="hljs-keyword">import</span> BaseMessage
<span class="hljs-keyword">from</span> langgraph.graph <span class="hljs-keyword">import</span> StateGraph
<span class="hljs-keyword">from</span> langgraph.graph.message <span class="hljs-keyword">import</span> add_messages
<span class="hljs-keyword">from</span> langgraph.prebuilt <span class="hljs-keyword">import</span> ToolNode, tools_condition

<span class="hljs-class"><span class="hljs-keyword">class</span> <span class="hljs-title">AgentState</span>(<span class="hljs-params">TypedDict</span>):</span>
    <span class="hljs-string">"""The state of the agent."""</span>
    messages: Annotated[Sequence[BaseMessage], add_messages]

<span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">create_agent</span>(<span class="hljs-params">llm, tools</span>):</span>
    llm_with_tools = llm.bind_tools(tools)
    <span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">chatbot</span>(<span class="hljs-params">state: AgentState</span>):</span>
        <span class="hljs-keyword">return</span> {<span class="hljs-string">"messages"</span>: [llm_with_tools.invoke(state[<span class="hljs-string">"messages"</span>])]}

    graph_builder = StateGraph(AgentState)
    graph_builder.add_node(<span class="hljs-string">"agent"</span>, chatbot)

    tool_node = ToolNode(tools=tools)
    graph_builder.add_node(<span class="hljs-string">"tools"</span>, tool_node)

    graph_builder.add_conditional_edges(
        <span class="hljs-string">"agent"</span>,
        tools_condition,
    )
    graph_builder.add_edge(<span class="hljs-string">"tools"</span>, <span class="hljs-string">"agent"</span>)
    graph_builder.set_entry_point(<span class="hljs-string">"agent"</span>)
    <span class="hljs-keyword">return</span> graph_builder.compile()
</code></pre>
<p>This custom function:</p>
<ol>
<li><p>Takes an LLM and a list of tools as input</p>
</li>
<li><p>Creates a new StateGraph for the agent</p>
</li>
<li><p>Sets up the necessary nodes for the agent and tool execution</p>
</li>
<li><p>Configures the flow between nodes</p>
</li>
<li><p>Returns a compiled graph that represents the agent's complete workflow</p>
</li>
</ol>
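<p>The conditional edge added in step 4 relies on <code>tools_condition</code>, which simply checks whether the agent's last message requested a tool call. A stdlib-only approximation of that check (the message shape is simplified to plain dicts for illustration):</p>
<pre><code class="lang-python"># Simplified version of langgraph's tools_condition: route to the "tools"
# node when the last AI message carries tool calls, otherwise end the turn.
END = "__end__"

def tools_condition(state):
    last = state["messages"][-1]
    if last.get("tool_calls"):
        return "tools"
    return END

with_call = {"messages": [{"content": "", "tool_calls": [{"name": "web_search_tool"}]}]}
no_call = {"messages": [{"content": "All done."}]}
print(tools_condition(with_call))  # tools
print(tools_condition(no_call))    # __end__
</code></pre>
<p>This is why the compiled graph loops: tool-calling responses are routed to the <code>ToolNode</code>, whose results flow back to the agent via the <code>"tools" → "agent"</code> edge until the model answers without a tool call.</p>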
<p>Now, let's create our specialized agents using this function:</p>
<h3 id="heading-web-research-agent">Web Research Agent</h3>
<pre><code class="lang-python">websearch_agent = create_agent(llm, [web_search_tool])

<span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">web_research_node</span>(<span class="hljs-params">state: MessagesState</span>) -&gt; Command[Literal["supervisor"]]:</span>
    result = websearch_agent.invoke(state)
    <span class="hljs-keyword">return</span> Command(
        update={
            <span class="hljs-string">"messages"</span>: [
                HumanMessage(content=result[<span class="hljs-string">"messages"</span>][<span class="hljs-number">-1</span>].content, name=<span class="hljs-string">"web_researcher"</span>)
            ]
        },
        goto=<span class="hljs-string">"supervisor"</span>,
    )
</code></pre>
<h3 id="heading-rag-agent">RAG Agent</h3>
<pre><code class="lang-python">rag_agent = create_agent(llm, [retriever_tool])

<span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">rag_node</span>(<span class="hljs-params">state: MessagesState</span>) -&gt; Command[Literal["supervisor"]]:</span>
    result = rag_agent.invoke(state)
    <span class="hljs-keyword">return</span> Command(
        update={
            <span class="hljs-string">"messages"</span>: [
                HumanMessage(content=result[<span class="hljs-string">"messages"</span>][<span class="hljs-number">-1</span>].content, name=<span class="hljs-string">"rag"</span>)
            ]
        },
        goto=<span class="hljs-string">"supervisor"</span>,
    )
</code></pre>
<h3 id="heading-nl2sql-agent">NL2SQL Agent</h3>
<pre><code class="lang-python">nl2sql_agent = create_agent(llm, [nl2sql_tool])

<span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">nl2sql_node</span>(<span class="hljs-params">state: MessagesState</span>) -&gt; Command[Literal["supervisor"]]:</span>
    result = nl2sql_agent.invoke(state)
    <span class="hljs-keyword">return</span> Command(
        update={
            <span class="hljs-string">"messages"</span>: [
                HumanMessage(content=result[<span class="hljs-string">"messages"</span>][<span class="hljs-number">-1</span>].content, name=<span class="hljs-string">"nl2sql"</span>)
            ]
        },
        goto=<span class="hljs-string">"supervisor"</span>,
    )
</code></pre>
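<p>The three node functions above differ only in which agent they invoke and which <code>name</code> they stamp on the result, so they could be generated by one factory. Here is a sketch of that pattern with plain dicts and a callable standing in for <code>Command</code> and the compiled agents; <code>make_agent_node</code> is a hypothetical helper, not part of the original code:</p>
<pre><code class="lang-python"># Hypothetical factory generating the per-agent node functions.
# Command and the compiled agents are replaced by plain dicts/callables here.
def make_agent_node(agent, name):
    def node(state):
        result = agent(state)  # stands in for agent.invoke(state)
        last = result["messages"][-1]
        return {  # stands in for Command(update=..., goto="supervisor")
            "update": {"messages": [{"content": last["content"], "name": name}]},
            "goto": "supervisor",
        }
    return node

fake_agent = lambda state: {"messages": [{"content": "42 rows found"}]}
nl2sql_node = make_agent_node(fake_agent, "nl2sql")
out = nl2sql_node({"messages": []})
print(out["goto"], out["update"]["messages"][0]["name"])  # supervisor nl2sql
</code></pre>
<p>Naming the message after the worker matters: the supervisor sees who produced each result when deciding the next step.</p>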
<h2 id="heading-putting-it-all-together">Putting It All Together</h2>
<p>Finally, let's create the main graph that connects all our agents:</p>
<pre><code class="lang-python">builder = StateGraph(MessagesState)
builder.add_edge(START, <span class="hljs-string">"supervisor"</span>)
builder.add_node(<span class="hljs-string">"supervisor"</span>, supervisor_node)
builder.add_node(<span class="hljs-string">"web_researcher"</span>, web_research_node)
builder.add_node(<span class="hljs-string">"rag"</span>, rag_node)
builder.add_node(<span class="hljs-string">"nl2sql"</span>, nl2sql_node)
graph = builder.compile()
</code></pre>
<p>This creates a complete multi-agent system where:</p>
<ol>
<li><p>The supervisor receives the initial request</p>
</li>
<li><p>It routes the request to the appropriate specialized agent</p>
</li>
<li><p>The specialized agent processes the request using its tools</p>
</li>
<li><p>Control returns to the supervisor to decide the next step</p>
</li>
<li><p>The process continues until the supervisor determines the task is complete</p>
</li>
</ol>
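<p>The supervise-dispatch-return cycle in the list above can be simulated end to end with stubs. The worker behavior and the routing schedule below are invented for illustration; in the real graph the supervisor's LLM makes each routing decision:</p>
<pre><code class="lang-python"># End-to-end simulation of the supervisor loop with stubbed workers.
# The fixed routing schedule stands in for the supervisor's LLM decisions.
def stub_supervisor(step):
    schedule = ["rag", "web_researcher", "FINISH"]
    return schedule[min(step, len(schedule) - 1)]

workers = {
    "rag": lambda msgs: "Founder: Pradip Nichite",
    "web_researcher": lambda msgs: "Web results about the founder",
}

messages = [("user", "Find the founder and research him")]
step = 0
while True:
    goto = stub_supervisor(step)
    if goto == "FINISH":
        break  # supervisor routes to END; the graph run terminates
    messages.append((goto, workers[goto](messages)))
    step += 1

for name, content in messages:
    print(f"{name}: {content}")
</code></pre>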
<h2 id="heading-visualizing-the-langgraph"><strong>Visualizing the LangGraph</strong></h2>
<pre><code class="lang-python"><span class="hljs-keyword">from</span> IPython.display <span class="hljs-keyword">import</span> Image, display

<span class="hljs-keyword">try</span>:
    display(Image(graph.get_graph().draw_mermaid_png()))
<span class="hljs-keyword">except</span> Exception:
    <span class="hljs-comment"># Rendering the PNG needs optional dependencies; skip if unavailable</span>
    <span class="hljs-keyword">pass</span>
</code></pre>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1735367936570/8a14ff15-e7b0-4fa5-8810-f323eefde557.png" alt="visualizing graph of multi agent langgraph workflow" class="image--center mx-auto" /></p>
<h2 id="heading-testing-the-system">Testing the System</h2>
<p>Let's test our multi-agent system with a query that requires multiple agents:</p>
<pre><code class="lang-python"><span class="hljs-comment"># Example: Complex Query Using Multiple Agents</span>
input_question = <span class="hljs-string">"Find the founder of FutureSmart AI and then do a web research on him"</span>
<span class="hljs-keyword">for</span> s <span class="hljs-keyword">in</span> graph.stream(
    {<span class="hljs-string">"messages"</span>: [(<span class="hljs-string">"user"</span>, input_question)]}, 
    subgraphs=<span class="hljs-literal">True</span>
):
    print(s)
    print(<span class="hljs-string">"----"</span>)
</code></pre>
<p><strong>Output:</strong></p>
<pre><code class="lang-python">Next Worker: rag
((), {<span class="hljs-string">'supervisor'</span>: <span class="hljs-literal">None</span>})
----
INSIDE RETRIEVER NODE
((<span class="hljs-string">'rag:7c5458df-0abd-944a-27f7-b0bad49ccf3d'</span>,), {<span class="hljs-string">'agent'</span>: {<span class="hljs-string">'messages'</span>: [AIMessage(content=<span class="hljs-string">''</span>, additional_kwargs={<span class="hljs-string">'tool_calls'</span>: [{<span class="hljs-string">'id'</span>: <span class="hljs-string">'call_fK9lMHGrtubenQ697xpd2ZZ2'</span>, <span class="hljs-string">'function'</span>: {<span class="hljs-string">'arguments'</span>: <span class="hljs-string">'{"question":"Who is the founder of FutureSmart AI?"}'</span>, <span class="hljs-string">'name'</span>: <span class="hljs-string">'retriever_tool'</span>}, <span class="hljs-string">'type'</span>: <span class="hljs-string">'function'</span>}], <span class="hljs-string">'refusal'</span>: <span class="hljs-literal">None</span>}, response_metadata={<span class="hljs-string">'token_usage'</span>: {<span class="hljs-string">'completion_tokens'</span>: <span class="hljs-number">25</span>, <span class="hljs-string">'prompt_tokens'</span>: <span class="hljs-number">70</span>, <span class="hljs-string">'total_tokens'</span>: <span class="hljs-number">95</span>, <span class="hljs-string">'completion_tokens_details'</span>: {<span class="hljs-string">'accepted_prediction_tokens'</span>: <span class="hljs-number">0</span>, <span class="hljs-string">'audio_tokens'</span>: <span class="hljs-number">0</span>, <span class="hljs-string">'reasoning_tokens'</span>: <span class="hljs-number">0</span>, <span class="hljs-string">'rejected_prediction_tokens'</span>: <span class="hljs-number">0</span>}, <span class="hljs-string">'prompt_tokens_details'</span>: {<span class="hljs-string">'audio_tokens'</span>: <span class="hljs-number">0</span>, <span class="hljs-string">'cached_tokens'</span>: <span class="hljs-number">0</span>}}, <span class="hljs-string">'model_name'</span>: <span class="hljs-string">'gpt-4o-2024-08-06'</span>, <span 
class="hljs-string">'system_fingerprint'</span>: <span class="hljs-string">'fp_d28bcae782'</span>, <span class="hljs-string">'finish_reason'</span>: <span class="hljs-string">'tool_calls'</span>, <span class="hljs-string">'logprobs'</span>: <span class="hljs-literal">None</span>}, id=<span class="hljs-string">'run-e538251e-24e9-45ac-a5b7-b4ce111615ad-0'</span>, tool_calls=[{<span class="hljs-string">'name'</span>: <span class="hljs-string">'retriever_tool'</span>, <span class="hljs-string">'args'</span>: {<span class="hljs-string">'question'</span>: <span class="hljs-string">'Who is the founder of FutureSmart AI?'</span>}, <span class="hljs-string">'id'</span>: <span class="hljs-string">'call_fK9lMHGrtubenQ697xpd2ZZ2'</span>, <span class="hljs-string">'type'</span>: <span class="hljs-string">'tool_call'</span>}], usage_metadata={<span class="hljs-string">'input_tokens'</span>: <span class="hljs-number">70</span>, <span class="hljs-string">'output_tokens'</span>: <span class="hljs-number">25</span>, <span class="hljs-string">'total_tokens'</span>: <span class="hljs-number">95</span>, <span class="hljs-string">'input_token_details'</span>: {<span class="hljs-string">'audio'</span>: <span class="hljs-number">0</span>, <span class="hljs-string">'cache_read'</span>: <span class="hljs-number">0</span>}, <span class="hljs-string">'output_token_details'</span>: {<span class="hljs-string">'audio'</span>: <span class="hljs-number">0</span>, <span class="hljs-string">'reasoning'</span>: <span class="hljs-number">0</span>}})]}})
----
((<span class="hljs-string">'rag:7c5458df-0abd-944a-27f7-b0bad49ccf3d'</span>,), {<span class="hljs-string">'tools'</span>: {<span class="hljs-string">'messages'</span>: [ToolMessage(content=<span class="hljs-string">'FutureSmart AI provides customized speech to text services, employing cutting-\nedge speech recognition technologies to cater to specific client needs. Ideal for \ncreating efficient documentation and enabling voice-driven commands, this \nsolution boosts productivity and accessibility.\n\nFutureSmart AI provides custom Natural Language Processing (NLP) \nsolutions for companies looking to get ahead of the future. Our \ndedicated team of Data Scientists and ML Engineers provides an end-\nto-end solution from data labeling to modeling and deploying an ML \nmodel tailored to your specific use case. \nFounder: Pradip Nichite \n \nServices: \nText Classification \nAt FutureSmart AI, we develop custom text classification solutions using \nadvanced NLP techniques tailored to your specific business requirements. \nLeveraging Python, Pytorch, and Hugging Face transformers, we enable precise \ndata categorization across applications such as intent detection, document \ncategorization, and sentiment analysis, enhancing your decision-making \nprocesses and operational efficiency. \n \nChatbots \nWe specialize in creating custom chatbots that integrate seamlessly with your \nbusiness environment. Using semantic search and large language models, our'</span>, name=<span class="hljs-string">'retriever_tool'</span>, id=<span class="hljs-string">'fe12dcaa-a380-437f-8c24-5a7cbf6ab031'</span>, tool_call_id=<span class="hljs-string">'call_fK9lMHGrtubenQ697xpd2ZZ2'</span>)]}})
----
((<span class="hljs-string">'rag:7c5458df-0abd-944a-27f7-b0bad49ccf3d'</span>,), {<span class="hljs-string">'agent'</span>: {<span class="hljs-string">'messages'</span>: [AIMessage(content=<span class="hljs-string">''</span>, additional_kwargs={<span class="hljs-string">'tool_calls'</span>: [{<span class="hljs-string">'id'</span>: <span class="hljs-string">'call_nvmRMsfWcg0YVC9xeTqxZO7z'</span>, <span class="hljs-string">'function'</span>: {<span class="hljs-string">'arguments'</span>: <span class="hljs-string">'{"question": "Who is Pradip Nichite?"}'</span>, <span class="hljs-string">'name'</span>: <span class="hljs-string">'retriever_tool'</span>}, <span class="hljs-string">'type'</span>: <span class="hljs-string">'function'</span>}, {<span class="hljs-string">'id'</span>: <span class="hljs-string">'call_IGzCvWkpkzlpwlhFR1MR80U4'</span>, <span class="hljs-string">'function'</span>: {<span class="hljs-string">'arguments'</span>: <span class="hljs-string">'{"question": "What is the professional background of Pradip Nichite?"}'</span>, <span class="hljs-string">'name'</span>: <span class="hljs-string">'retriever_tool'</span>}, <span class="hljs-string">'type'</span>: <span class="hljs-string">'function'</span>}], <span class="hljs-string">'refusal'</span>: <span class="hljs-literal">None</span>}, response_metadata={<span class="hljs-string">'token_usage'</span>: {<span class="hljs-string">'completion_tokens'</span>: <span class="hljs-number">67</span>, <span class="hljs-string">'prompt_tokens'</span>: <span class="hljs-number">322</span>, <span class="hljs-string">'total_tokens'</span>: <span class="hljs-number">389</span>, <span class="hljs-string">'completion_tokens_details'</span>: {<span class="hljs-string">'accepted_prediction_tokens'</span>: <span class="hljs-number">0</span>, <span class="hljs-string">'audio_tokens'</span>: <span class="hljs-number">0</span>, <span class="hljs-string">'reasoning_tokens'</span>: <span class="hljs-number">0</span>, <span 
class="hljs-string">'rejected_prediction_tokens'</span>: <span class="hljs-number">0</span>}, <span class="hljs-string">'prompt_tokens_details'</span>: {<span class="hljs-string">'audio_tokens'</span>: <span class="hljs-number">0</span>, <span class="hljs-string">'cached_tokens'</span>: <span class="hljs-number">0</span>}}, <span class="hljs-string">'model_name'</span>: <span class="hljs-string">'gpt-4o-2024-08-06'</span>, <span class="hljs-string">'system_fingerprint'</span>: <span class="hljs-string">'fp_d28bcae782'</span>, <span class="hljs-string">'finish_reason'</span>: <span class="hljs-string">'tool_calls'</span>, <span class="hljs-string">'logprobs'</span>: <span class="hljs-literal">None</span>}, id=<span class="hljs-string">'run-a4c6ec42-3aec-449c-ab39-ac029109eaad-0'</span>, tool_calls=[{<span class="hljs-string">'name'</span>: <span class="hljs-string">'retriever_tool'</span>, <span class="hljs-string">'args'</span>: {<span class="hljs-string">'question'</span>: <span class="hljs-string">'Who is Pradip Nichite?'</span>}, <span class="hljs-string">'id'</span>: <span class="hljs-string">'call_nvmRMsfWcg0YVC9xeTqxZO7z'</span>, <span class="hljs-string">'type'</span>: <span class="hljs-string">'tool_call'</span>}, {<span class="hljs-string">'name'</span>: <span class="hljs-string">'retriever_tool'</span>, <span class="hljs-string">'args'</span>: {<span class="hljs-string">'question'</span>: <span class="hljs-string">'What is the professional background of Pradip Nichite?'</span>}, <span class="hljs-string">'id'</span>: <span class="hljs-string">'call_IGzCvWkpkzlpwlhFR1MR80U4'</span>, <span class="hljs-string">'type'</span>: <span class="hljs-string">'tool_call'</span>}], usage_metadata={<span class="hljs-string">'input_tokens'</span>: <span class="hljs-number">322</span>, <span class="hljs-string">'output_tokens'</span>: <span class="hljs-number">67</span>, <span class="hljs-string">'total_tokens'</span>: <span class="hljs-number">389</span>, <span 
class="hljs-string">'input_token_details'</span>: {<span class="hljs-string">'audio'</span>: <span class="hljs-number">0</span>, <span class="hljs-string">'cache_read'</span>: <span class="hljs-number">0</span>}, <span class="hljs-string">'output_token_details'</span>: {<span class="hljs-string">'audio'</span>: <span class="hljs-number">0</span>, <span class="hljs-string">'reasoning'</span>: <span class="hljs-number">0</span>}})]}})
----
INSIDE RETRIEVER NODE
INSIDE RETRIEVER NODE
((<span class="hljs-string">'rag:7c5458df-0abd-944a-27f7-b0bad49ccf3d'</span>,), {<span class="hljs-string">'tools'</span>: {<span class="hljs-string">'messages'</span>: [ToolMessage(content=<span class="hljs-string">'FutureSmart AI provides customized speech to text services, employing cutting-\nedge speech recognition technologies to cater to specific client needs. Ideal for \ncreating efficient documentation and enabling voice-driven commands, this \nsolution boosts productivity and accessibility.\n\nFutureSmart AI provides custom Natural Language Processing (NLP) \nsolutions for companies looking to get ahead of the future. Our \ndedicated team of Data Scientists and ML Engineers provides an end-\nto-end solution from data labeling to modeling and deploying an ML \nmodel tailored to your specific use case. \nFounder: Pradip Nichite \n \nServices: \nText Classification \nAt FutureSmart AI, we develop custom text classification solutions using \nadvanced NLP techniques tailored to your specific business requirements. \nLeveraging Python, Pytorch, and Hugging Face transformers, we enable precise \ndata categorization across applications such as intent detection, document \ncategorization, and sentiment analysis, enhancing your decision-making \nprocesses and operational efficiency. \n \nChatbots \nWe specialize in creating custom chatbots that integrate seamlessly with your \nbusiness environment. Using semantic search and large language models, our'</span>, name=<span class="hljs-string">'retriever_tool'</span>, id=<span class="hljs-string">'57d1c6f3-b789-4ae1-84c4-c156ca34d3c1'</span>, tool_call_id=<span class="hljs-string">'call_nvmRMsfWcg0YVC9xeTqxZO7z'</span>), ToolMessage(content=<span class="hljs-string">'FutureSmart AI provides customized speech to text services, employing cutting-\nedge speech recognition technologies to cater to specific client needs. 
Ideal for \ncreating efficient documentation and enabling voice-driven commands, this \nsolution boosts productivity and accessibility.\n\nFutureSmart AI provides custom Natural Language Processing (NLP) \nsolutions for companies looking to get ahead of the future. Our \ndedicated team of Data Scientists and ML Engineers provides an end-\nto-end solution from data labeling to modeling and deploying an ML \nmodel tailored to your specific use case. \nFounder: Pradip Nichite \n \nServices: \nText Classification \nAt FutureSmart AI, we develop custom text classification solutions using \nadvanced NLP techniques tailored to your specific business requirements. \nLeveraging Python, Pytorch, and Hugging Face transformers, we enable precise \ndata categorization across applications such as intent detection, document \ncategorization, and sentiment analysis, enhancing your decision-making \nprocesses and operational efficiency. \n \nChatbots \nWe specialize in creating custom chatbots that integrate seamlessly with your \nbusiness environment. Using semantic search and large language models, our'</span>, name=<span class="hljs-string">'retriever_tool'</span>, id=<span class="hljs-string">'a498df53-f77a-4bfb-abbf-9153790295e5'</span>, tool_call_id=<span class="hljs-string">'call_IGzCvWkpkzlpwlhFR1MR80U4'</span>)]}})
----
((<span class="hljs-string">'rag:7c5458df-0abd-944a-27f7-b0bad49ccf3d'</span>,), {<span class="hljs-string">'agent'</span>: {<span class="hljs-string">'messages'</span>: [AIMessage(content=<span class="hljs-string">"The founder of FutureSmart AI is Pradip Nichite. Unfortunately, the current retrieval did not provide additional information specifically about Pradip Nichite's professional background or further personal details. For more comprehensive insights, you might consider conducting a more extensive web search or accessing professional networking sites like LinkedIn."</span>, additional_kwargs={<span class="hljs-string">'refusal'</span>: <span class="hljs-literal">None</span>}, response_metadata={<span class="hljs-string">'token_usage'</span>: {<span class="hljs-string">'completion_tokens'</span>: <span class="hljs-number">63</span>, <span class="hljs-string">'prompt_tokens'</span>: <span class="hljs-number">888</span>, <span class="hljs-string">'total_tokens'</span>: <span class="hljs-number">951</span>, <span class="hljs-string">'completion_tokens_details'</span>: {<span class="hljs-string">'accepted_prediction_tokens'</span>: <span class="hljs-number">0</span>, <span class="hljs-string">'audio_tokens'</span>: <span class="hljs-number">0</span>, <span class="hljs-string">'reasoning_tokens'</span>: <span class="hljs-number">0</span>, <span class="hljs-string">'rejected_prediction_tokens'</span>: <span class="hljs-number">0</span>}, <span class="hljs-string">'prompt_tokens_details'</span>: {<span class="hljs-string">'audio_tokens'</span>: <span class="hljs-number">0</span>, <span class="hljs-string">'cached_tokens'</span>: <span class="hljs-number">0</span>}}, <span class="hljs-string">'model_name'</span>: <span class="hljs-string">'gpt-4o-2024-08-06'</span>, <span class="hljs-string">'system_fingerprint'</span>: <span class="hljs-string">'fp_d28bcae782'</span>, <span class="hljs-string">'finish_reason'</span>: <span class="hljs-string">'stop'</span>, <span 
class="hljs-string">'logprobs'</span>: <span class="hljs-literal">None</span>}, id=<span class="hljs-string">'run-bc47c29c-0693-41c5-88be-b322e1fbb096-0'</span>, usage_metadata={<span class="hljs-string">'input_tokens'</span>: <span class="hljs-number">888</span>, <span class="hljs-string">'output_tokens'</span>: <span class="hljs-number">63</span>, <span class="hljs-string">'total_tokens'</span>: <span class="hljs-number">951</span>, <span class="hljs-string">'input_token_details'</span>: {<span class="hljs-string">'audio'</span>: <span class="hljs-number">0</span>, <span class="hljs-string">'cache_read'</span>: <span class="hljs-number">0</span>}, <span class="hljs-string">'output_token_details'</span>: {<span class="hljs-string">'audio'</span>: <span class="hljs-number">0</span>, <span class="hljs-string">'reasoning'</span>: <span class="hljs-number">0</span>}})]}})
----
((), {<span class="hljs-string">'rag'</span>: {<span class="hljs-string">'messages'</span>: [HumanMessage(content=<span class="hljs-string">"The founder of FutureSmart AI is Pradip Nichite. Unfortunately, the current retrieval did not provide additional information specifically about Pradip Nichite's professional background or further personal details. For more comprehensive insights, you might consider conducting a more extensive web search or accessing professional networking sites like LinkedIn."</span>, additional_kwargs={}, response_metadata={}, name=<span class="hljs-string">'rag'</span>)]}})
----
Next Worker: web_researcher
((), {<span class="hljs-string">'supervisor'</span>: <span class="hljs-literal">None</span>})
----
((<span class="hljs-string">'web_researcher:509bb5e2-bf9e-2c1d-5c65-978a73d5e94c'</span>,), {<span class="hljs-string">'agent'</span>: {<span class="hljs-string">'messages'</span>: [AIMessage(content=<span class="hljs-string">''</span>, additional_kwargs={<span class="hljs-string">'tool_calls'</span>: [{<span class="hljs-string">'id'</span>: <span class="hljs-string">'call_L5d4KhCSPsT5HmHTpHmnyryx'</span>, <span class="hljs-string">'function'</span>: {<span class="hljs-string">'arguments'</span>: <span class="hljs-string">'{"query":"Pradip Nichite"}'</span>, <span class="hljs-string">'name'</span>: <span class="hljs-string">'tavily_search_results_json'</span>}, <span class="hljs-string">'type'</span>: <span class="hljs-string">'function'</span>}], <span class="hljs-string">'refusal'</span>: <span class="hljs-literal">None</span>}, response_metadata={<span class="hljs-string">'token_usage'</span>: {<span class="hljs-string">'completion_tokens'</span>: <span class="hljs-number">23</span>, <span class="hljs-string">'prompt_tokens'</span>: <span class="hljs-number">161</span>, <span class="hljs-string">'total_tokens'</span>: <span class="hljs-number">184</span>, <span class="hljs-string">'completion_tokens_details'</span>: {<span class="hljs-string">'accepted_prediction_tokens'</span>: <span class="hljs-number">0</span>, <span class="hljs-string">'audio_tokens'</span>: <span class="hljs-number">0</span>, <span class="hljs-string">'reasoning_tokens'</span>: <span class="hljs-number">0</span>, <span class="hljs-string">'rejected_prediction_tokens'</span>: <span class="hljs-number">0</span>}, <span class="hljs-string">'prompt_tokens_details'</span>: {<span class="hljs-string">'audio_tokens'</span>: <span class="hljs-number">0</span>, <span class="hljs-string">'cached_tokens'</span>: <span class="hljs-number">0</span>}}, <span class="hljs-string">'model_name'</span>: <span class="hljs-string">'gpt-4o-2024-08-06'</span>, <span 
class="hljs-string">'system_fingerprint'</span>: <span class="hljs-string">'fp_e161c81bbd'</span>, <span class="hljs-string">'finish_reason'</span>: <span class="hljs-string">'tool_calls'</span>, <span class="hljs-string">'logprobs'</span>: <span class="hljs-literal">None</span>}, id=<span class="hljs-string">'run-35518542-a3b6-424b-b4c7-f8fbb56cffd6-0'</span>, tool_calls=[{<span class="hljs-string">'name'</span>: <span class="hljs-string">'tavily_search_results_json'</span>, <span class="hljs-string">'args'</span>: {<span class="hljs-string">'query'</span>: <span class="hljs-string">'Pradip Nichite'</span>}, <span class="hljs-string">'id'</span>: <span class="hljs-string">'call_L5d4KhCSPsT5HmHTpHmnyryx'</span>, <span class="hljs-string">'type'</span>: <span class="hljs-string">'tool_call'</span>}], usage_metadata={<span class="hljs-string">'input_tokens'</span>: <span class="hljs-number">161</span>, <span class="hljs-string">'output_tokens'</span>: <span class="hljs-number">23</span>, <span class="hljs-string">'total_tokens'</span>: <span class="hljs-number">184</span>, <span class="hljs-string">'input_token_details'</span>: {<span class="hljs-string">'audio'</span>: <span class="hljs-number">0</span>, <span class="hljs-string">'cache_read'</span>: <span class="hljs-number">0</span>}, <span class="hljs-string">'output_token_details'</span>: {<span class="hljs-string">'audio'</span>: <span class="hljs-number">0</span>, <span class="hljs-string">'reasoning'</span>: <span class="hljs-number">0</span>}})]}})
----
((<span class="hljs-string">'web_researcher:509bb5e2-bf9e-2c1d-5c65-978a73d5e94c'</span>,), {<span class="hljs-string">'tools'</span>: {<span class="hljs-string">'messages'</span>: [ToolMessage(content=<span class="hljs-string">'[{"url": "https://www.youtube.com/c/PradipNichiteAI", "content": "Hello, my name is Pradip Nichite. I am a 🚀 Top Rated Plus Data Science Freelancer with 8+ years of experience, specializing in NLP and Back-End Development. Founder of FutureSmart AI, helping"}, {"url": "https://www.youtube.com/channel/UCwpCmuWq_NPVLNyr8z1IGGQ", "content": "I\'m Pradip Nichite, a Top Rated Plus freelance Data Scientist on Upwork 💼, a successful digital nomad 🌍, and an entrepreneur. My journey in freelancing has led me to earn over $200K 💰"}]'</span>, name=<span class="hljs-string">'tavily_search_results_json'</span>, id=<span class="hljs-string">'5daeafe8-e673-425e-9d7e-35f49ccae710'</span>, tool_call_id=<span class="hljs-string">'call_L5d4KhCSPsT5HmHTpHmnyryx'</span>, artifact={<span class="hljs-string">'query'</span>: <span class="hljs-string">'Pradip Nichite'</span>, <span class="hljs-string">'follow_up_questions'</span>: <span class="hljs-literal">None</span>, <span class="hljs-string">'answer'</span>: <span class="hljs-literal">None</span>, <span class="hljs-string">'images'</span>: [], <span class="hljs-string">'results'</span>: [{<span class="hljs-string">'title'</span>: <span class="hljs-string">'Pradip Nichite - YouTube'</span>, <span class="hljs-string">'url'</span>: <span class="hljs-string">'https://www.youtube.com/c/PradipNichiteAI'</span>, <span class="hljs-string">'content'</span>: <span class="hljs-string">'Hello, my name is Pradip Nichite. I am a 🚀 Top Rated Plus Data Science Freelancer with 8+ years of experience, specializing in NLP and Back-End Development. 
Founder of FutureSmart AI, helping'</span>, <span class="hljs-string">'score'</span>: <span class="hljs-number">0.8080827</span>, <span class="hljs-string">'raw_content'</span>: <span class="hljs-literal">None</span>}, {<span class="hljs-string">'title'</span>: <span class="hljs-string">'Pradip Nichite - YouTube'</span>, <span class="hljs-string">'url'</span>: <span class="hljs-string">'https://www.youtube.com/channel/UCwpCmuWq_NPVLNyr8z1IGGQ'</span>, <span class="hljs-string">'content'</span>: <span class="hljs-string">"I'm Pradip Nichite, a Top Rated Plus freelance Data Scientist on Upwork 💼, a successful digital nomad 🌍, and an entrepreneur. My journey in freelancing has led me to earn over $200K 💰"</span>, <span class="hljs-string">'score'</span>: <span class="hljs-number">0.7636429</span>, <span class="hljs-string">'raw_content'</span>: <span class="hljs-literal">None</span>}], <span class="hljs-string">'response_time'</span>: <span class="hljs-number">1.73</span>})]}})
----
((<span class="hljs-string">'web_researcher:509bb5e2-bf9e-2c1d-5c65-978a73d5e94c'</span>,), {<span class="hljs-string">'agent'</span>: {<span class="hljs-string">'messages'</span>: [AIMessage(content=<span class="hljs-string">'Pradip Nichite is a Top Rated Plus Data Science Freelancer with over 8 years of experience, specializing in Natural Language Processing (NLP) and Back-End Development. He is the founder of FutureSmart AI. Additionally, Pradip is recognized as a successful digital nomad and entrepreneur, having earned over $200K through freelancing, primarily on platforms like Upwork. For more insights, you can explore his [YouTube channel](https://www.youtube.com/c/PradipNichiteAI), where he shares more about his experiences and expertise.'</span>, additional_kwargs={<span class="hljs-string">'refusal'</span>: <span class="hljs-literal">None</span>}, response_metadata={<span class="hljs-string">'token_usage'</span>: {<span class="hljs-string">'completion_tokens'</span>: <span class="hljs-number">116</span>, <span class="hljs-string">'prompt_tokens'</span>: <span class="hljs-number">346</span>, <span class="hljs-string">'total_tokens'</span>: <span class="hljs-number">462</span>, <span class="hljs-string">'completion_tokens_details'</span>: {<span class="hljs-string">'accepted_prediction_tokens'</span>: <span class="hljs-number">0</span>, <span class="hljs-string">'audio_tokens'</span>: <span class="hljs-number">0</span>, <span class="hljs-string">'reasoning_tokens'</span>: <span class="hljs-number">0</span>, <span class="hljs-string">'rejected_prediction_tokens'</span>: <span class="hljs-number">0</span>}, <span class="hljs-string">'prompt_tokens_details'</span>: {<span class="hljs-string">'audio_tokens'</span>: <span class="hljs-number">0</span>, <span class="hljs-string">'cached_tokens'</span>: <span class="hljs-number">0</span>}}, <span class="hljs-string">'model_name'</span>: <span class="hljs-string">'gpt-4o-2024-08-06'</span>, <span 
class="hljs-string">'system_fingerprint'</span>: <span class="hljs-string">'fp_e161c81bbd'</span>, <span class="hljs-string">'finish_reason'</span>: <span class="hljs-string">'stop'</span>, <span class="hljs-string">'logprobs'</span>: <span class="hljs-literal">None</span>}, id=<span class="hljs-string">'run-32d923b8-a3be-420a-a06f-35a7e27c68bb-0'</span>, usage_metadata={<span class="hljs-string">'input_tokens'</span>: <span class="hljs-number">346</span>, <span class="hljs-string">'output_tokens'</span>: <span class="hljs-number">116</span>, <span class="hljs-string">'total_tokens'</span>: <span class="hljs-number">462</span>, <span class="hljs-string">'input_token_details'</span>: {<span class="hljs-string">'audio'</span>: <span class="hljs-number">0</span>, <span class="hljs-string">'cache_read'</span>: <span class="hljs-number">0</span>}, <span class="hljs-string">'output_token_details'</span>: {<span class="hljs-string">'audio'</span>: <span class="hljs-number">0</span>, <span class="hljs-string">'reasoning'</span>: <span class="hljs-number">0</span>}})]}})
----
((), {<span class="hljs-string">'web_researcher'</span>: {<span class="hljs-string">'messages'</span>: [HumanMessage(content=<span class="hljs-string">'Pradip Nichite is a Top Rated Plus Data Science Freelancer with over 8 years of experience, specializing in Natural Language Processing (NLP) and Back-End Development. He is the founder of FutureSmart AI. Additionally, Pradip is recognized as a successful digital nomad and entrepreneur, having earned over $200K through freelancing, primarily on platforms like Upwork. For more insights, you can explore his [YouTube channel](https://www.youtube.com/c/PradipNichiteAI), where he shares more about his experiences and expertise.'</span>, additional_kwargs={}, response_metadata={}, name=<span class="hljs-string">'web_researcher'</span>)]}})
----
Next Worker: FINISH
((), {<span class="hljs-string">'supervisor'</span>: <span class="hljs-literal">None</span>})
----
</code></pre>
<blockquote>
<p>Don't be confused by the {'supervisor': None} entry in the output. Others have run into this as well, and the multi-agent system still works as intended.</p>
</blockquote>
<h2 id="heading-things-to-improve">Things to Improve</h2>
<ul>
<li><p>Add <a target="_blank" href="https://blog.futuresmart.ai/how-to-build-langgraph-agent-with-long-term-memory">memory support</a> to allow agents to remember previous interactions and maintain context</p>
</li>
<li><p>Improve the system prompt of the supervisor agent for better decision-making</p>
</li>
<li><p>Add more relevant tools to each individual agent</p>
</li>
</ul>
<h2 id="heading-conclusion">Conclusion</h2>
<p>In this tutorial, we've built a sophisticated multi-agent system using LangGraph that demonstrates how to create specialized agents working together under centralized supervision. The system showcases advanced concepts in agent orchestration, tool integration, and state management - areas where we at FutureSmart AI have successfully delivered numerous enterprise solutions.</p>
<p>This architecture provides several advantages:</p>
<ol>
<li><p><strong>Modularity</strong>: Each agent has a specific role and can be modified independently</p>
</li>
<li><p><strong>Scalability</strong>: New agents can be added without changing existing ones</p>
</li>
<li><p><strong>Flexibility</strong>: The supervisor can dynamically choose the best agent for each task</p>
</li>
<li><p><strong>Control</strong>: The workflow is clearly defined and manageable</p>
</li>
</ol>
<p>If you found this guide useful and want to explore more, check out our YouTube video on the new <strong>multi-agent framework from OpenAI (Swarm)</strong>.</p>
<div class="embed-wrapper"><div class="embed-loading"><div class="loadingRow"></div><div class="loadingRow"></div></div><a class="embed-card" href="https://youtu.be/g9E7VNeZItM?si=YzBARGIRZx7gW7bP">https://youtu.be/g9E7VNeZItM?si=YzBARGIRZx7gW7bP</a></div>
<h2 id="heading-more-learning-resources">📚 More Learning Resources</h2>
<p>If you prefer visual learning, check out these step-by-step tutorials:</p>
<ul>
<li><p><a target="_blank" href="https://youtu.be/8sSHg1034r0?si=WMdqqbALooCybCH7"><strong>Build RAG Applications from Scratch Without LangChain or LlamaIndex</strong></a> – Learn to build RAG applications without relying on frameworks for better customization and debugging.</p>
</li>
<li><p><a target="_blank" href="https://youtu.be/38aMTXY2usU?si=nTz0lqlcmqHhIoNs"><strong>LangChain RAG Course: From Basics to a Production-Ready RAG Chatbot</strong></a> – A comprehensive guide on implementing RAG using LangChain, covering everything from basics to production deployment.</p>
</li>
<li><p><a target="_blank" href="https://youtu.be/fss6CrmQU2Y?si=UQMT8HXzaSVXMYqq"><strong>Mastering Natural Language to SQL with LangChain and LangSmith | NL2SQL</strong></a> – A hands-on tutorial on converting natural language queries into SQL using LangChain and LangSmith.</p>
</li>
</ul>
<hr />
<p>At <strong>FutureSmart AI</strong>, we help businesses develop state-of-the-art AI solutions tailored to their needs. We've implemented similar multi-agent architectures for various industries, from customer service automation to complex AI Interview systems.</p>
<p>If you have inquiries, feel free to reach out to us at <a target="_blank" href="mailto:contact@futuresmart.ai"><strong>contact@futuresmart.ai</strong></a>. For real-world examples of our work, take a look at our <a target="_blank" href="https://www.futuresmart.ai/case-studies"><strong>case studies</strong></a>, where we showcase how our expertise in LangGraph and other AI technologies has delivered measurable business value.</p>
<p>Stay tuned for the next tutorial in this series, where we'll explore more advanced patterns and optimizations in multi-agent systems.</p>
<h2 id="heading-resources">Resources</h2>
<div data-node-type="callout">
<div data-node-type="callout-emoji">💡</div>
<div data-node-type="callout-text"><a target="_self" href="https://github.com/PradipNichite/FutureSmart-AI-Blog/tree/main/Multi%20Agent%20system%20with%20LangGraph">Get the Full Code in our GitHub</a></div>
</div>

<div data-node-type="callout">
<div data-node-type="callout-emoji">💡</div>
<div data-node-type="callout-text"><a target="_self" href="https://langchain-ai.github.io/langgraph/concepts/multi_agent/#multi-agent-architectures">Know more about different Multi-agent systems from LangGraph</a></div>
</div>]]></content:encoded></item><item><title><![CDATA[Building a Powerful HubSpot AI Agent: A Comprehensive Guide]]></title><description><![CDATA[Introduction
Managing the core objects of HubSpot CRM - Contacts, Companies, Tickets, and Deals - is crucial for business operations. However, manually tracking and updating these objects can be a difficult task, prone to errors and inefficiencies. T...]]></description><link>https://blog.futuresmart.ai/hubspot-ai-agent</link><guid isPermaLink="true">https://blog.futuresmart.ai/hubspot-ai-agent</guid><category><![CDATA[AI]]></category><category><![CDATA[HubSpot]]></category><category><![CDATA[ai-agent]]></category><dc:creator><![CDATA[Ruchir Tidke]]></dc:creator><pubDate>Thu, 12 Dec 2024 07:50:00 GMT</pubDate><enclosure url="https://cdn.hashnode.com/res/hashnode/image/upload/v1733816595490/fdeb72af-c6b2-460e-9275-1f7c77b877d7.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h3 id="heading-introduction"><strong>Introduction</strong></h3>
<p>Managing the core objects of HubSpot CRM - <strong>Contacts</strong>, <strong>Companies</strong>, <strong>Tickets</strong>, and <strong>Deals</strong> - is crucial for business operations. However, manually tracking and updating these objects can be a difficult task, prone to errors and inefficiencies. The static workflows and manual processes can limit your team's productivity.</p>
<p>This is where the <strong>HubSpot AI Agent</strong> excels. But, building a custom HubSpot agent that can intelligently interact with your CRM data and perform tasks autonomously is a challenging task, requiring extensive coding and technical expertise.</p>
<p>Based on our work developing custom HubSpot AI Agents for clients, we’ll guide you through the process of creating your own HubSpot agent using powerful tools integrated within the <a target="_blank" href="https://blog.futuresmart.ai/langgraph-agent-with-rag-and-nl2sql"><strong>LangGraph framework</strong></a>. This HubSpot developer tutorial offers a practical, step-by-step approach to building a fully functional AI agent capable of performing complex tasks.</p>
<p>Our work has shown that the HubSpot Agent, when integrated with LLMs, can intelligently interact with CRM data. It processes natural language queries, identifies relevant CRM objects, and autonomously performs operations like <strong><em>creating deals, updating contacts, resolving tickets, tracking deals, or generating insights</em></strong>. This ability to interpret user intent, call APIs, and combine multiple tools makes it much more than a CRM assistant—it’s a game-changer.</p>
<h3 id="heading-how-the-hubspot-agent-works"><strong>How the Hubspot Agent Works</strong></h3>
<p>Let’s start with a high-level overview of the workflow. This LangGraph agent relies on a structured workflow and tool-calling capabilities.</p>
<ol>
<li><p><strong>Query Understanding</strong>: Our Main Agent uses a Large Language Model (LLM) to analyze user queries and identify their intent.</p>
</li>
<li><p><strong>Tool Calling</strong>: The agent intelligently determines the appropriate workflow and APIs or set of tools required to execute the task.</p>
</li>
<li><p><strong>Dynamic Execution</strong>: It seamlessly performs operations like creating a contact, updating a deal, or fetching CRM data.</p>
<p> <img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1733462113450/2902b468-4665-4975-b933-fd5addc478e7.png" alt class="image--center mx-auto" /></p>
</li>
</ol>
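<p>The three steps above boil down to a decide-then-execute loop. Here is a deliberately simplified sketch of that loop; the function and tool names are hypothetical, and the real agent built later in this post uses LangGraph and LLM tool-calling rather than a hand-written dispatcher:</p>

```python
# Simplified sketch of the agent loop: understand the query, pick a tool,
# execute it. `llm_decide` stands in for the LLM's tool-selection step.
def run_agent(query, llm_decide, tools):
    """Return a tool result, or a direct answer when no tool is needed."""
    decision = llm_decide(query)      # 1. query understanding
    if decision is None:
        return "direct answer"
    tool_name, args = decision        # 2. tool calling
    return tools[tool_name](**args)   # 3. dynamic execution

tools = {"dynamic_search": lambda object_type, **kw: f"searched {object_type}"}
answer = run_agent(
    "What is the status of deal ABC?",
    lambda q: ("dynamic_search", {"object_type": "deals"}),
    tools,
)
print(answer)  # searched deals
```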
<p>If you are not aware of LangGraph Agents, Nodes, Tools, and Edges, then we highly recommend reading our <a target="_blank" href="https://blog.futuresmart.ai/langgraph-tutorial-for-beginners">beginner-friendly LangGraph Tutorial</a></p>
<h3 id="heading-setting-up-your-langgraph-agent-for-hubspot"><strong>Setting Up Your LangGraph Agent for HubSpot</strong></h3>
<p>Before diving into the main coding part, we need to follow some key steps to set up our HubSpot Dashboard. If you have already configured it, then feel free to skip this section.</p>
<ul>
<li><p><strong>Creating a Private App in HubSpot</strong>: Create a private app to generate a private key.</p>
<p>  <img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1733465291411/730377d5-dda6-4013-be7d-5475e0a757c9.png" alt class="image--center mx-auto" /></p>
</li>
<li><p><strong>Selecting Scopes</strong>: Define permissions to control access to CRM objects like Contacts or Deals.</p>
<p>  <img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1733905544529/328c8260-a62d-41ac-9ee5-fcdb583d5b18.png" alt class="image--center mx-auto" /></p>
</li>
<li><p><strong>Generating Private Key</strong>: Generate an API key to securely connect your agent with HubSpot.</p>
<p>  <img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1733905716114/327ee383-40ad-4ab8-8670-e45acf9b40aa.png" alt class="image--center mx-auto" /></p>
</li>
<li><p><strong>Installing the Python HubSpot API SDK</strong>: Install and configure the <code>hubspot-api-client</code> Python library for interacting with HubSpot’s CRM objects.</p>
<p>  <img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1733905748644/a5b2cde0-a7be-4d46-ae6b-b76c2c3de7e2.png" alt class="image--center mx-auto" /></p>
</li>
</ul>
<h2 id="heading-setting-up-the-environment"><strong>Setting up the Environment</strong></h2>
<p>To get started, install the dependencies below:</p>
<pre><code class="lang-python">!pip install -U hubspot-api-client langchain langgraph langchain-openai pydantic python-dotenv
</code></pre>
<h2 id="heading-setting-up-api-keys"><strong>Setting up API Keys</strong></h2>
<p>Before diving into building your HubSpot AI agent, it's crucial to set up your API keys. These keys allow your agent to interact securely with external services and tools like <strong>HubSpot</strong> and <a target="_blank" href="https://openai.com/index/openai-api/"><strong>OpenAI</strong> <strong>GPT</strong></a> models. Without them, the tools cannot function.</p>
<p>Here we are using the OpenAI model, but you can use any LLM of your choice.</p>
<pre><code class="lang-python"><span class="hljs-keyword">import</span> getpass
<span class="hljs-keyword">import</span> os

<span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">_set_env</span>(<span class="hljs-params">var: str</span>):</span>
    <span class="hljs-keyword">if</span> <span class="hljs-keyword">not</span> os.environ.get(var):
        os.environ[var] = getpass.getpass(<span class="hljs-string">f"<span class="hljs-subst">{var}</span>: "</span>)

_set_env(<span class="hljs-string">"HUBSPOT_PRIVATE_KEY"</span>)
_set_env(<span class="hljs-string">"OPENAI_API_KEY"</span>)
</code></pre>
<h2 id="heading-creating-the-llm-object"><strong>Creating the LLM Object</strong></h2>
<p>Here’s how to initialize the LLM using LangChain <code>ChatOpenAI</code></p>
<pre><code class="lang-python"><span class="hljs-keyword">from</span> langchain_openai <span class="hljs-keyword">import</span> ChatOpenAI

llm = ChatOpenAI(model_name=<span class="hljs-string">"gpt-4o"</span>)
</code></pre>
<h2 id="heading-creating-tools-for-ai-agent">Creating Tools for AI Agent</h2>
<p>We will create a few custom tools that our LangGraph agent can use to interact with the HubSpot CRM.</p>
<h3 id="heading-dynamic-search-tool-integration"><strong>Dynamic Search Tool Integration</strong></h3>
<p>The <strong>Dynamic Search</strong> tool enables the agent to fetch relevant records from the CRM.</p>
<pre><code class="lang-python"><span class="hljs-comment"># Imports used below (exact paths may vary slightly across hubspot-api-client versions).</span>
<span class="hljs-comment"># FilterGroup is a small Pydantic model wrapping HubSpot search filters, and get_client()</span>
<span class="hljs-comment"># returns a configured hubspot.Client; both are defined earlier in the full notebook.</span>
<span class="hljs-keyword">from</span> typing <span class="hljs-keyword">import</span> List, Optional
<span class="hljs-keyword">from</span> pydantic <span class="hljs-keyword">import</span> BaseModel
<span class="hljs-keyword">from</span> langchain_core.tools <span class="hljs-keyword">import</span> tool
<span class="hljs-keyword">from</span> hubspot.crm.contacts <span class="hljs-keyword">import</span> PublicObjectSearchRequest
<span class="hljs-keyword">from</span> hubspot.crm.contacts.exceptions <span class="hljs-keyword">import</span> ApiException

<span class="hljs-class"><span class="hljs-keyword">class</span> <span class="hljs-title">SearchRequestSchema</span>(<span class="hljs-params">BaseModel</span>):</span>
    object_type: str
    filterGroups: List[FilterGroup]
    properties: Optional[List[str]] = <span class="hljs-literal">None</span>
    limit: Optional[int] = <span class="hljs-number">10</span>
    sorts: Optional[List[str]] = <span class="hljs-literal">None</span>
    after: Optional[str] = <span class="hljs-literal">None</span>
    propertiesWithHistory: Optional[List[str]] = <span class="hljs-literal">None</span>
    archived: Optional[bool] = <span class="hljs-literal">False</span>

<span class="hljs-meta">@tool(args_schema=SearchRequestSchema)</span>
<span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">dynamic_search</span>(<span class="hljs-params">
    object_type: str,
    filterGroups: List[FilterGroup],
    properties: Optional[List[str]] = None,
    limit: Optional[int] = <span class="hljs-number">10</span>
</span>):</span>
    <span class="hljs-string">"""Generic tool for searching objects on HubSpot Ex: contacts,deals,tickets,companies."""</span>
    object_to_search_method = {
        <span class="hljs-string">'contacts'</span>: <span class="hljs-string">'crm.contacts.search_api.do_search'</span>,
        <span class="hljs-string">'deals'</span>: <span class="hljs-string">'crm.deals.search_api.do_search'</span>,
        <span class="hljs-string">'tickets'</span>: <span class="hljs-string">'crm.tickets.search_api.do_search'</span>,
        <span class="hljs-string">'companies'</span>: <span class="hljs-string">'crm.companies.search_api.do_search'</span>,
        <span class="hljs-string">'line_items'</span>: <span class="hljs-string">'crm.line_items.search_api.do_search'</span>,
        <span class="hljs-string">'quotes'</span>: <span class="hljs-string">'crm.quotes.search_api.do_search'</span>
    }

    <span class="hljs-keyword">if</span> object_type <span class="hljs-keyword">not</span> <span class="hljs-keyword">in</span> object_to_search_method:
        <span class="hljs-keyword">return</span> {<span class="hljs-string">"Error"</span>: <span class="hljs-string">f"Unsupported object type: <span class="hljs-subst">{object_type}</span>"</span>}

    <span class="hljs-keyword">try</span>:
        api_client = get_client()
        search_method_path = object_to_search_method[object_type]
        search_method = getattr(
            api_client.crm, search_method_path.split(<span class="hljs-string">'.'</span>)[<span class="hljs-number">1</span>]
        ).search_api.do_search

        <span class="hljs-comment"># Prepare filter groups for the request</span>
        formatted_filter_groups = [
            {<span class="hljs-string">"filters"</span>: [filter.dict() <span class="hljs-keyword">for</span> filter <span class="hljs-keyword">in</span> group.filters]} <span class="hljs-keyword">for</span> group <span class="hljs-keyword">in</span> filterGroups
        ]

        <span class="hljs-comment"># Construct the search request</span>
        public_object_search_request = PublicObjectSearchRequest(
            filter_groups=formatted_filter_groups,
            properties=properties <span class="hljs-keyword">or</span> [],
            limit=limit
        )

        <span class="hljs-comment"># Send the request</span>
        api_response = search_method(public_object_search_request=public_object_search_request)
        <span class="hljs-keyword">return</span> {<span class="hljs-string">"status"</span>: <span class="hljs-string">"success"</span>, <span class="hljs-string">"response_details"</span>: api_response}
    <span class="hljs-keyword">except</span> ApiException <span class="hljs-keyword">as</span> e:
        <span class="hljs-keyword">return</span> {<span class="hljs-string">"Exception when retrieving object information"</span>: str(e)}
    <span class="hljs-keyword">except</span> ValueError <span class="hljs-keyword">as</span> e:
        <span class="hljs-keyword">return</span> {<span class="hljs-string">"Error"</span>: str(e)}
</code></pre>
<p><strong>Output:</strong></p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1733563895899/51d3553e-f9e5-4106-a198-0211d057fd96.png" alt /></p>
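<p>For reference, the <code>filterGroups</code> argument mirrors HubSpot's search API filter format. The <code>Filter</code> and <code>FilterGroup</code> Pydantic models below are an assumption about how they are defined in the full notebook:</p>

```python
from typing import List
from pydantic import BaseModel

# Assumed shape of the FilterGroup model used by dynamic_search; it mirrors
# HubSpot's search API, where filters within a group are ANDed together and
# separate groups are ORed.
class Filter(BaseModel):
    propertyName: str
    operator: str  # e.g. "EQ", "CONTAINS_TOKEN", "GT"
    value: str

class FilterGroup(BaseModel):
    filters: List[Filter]

group = FilterGroup(filters=[Filter(propertyName="dealname", operator="EQ", value="ABC")])

# The same formatting step the tool performs before building the request:
formatted = [{"filters": [f.dict() for f in g.filters]} for g in [group]]
print(formatted)
# [{'filters': [{'propertyName': 'dealname', 'operator': 'EQ', 'value': 'ABC'}]}]
```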
<h3 id="heading-create-object-tool-integration"><strong>Create Object Tool Integration</strong></h3>
<p>The <strong>Create Object</strong> tool creates different CRM objects (contacts, deals, tickets, and so on) based on the user's request.</p>
<pre><code class="lang-python"><span class="hljs-class"><span class="hljs-keyword">class</span> <span class="hljs-title">CreateObjectSchema</span>(<span class="hljs-params">BaseModel</span>):</span>
    object_type: str
    properties: Dict[str, str]

<span class="hljs-meta">@tool(args_schema=CreateObjectSchema)</span>
<span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">create_object</span>(<span class="hljs-params">object_type: str, properties: Dict[str, str]</span>):</span>
    <span class="hljs-string">""" Tool for creating a CRM object (e.g., contact, deal, ticket) on HubSpot. """</span>
    <span class="hljs-keyword">try</span>:
        <span class="hljs-keyword">from</span> hubspot.crm.objects <span class="hljs-keyword">import</span> SimplePublicObjectInputForCreate

        <span class="hljs-comment"># Mapping of object types to their respective API creation methods</span>
        object_to_create_method = {
            <span class="hljs-string">'contacts'</span>: <span class="hljs-string">'crm.contacts.basic_api.create'</span>,
            <span class="hljs-string">'deals'</span>: <span class="hljs-string">'crm.deals.basic_api.create'</span>,
            <span class="hljs-string">'tickets'</span>: <span class="hljs-string">'crm.tickets.basic_api.create'</span>,
            <span class="hljs-string">'companies'</span>: <span class="hljs-string">'crm.companies.basic_api.create'</span>,
            <span class="hljs-string">'line_items'</span>: <span class="hljs-string">'crm.line_items.basic_api.create'</span>,
            <span class="hljs-string">'quotes'</span>: <span class="hljs-string">'crm.quotes.basic_api.create'</span>,
        }

        <span class="hljs-comment"># Check if the object type is supported</span>
        <span class="hljs-keyword">if</span> object_type <span class="hljs-keyword">not</span> <span class="hljs-keyword">in</span> object_to_create_method:
            <span class="hljs-keyword">raise</span> ValueError(<span class="hljs-string">f"Unsupported object type: <span class="hljs-subst">{object_type}</span>"</span>)

        <span class="hljs-comment"># Initialize the API client</span>
        api_client = get_client()

        <span class="hljs-comment"># Get the appropriate API creation method</span>
        create_method_path = object_to_create_method[object_type]
        create_method = getattr(api_client.crm, create_method_path.split(<span class="hljs-string">'.'</span>)[<span class="hljs-number">1</span>]).basic_api.create

        <span class="hljs-comment"># Prepare the object input for creation</span>
        simple_public_object_input_for_create = SimplePublicObjectInputForCreate(properties=properties)

        <span class="hljs-comment"># Call the create method</span>
        api_response = create_method(simple_public_object_input_for_create=simple_public_object_input_for_create)

        <span class="hljs-comment"># Return the API response</span>
        <span class="hljs-keyword">return</span> {
            <span class="hljs-string">"status"</span>: <span class="hljs-string">"success"</span>,
            <span class="hljs-string">"response_details"</span>: api_response,
            <span class="hljs-string">"object_id"</span>: api_response.id
        }

    <span class="hljs-keyword">except</span> Exception <span class="hljs-keyword">as</span> e:
        <span class="hljs-comment"># Handle errors and return the error message</span>
        <span class="hljs-keyword">return</span> {<span class="hljs-string">"error"</span>: str(e)}
</code></pre>
<p><strong>Output:</strong></p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1733564160035/2b80cbb2-913d-4722-bd91-1f71e3b7a50a.png" alt /></p>
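<p>The key trick in this tool (and in <code>dynamic_search</code>) is resolving the dotted method path with <code>getattr</code>. A fake client built from <code>SimpleNamespace</code>, purely for illustration, shows the mechanics without touching the real API:</p>

```python
from types import SimpleNamespace

# Stand-in mimicking the attribute layout of hubspot.Client (illustrative only).
fake_client = SimpleNamespace(crm=SimpleNamespace(contacts=SimpleNamespace(
    basic_api=SimpleNamespace(
        create=lambda simple_public_object_input_for_create=None: {
            "id": "101", "input": simple_public_object_input_for_create}
    )
)))

path = "crm.contacts.basic_api.create"
# Same resolution step as in create_object: split out 'contacts', then walk
# down to basic_api.create on that sub-client.
create_method = getattr(fake_client.crm, path.split(".")[1]).basic_api.create
response = create_method(simple_public_object_input_for_create={"email": "jane@example.com"})
print(response["id"])  # 101
```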
<h3 id="heading-update-object-tool-integration"><strong>Update Object Tool Integration</strong></h3>
<p>The <strong>Update Object</strong> tool updates existing CRM objects by their ID.</p>
<pre><code class="lang-python"><span class="hljs-class"><span class="hljs-keyword">class</span> <span class="hljs-title">UpdateObjectSchema</span>(<span class="hljs-params">BaseModel</span>):</span>
    object_type: str
    object_id: str
    properties: Dict[str, str]

<span class="hljs-meta">@tool(args_schema=UpdateObjectSchema)</span>
<span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">update_object</span>(<span class="hljs-params">object_type: str, object_id: str, properties: Dict[str, str]</span>):</span>
    <span class="hljs-string">""" Tool for updating a CRM object (e.g., contact, deal, ticket) on HubSpot using its ID. """</span>
    <span class="hljs-keyword">try</span>:
        <span class="hljs-keyword">from</span> hubspot.crm.objects <span class="hljs-keyword">import</span> SimplePublicObjectInput

        <span class="hljs-comment"># Mapping of object types to their respective API update methods</span>
        object_to_update_method = {
            <span class="hljs-string">'contacts'</span>: <span class="hljs-string">'crm.contacts.basic_api.update'</span>,
            <span class="hljs-string">'deals'</span>: <span class="hljs-string">'crm.deals.basic_api.update'</span>,
            <span class="hljs-string">'tickets'</span>: <span class="hljs-string">'crm.tickets.basic_api.update'</span>,
            <span class="hljs-string">'companies'</span>: <span class="hljs-string">'crm.companies.basic_api.update'</span>,
            <span class="hljs-string">'line_items'</span>: <span class="hljs-string">'crm.line_items.basic_api.update'</span>,
            <span class="hljs-string">'quotes'</span>: <span class="hljs-string">'crm.quotes.basic_api.update'</span>,
        }

        <span class="hljs-comment"># Check if the object type is supported</span>
        <span class="hljs-keyword">if</span> object_type <span class="hljs-keyword">not</span> <span class="hljs-keyword">in</span> object_to_update_method:
            <span class="hljs-keyword">raise</span> ValueError(<span class="hljs-string">f"Unsupported object type: <span class="hljs-subst">{object_type}</span>"</span>)

        <span class="hljs-comment"># Initialize the API client</span>
        api_client = get_client()

        <span class="hljs-comment"># Get the appropriate API update method</span>
        update_method_path = object_to_update_method[object_type]
        update_method = getattr(api_client.crm, update_method_path.split(<span class="hljs-string">'.'</span>)[<span class="hljs-number">1</span>]).basic_api.update

        <span class="hljs-comment"># Prepare the object input for updating</span>
        simple_public_object_input = SimplePublicObjectInput(properties=properties)

        <span class="hljs-comment"># Call the update method</span>
        api_response = update_method(object_id, simple_public_object_input)

        <span class="hljs-comment"># Return the API response</span>
        <span class="hljs-keyword">return</span> {
            <span class="hljs-string">"status"</span>: <span class="hljs-string">"success"</span>,
            <span class="hljs-string">"response_details"</span>: api_response,
            <span class="hljs-string">"object_id"</span>: api_response.id
        }

    <span class="hljs-keyword">except</span> Exception <span class="hljs-keyword">as</span> e:
        <span class="hljs-comment"># Handle errors and return the error message</span>
        <span class="hljs-keyword">return</span> {<span class="hljs-string">"error"</span>: str(e)}
</code></pre>
<p><strong>Output:</strong></p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1733564250456/c55513f8-8012-48d0-946c-ea05f7d398ad.png" alt /></p>
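<p>A useful side effect of passing <code>args_schema</code> to the <code>@tool</code> decorator is that arguments produced by the LLM are validated by Pydantic before the function body runs. A quick illustration with the schema above:</p>

```python
from typing import Dict
from pydantic import BaseModel, ValidationError

class UpdateObjectSchema(BaseModel):
    object_type: str
    object_id: str
    properties: Dict[str, str]

# Well-formed arguments pass validation untouched.
ok = UpdateObjectSchema(
    object_type="deals", object_id="987654",
    properties={"dealstage": "closedwon"},
)

# A malformed tool call (missing object_id) is rejected before any API call.
try:
    UpdateObjectSchema(object_type="deals", properties={})
    rejected = False
except ValidationError:
    rejected = True
print(rejected)  # True
```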
<h2 id="heading-combining-the-tools"><strong>Combining the</strong> <strong>Tools</strong></h2>
<p>To create a fully functional HubSpot AI agent, you need to bind the Dynamic Search, Create Object, and Update Object tools to the LLM.</p>
<p>Here’s how to combine the tools:</p>
<pre><code class="lang-python">tools = [dynamic_search, create_object, update_object]
llm_with_tools = llm.bind_tools(tools)
</code></pre>
<p>With this configuration, your LLM can invoke the appropriate tool based on the user’s query.</p>
<h2 id="heading-building-the-langgraph"><strong>Building the LangGraph</strong></h2>
<p>LangGraph enables you to define a stateful workflow for your AI agent. By structuring nodes and edges, you can define how the agent processes user inputs and transitions between tools.</p>
<h3 id="heading-steps-to-build-the-langgraph"><strong>Steps to Build the LangGraph</strong></h3>
<ol>
<li><p><strong>Define the State</strong>: Create a <code>State</code> dictionary to manage the agent’s inputs and outputs.</p>
<pre><code class="lang-python"> <span class="hljs-keyword">from</span> typing <span class="hljs-keyword">import</span> Annotated, TypedDict
 <span class="hljs-keyword">from</span> langgraph.graph <span class="hljs-keyword">import</span> StateGraph
 <span class="hljs-keyword">from</span> langgraph.graph.message <span class="hljs-keyword">import</span> add_messages
 <span class="hljs-keyword">from</span> langgraph.checkpoint.memory <span class="hljs-keyword">import</span> MemorySaver

 memory = MemorySaver()

 <span class="hljs-class"><span class="hljs-keyword">class</span> <span class="hljs-title">State</span>(<span class="hljs-params">TypedDict</span>):</span>
     messages: Annotated[list, add_messages]
</code></pre>
</li>
<li><p><strong>Add Nodes</strong>: Add nodes for the chatbot and tools to handle user queries and invoke the tools.</p>
<pre><code class="lang-python"> <span class="hljs-keyword">from</span> langgraph.prebuilt <span class="hljs-keyword">import</span> ToolNode

 <span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">chatbot</span>(<span class="hljs-params">state: State</span>):</span>
      <span class="hljs-keyword">return</span> {<span class="hljs-string">"messages"</span>: [llm_with_tools.invoke(state[<span class="hljs-string">"messages"</span>])]}

  graph_builder = StateGraph(State)
  graph_builder.add_node(<span class="hljs-string">"chatbot"</span>, chatbot)

  tool_node = ToolNode(tools=[dynamic_search, create_object, update_object])
  graph_builder.add_node(<span class="hljs-string">"tools"</span>, tool_node)
</code></pre>
</li>
<li><p><strong>Define Edges</strong>: Use conditional edges to determine when the agent should switch between nodes.</p>
<pre><code class="lang-python">  <span class="hljs-keyword">from</span> langgraph.prebuilt <span class="hljs-keyword">import</span> tools_condition

  graph_builder.add_conditional_edges(<span class="hljs-string">"chatbot"</span>, tools_condition)
  graph_builder.add_edge(<span class="hljs-string">"tools"</span>, <span class="hljs-string">"chatbot"</span>)
  graph_builder.set_entry_point(<span class="hljs-string">"chatbot"</span>)
</code></pre>
</li>
<li><p><strong>Compile the Graph</strong>: Finalize the graph for execution.</p>
<pre><code class="lang-python"> graph = graph_builder.compile(checkpointer=memory)
</code></pre>
</li>
</ol>
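<p>The <code>tools_condition</code> edge added in step 3 routes on whether the model's latest message requested a tool. Conceptually it behaves like the simplified rule below (an approximation for intuition, not LangGraph's actual source):</p>

```python
from types import SimpleNamespace

# Approximate behavior of langgraph.prebuilt.tools_condition: go to the
# "tools" node when the last AI message carries tool calls, otherwise end.
def route(state):
    last = state["messages"][-1]
    return "tools" if getattr(last, "tool_calls", None) else "__end__"

wants_tool = SimpleNamespace(tool_calls=[{"name": "dynamic_search", "args": {}}])
plain_answer = SimpleNamespace(tool_calls=[])

print(route({"messages": [wants_tool]}))    # tools
print(route({"messages": [plain_answer]}))  # __end__
```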
<h2 id="heading-testing-the-ai-agent"><strong>Testing the AI Agent</strong></h2>
<p>Once the LangGraph is set up, you can test the agent by simulating user inputs. This ensures the tools and workflows are functioning as expected.</p>
<h3 id="heading-interactive-testing"><strong>Interactive Testing</strong></h3>
<p>Run the following code to test your AI agent interactively:</p>
<pre><code class="lang-python">config = {<span class="hljs-string">"configurable"</span>: {<span class="hljs-string">"thread_id"</span>: <span class="hljs-string">"1"</span>}}

<span class="hljs-keyword">while</span> <span class="hljs-literal">True</span>:
    user_input = input(<span class="hljs-string">"User: "</span>)
    <span class="hljs-keyword">if</span> user_input.lower() <span class="hljs-keyword">in</span> [<span class="hljs-string">"quit"</span>, <span class="hljs-string">"exit"</span>, <span class="hljs-string">"q"</span>]:
        print(<span class="hljs-string">"Goodbye!"</span>)
        <span class="hljs-keyword">break</span>

    <span class="hljs-keyword">for</span> event <span class="hljs-keyword">in</span> graph.stream({<span class="hljs-string">"messages"</span>: [(<span class="hljs-string">"user"</span>, user_input)]}, config):
        <span class="hljs-keyword">for</span> value <span class="hljs-keyword">in</span> event.values():
            print(<span class="hljs-string">"Assistant:"</span>, value[<span class="hljs-string">"messages"</span>][<span class="hljs-number">-1</span>].content)
</code></pre>
<p>You can ask questions based on your HubSpot records, for example:</p>
<ul>
<li><p><em>"What is the current status of ‘ABC’ Deal?"</em></p>
</li>
<li><p><em>"Tell me the names of any two contacts associated with company ‘XYZ’."</em> (triggers the Search tool)</p>
</li>
</ul>
<p>The AI agent will invoke the appropriate tool to generate responses.</p>
<h3 id="heading-visualizing-the-graph"><strong>Visualizing the Graph</strong></h3>
<p>We can easily plot a flowchart of our LangGraph Workflow.</p>
<pre><code class="lang-python"><span class="hljs-keyword">from</span> IPython.display <span class="hljs-keyword">import</span> Image, display

<span class="hljs-keyword">try</span>:
    display(Image(graph.get_graph().draw_mermaid_png()))
<span class="hljs-keyword">except</span> Exception:
    print(<span class="hljs-string">"Error generating graph visualization."</span>)
</code></pre>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1732629457981/85bce952-5850-45c4-86cf-20964c6ab41a.png?auto=compress,format&amp;format=webp" alt class="image--center mx-auto" /></p>
<p><strong>Steps in the Workflow:</strong></p>
<ol>
<li><p><strong>Receive Query</strong>: A user inputs a query, such as "<em>Update the status of Deal X to Closed Won.</em>"</p>
</li>
<li><p><strong>Interpretation</strong>: The agent uses LLMs to determine the query's intent and required action.</p>
</li>
<li><p><strong>Tool Selection</strong>: Based on the query, the agent selects the appropriate custom HubSpot AI tools.</p>
</li>
<li><p><strong>Execution</strong>: The tool executes the requested operation, and you can see real-time updates in the HubSpot CRM.</p>
</li>
<li><p><strong>Feedback</strong>: Results are shared with the user in an easily understandable format.</p>
</li>
</ol>
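<p>The steps above can be sketched in plain Python. This is a conceptual illustration only: <code>interpret</code>, <code>handle</code>, and the stub tools are hypothetical stand-ins for the real LLM and HubSpot tools, not the actual implementation.</p>

```python
# Hypothetical stand-ins for the real LLM and HubSpot tools (illustration only)
def interpret(query: str) -> str:
    # Step 2 (Interpretation): a stand-in for the LLM's intent detection
    return "update_deal" if "update" in query.lower() else "search"

TOOLS = {
    "update_deal": lambda q: "Deal X set to Closed Won.",
    "search": lambda q: "Found 2 matching contacts.",
}

def handle(query: str) -> str:
    intent = interpret(query)      # Step 2: Interpretation
    tool = TOOLS[intent]           # Step 3: Tool Selection
    result = tool(query)           # Step 4: Execution
    return f"Assistant: {result}"  # Step 5: Feedback

print(handle("Update the status of Deal X to Closed Won."))
# Assistant: Deal X set to Closed Won.
```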
<h2 id="heading-testing-and-refinement"><strong>Testing and Refinement</strong></h2>
<p>Leveraging the insights we've gained delivering HubSpot API integration services for our clients, we understand that thorough testing and refinement are essential:</p>
<ul>
<li><p><strong>Simulate Real-World Scenarios</strong>: Use varied queries to validate the agent’s ability to handle different tasks.</p>
</li>
<li><p><strong>Validate API Operations</strong>: Ensure operations are performed correctly and efficiently.</p>
</li>
<li><p><strong>Iterate and Improve</strong>: Gather user feedback and refine the agent’s workflows and capabilities over time.</p>
</li>
</ul>
<p>While building and improving this agent, you may face challenges similar to the ones we faced, such as handling complex queries or ambiguous questions. Robust error handling goes a long way toward keeping this type of AI application reliable.</p>
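<p>As a rough illustration of that kind of error handling, a wrapper can catch tool failures and turn them into a message the agent can relay. <code>safe_tool_call</code> and <code>update_deal</code> below are hypothetical helpers, not part of the HubSpot API:</p>

```python
# Hypothetical helpers for illustration; not part of the HubSpot API
def safe_tool_call(tool_fn, **kwargs):
    """Run a tool and convert failures into a user-facing message."""
    try:
        return tool_fn(**kwargs)
    except Exception as exc:
        return f"Tool failed: {exc}. Please rephrase or narrow your request."

def update_deal(deal_id: str) -> str:
    if not deal_id:
        raise ValueError("missing deal_id")
    return f"Deal {deal_id} updated."

print(safe_tool_call(update_deal, deal_id="ABC"))  # Deal ABC updated.
print(safe_tool_call(update_deal, deal_id=""))     # Tool failed: missing deal_id. ...
```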
<h2 id="heading-conclusion"><strong>Conclusion</strong></h2>
<p>Drawing on the experiences we’ve gained delivering tailored AI solutions to businesses, we can confidently say that the <strong>Hubspot Agent</strong> is more than just a tool—it’s an intelligent assistant that simplifies CRM management, boosts productivity, and saves valuable time. Automating repetitive tasks and enabling intelligent decision-making allows businesses to focus on what matters most—building strong customer relationships and driving growth. Looking ahead, the principles and technologies used in building the HubSpot AI Agent can also be extended to other areas such as Marketing and CMS, enabling even broader automation and intelligent data interaction across your business operations.</p>
<p>If you’re ready to take your HubSpot experience to the next level, building a <strong>Hubspot</strong> Agent is the way forward. It’s not just a CRM upgrade—it’s a smarter, faster way to achieve your business goals.</p>
<p>If you found this guide useful and want to explore more advanced techniques, don’t forget to check out our other <a target="_blank" href="https://blog.futuresmart.ai/"><strong>tutorials</strong></a>. At FutureSmart AI, we help businesses develop state-of-the-art AI solutions tailored to their needs. To see a demo or discuss your requirements, contact us at <a target="_blank" href="mailto:contact@futuresmart.ai">contact@futuresmart.ai</a>.</p>
<p>For real-world examples of how we’ve helped businesses, take a look at our <a target="_blank" href="https://www.futuresmart.ai/case-studies"><strong>case studies</strong></a>, where we showcase the practical value of our expertise.</p>
]]></content:encoded></item><item><title><![CDATA[How to Build a Custom AI Agent Using LangGraph with RAG, NL2SQL and Web Search]]></title><description><![CDATA[Introduction
In this blog post, we will walk you through the process of creating a custom AI agent with three powerful tools: Web Search, Retrieval-Augmented Generation (RAG), and Natural Language to SQL (NL2SQL), all integrated within the LangGraph ...]]></description><link>https://blog.futuresmart.ai/langgraph-agent-with-rag-and-nl2sql</link><guid isPermaLink="true">https://blog.futuresmart.ai/langgraph-agent-with-rag-and-nl2sql</guid><category><![CDATA[langchain]]></category><category><![CDATA[AI]]></category><category><![CDATA[RAG ]]></category><category><![CDATA[SQL]]></category><category><![CDATA[agentic rag]]></category><category><![CDATA[NL2SQL]]></category><category><![CDATA[langgraph]]></category><dc:creator><![CDATA[Rounak Show]]></dc:creator><pubDate>Fri, 29 Nov 2024 06:22:30 GMT</pubDate><enclosure url="https://cdn.hashnode.com/res/hashnode/image/upload/v1732644985329/c1897453-313c-4727-be99-916589de888f.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h2 id="heading-introduction"><strong>Introduction</strong></h2>
<p>In this blog post, we will walk you through the process of creating a custom AI agent with three powerful tools: Web Search, <a target="_blank" href="https://blog.futuresmart.ai/master-rag-with-langchain-a-practical-guide">Retrieval-Augmented Generation (RAG)</a>, and <a target="_blank" href="https://blog.futuresmart.ai/mastering-natural-language-to-sql-with-langchain-nl2sql">Natural Language to SQL (NL2SQL)</a>, all integrated within the <strong>LangGraph</strong> framework. This guide is designed to provide you with a practical, step-by-step approach to building a fully functional AI agent capable of performing complex tasks such as retrieving real-time data from the web, generating responses based on retrieved information from the knowledge base, and translating natural language queries into SQL database queries. By the end of this tutorial, you will have a working AI agent equipped to handle these diverse functionalities seamlessly.</p>
<p>In the <strong>previous</strong> <a target="_blank" href="https://blog.futuresmart.ai/langgraph-tutorial-for-beginners"><strong>LangGraph Tutorial blog</strong></a>, we explored the foundational concepts behind LangGraph, focusing on creating simple AI agents by defining nodes and edges. This time, we’re taking things a step further. We will incorporate additional tools that extend the agent’s capabilities, allowing it to interact with the web, retrieve specific information from a knowledge base, and query SQL databases through natural language commands. This comprehensive guide is aimed at developers looking to build customizable, task-oriented AI agents that can be deployed in various real-world applications.</p>
<p>We have already built a similar <a target="_blank" href="https://blog.futuresmart.ai/openai-swarm-a-hands-on-introduction">AI Agent using OpenAI Swarm</a>. This time, we are going to do this using LangGraph.</p>
<p>Let’s dive into the setup process and start by building the environment for our AI agent!</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1732639865242/450250f2-b59c-4bab-af75-cf4367c2e61b.png" alt="LangGraph Agent with WebSearch, RAG, NL2SQL Tool" class="image--center mx-auto" /></p>
<h2 id="heading-setting-up-the-environment">Setting up the Environment</h2>
<p>To get started, install the dependencies below:</p>
<pre><code class="lang-python">!pip install -U langchain langchain-chroma pypdf sentence-transformers langgraph langchain_openai langchain_community
</code></pre>
<h2 id="heading-setting-up-api-keys">Setting up API Keys</h2>
<p>Before diving into building your LangGraph AI agent, it's crucial to set up your API keys. These keys allow your agent to interact with external tools like <a target="_blank" href="https://tavily.com/"><strong>Tavily Search</strong></a> and <a target="_blank" href="https://openai.com/index/openai-api/"><strong>OpenAI GPT</strong></a> models securely. Without them, the tools cannot function effectively.</p>
<p>Here we are using an OpenAI model, but you can use any LLM of your choice.</p>
<pre><code class="lang-python"><span class="hljs-keyword">import</span> getpass
<span class="hljs-keyword">import</span> os

<span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">_set_env</span>(<span class="hljs-params">var: str</span>):</span>
    <span class="hljs-keyword">if</span> <span class="hljs-keyword">not</span> os.environ.get(var):
        os.environ[var] = getpass.getpass(<span class="hljs-string">f"<span class="hljs-subst">{var}</span>: "</span>)

_set_env(<span class="hljs-string">"TAVILY_API_KEY"</span>)
_set_env(<span class="hljs-string">"OPENAI_API_KEY"</span>)
</code></pre>
<h2 id="heading-creating-the-llm-object">Creating the LLM Object</h2>
<p>Here’s how to initialize the LLM using LangChain’s <code>ChatOpenAI</code>:</p>
<pre><code class="lang-python"><span class="hljs-keyword">from</span> langchain_openai <span class="hljs-keyword">import</span> ChatOpenAI

llm = ChatOpenAI(model_name=<span class="hljs-string">"gpt-4o"</span>)
</code></pre>
<h2 id="heading-websearch-tool-integration">WebSearch Tool Integration</h2>
<p>Integrating a WebSearch tool allows your AI agent to fetch real-time information from the web. At <a target="_blank" href="https://www.futuresmart.ai/">FutureSmart AI</a>, we generally use the <code>TavilySearch</code> tool for quick and accurate web results, but you can use any web search tool of your choice.</p>
<pre><code class="lang-python"><span class="hljs-keyword">from</span> langchain_community.tools.tavily_search <span class="hljs-keyword">import</span> TavilySearchResults

web_search_tool = TavilySearchResults(max_results=<span class="hljs-number">2</span>)
</code></pre>
<h2 id="heading-agentic-rag-tool-integration"><strong>Agentic RAG Tool Integration</strong></h2>
<p>The <a target="_blank" href="https://blog.futuresmart.ai/langchain-rag-from-basics-to-production-ready-rag-chatbot">Retrieval-Augmented Generation (RAG)</a> tool enhances your AI agent by enabling it to fetch documents relevant to the user's questions. As a result, users get more accurate and context-rich responses.</p>
<h4 id="heading-steps-to-integrate-the-rag-tool"><strong>Steps to Integrate the RAG Tool</strong></h4>
<ol>
<li><p><strong>Load Documents</strong>: Use the <code>PyPDFLoader</code> and <code>Docx2txtLoader</code> to load documents from a folder.</p>
<pre><code class="lang-python"> <span class="hljs-keyword">from</span> typing <span class="hljs-keyword">import</span> List
 <span class="hljs-keyword">from</span> langchain_core.documents <span class="hljs-keyword">import</span> Document
 <span class="hljs-keyword">from</span> langchain_community.document_loaders <span class="hljs-keyword">import</span> PyPDFLoader, Docx2txtLoader

 <span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">load_documents</span>(<span class="hljs-params">folder_path: str</span>) -&gt; List[Document]:</span>
     documents = []
     <span class="hljs-keyword">for</span> filename <span class="hljs-keyword">in</span> os.listdir(folder_path):
         file_path = os.path.join(folder_path, filename)
         <span class="hljs-keyword">if</span> filename.endswith(<span class="hljs-string">'.pdf'</span>):
             loader = PyPDFLoader(file_path)
         <span class="hljs-keyword">elif</span> filename.endswith(<span class="hljs-string">'.docx'</span>):
             loader = Docx2txtLoader(file_path)
         <span class="hljs-keyword">else</span>:
             print(<span class="hljs-string">f"Unsupported file type: <span class="hljs-subst">{filename}</span>"</span>)
             <span class="hljs-keyword">continue</span>
         documents.extend(loader.load())
     <span class="hljs-keyword">return</span> documents

 folder_path = <span class="hljs-string">"/content/docs"</span>
 documents = load_documents(folder_path)
 print(<span class="hljs-string">f"Loaded <span class="hljs-subst">{len(documents)}</span> documents from the folder."</span>)
</code></pre>
<pre><code class="lang-python"> <span class="hljs-comment"># Output</span>
 Loaded <span class="hljs-number">2</span> documents <span class="hljs-keyword">from</span> the folder.
</code></pre>
</li>
<li><p><strong>Split Text into Chunks</strong>: Prepare documents for vectorization by splitting them into manageable chunks.</p>
<pre><code class="lang-python"> <span class="hljs-keyword">from</span> langchain_text_splitters <span class="hljs-keyword">import</span> RecursiveCharacterTextSplitter

 text_splitter = RecursiveCharacterTextSplitter(
     chunk_size=<span class="hljs-number">1000</span>,
     chunk_overlap=<span class="hljs-number">200</span>,
     length_function=len
 )

 splits = text_splitter.split_documents(documents)
 print(<span class="hljs-string">f"Split the documents into <span class="hljs-subst">{len(splits)}</span> chunks."</span>)
</code></pre>
<pre><code class="lang-python"> <span class="hljs-comment"># Output</span>
 Split the documents into <span class="hljs-number">4</span> chunks.
</code></pre>
</li>
<li><p><strong>Generate Embeddings</strong>: Use <a target="_blank" href="https://blog.futuresmart.ai/i-never-knew-sentence-transformers-could-be-so-useful">SentenceTransformers</a> to create embeddings for efficient similarity searches.</p>
<pre><code class="lang-python"> <span class="hljs-keyword">from</span> langchain_community.embeddings.sentence_transformer <span class="hljs-keyword">import</span> SentenceTransformerEmbeddings

 embedding_function = SentenceTransformerEmbeddings(model_name=<span class="hljs-string">"all-MiniLM-L6-v2"</span>)
</code></pre>
</li>
<li><p><strong>Create and Persist a Vector Store</strong>:</p>
<pre><code class="lang-python"> <span class="hljs-keyword">from</span> langchain_chroma <span class="hljs-keyword">import</span> Chroma

 collection_name = <span class="hljs-string">"my_collection"</span>
 vectorstore = Chroma.from_documents(
     collection_name=collection_name,
     documents=splits,
     embedding=embedding_function,
     persist_directory=<span class="hljs-string">"./chroma_db"</span>
 )
</code></pre>
</li>
<li><p><strong>Build the Retriever Tool</strong>:</p>
<pre><code class="lang-python"> <span class="hljs-keyword">from</span> langchain.tools <span class="hljs-keyword">import</span> tool
 <span class="hljs-keyword">from</span> pydantic <span class="hljs-keyword">import</span> BaseModel

 <span class="hljs-class"><span class="hljs-keyword">class</span> <span class="hljs-title">RagToolSchema</span>(<span class="hljs-params">BaseModel</span>):</span>
     question: str

<span class="hljs-meta"> @tool(args_schema=RagToolSchema)</span>
 <span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">retriever_tool</span>(<span class="hljs-params">question</span>):</span>
   <span class="hljs-string">"""Tool to Retrieve Semantically Similar documents to answer User Questions related to FutureSmart AI"""</span>
   print(<span class="hljs-string">"INSIDE RETRIEVER NODE"</span>)
   retriever = vectorstore.as_retriever(search_kwargs={<span class="hljs-string">"k"</span>: <span class="hljs-number">2</span>})
   retriever_results = retriever.invoke(question)
   <span class="hljs-keyword">return</span> <span class="hljs-string">"\n\n"</span>.join(doc.page_content <span class="hljs-keyword">for</span> doc <span class="hljs-keyword">in</span> retriever_results)
</code></pre>
</li>
</ol>
<p>This tool allows your AI agent to retrieve relevant chunks of information from your document database, making it highly effective for knowledge-based tasks.</p>
<p>You can test the <code>retriever</code> right now to see how it performs:</p>
<pre><code class="lang-python">retriever = vectorstore.as_retriever(search_kwargs={<span class="hljs-string">"k"</span>: <span class="hljs-number">2</span>})
<span class="hljs-comment"># pass question</span>
retriever_results = retriever.invoke(<span class="hljs-string">"Who is the founder of Futuresmart AI?"</span>)
print(retriever_results)
</code></pre>
<pre><code class="lang-python"><span class="hljs-comment"># Output</span>
[Document(metadata={<span class="hljs-string">'page'</span>: <span class="hljs-number">1</span>, <span class="hljs-string">'source'</span>: <span class="hljs-string">'/content/docs/FutureSmart AI .pdf'</span>}, page_content=<span class="hljs-string">'FutureSmart AI provides customized speech to text services, employing cutting-\nedge speech recognition technologies to cater to specific client needs. Ideal for \ncreating efficient documentation and enabling voice-driven commands, this \nsolution boosts productivity and accessibility.'</span>), 
Document(metadata={<span class="hljs-string">'page'</span>: <span class="hljs-number">0</span>, <span class="hljs-string">'source'</span>: <span class="hljs-string">'/content/docs/FutureSmart AI .pdf'</span>}, page_content=<span class="hljs-string">'FutureSmart AI provides custom Natural Language Processing (NLP) \nsolutions for companies looking to get ahead of the future. Our \ndedicated team of Data Scientists and ML Engineers provides an end-\nto-end solution from data labeling to modeling and deploying an ML \nmodel tailored to your specific use case. \nFounder: Pradip Nichite \n \nServices: \nText Classification \nAt FutureSmart AI, we develop custom text classification solutions using \nadvanced NLP techniques tailored to your specific business requirements. \nLeveraging Python, Pytorch, and Hugging Face transformers, we enable precise \ndata categorization across applications such as intent detection, document \ncategorization, and sentiment analysis, enhancing your decision-making \nprocesses and operational efficiency. \n \nChatbots \nWe specialize in creating custom chatbots that integrate seamlessly with your \nbusiness environment. Using semantic search and large language models, our'</span>)]
</code></pre>
<p>If you're interested in building RAG from scratch, <a target="_blank" href="https://youtu.be/8sSHg1034r0">this video</a> is for you.</p>
<h2 id="heading-nl2sql-tool-integration"><strong>NL2SQL Tool Integration</strong></h2>
<p>Now that the Web Search and RAG tools are ready for the AI agent, we only need to build the NL2SQL tool.</p>
<p>The <strong>SQL Agent</strong> bridges the gap between human language and SQL databases by generating and executing SQL queries from natural language questions. It enables your AI agent to answer database-related queries efficiently.</p>
<h4 id="heading-steps-to-integrate-the-nl2sql-tool"><strong>Steps to Integrate the NL2SQL Tool</strong></h4>
<ol>
<li><p><strong>Set Up the Database</strong><br /> We are using the Chinook SQLite database as a sample dataset for testing SQL queries. Check this blog to learn <a target="_blank" href="https://blog.futuresmart.ai/mastering-natural-language-to-sql-with-langchain-nl2sql">how to set up your own database</a> for testing.</p>
<pre><code class="lang-bash"> !wget https://github.com/lerocha/chinook-database/raw/master/ChinookDatabase/DataSources/Chinook_Sqlite.sqlite
 !mv Chinook_Sqlite.sqlite Chinook.db
</code></pre>
</li>
<li><p><strong>Initialize the Database Connection</strong><br /> Connect to the SQLite database using the LangChain <code>SQLDatabase</code> utility:</p>
<pre><code class="lang-python"> <span class="hljs-keyword">from</span> langchain_community.utilities <span class="hljs-keyword">import</span> SQLDatabase

 db = SQLDatabase.from_uri(<span class="hljs-string">"sqlite:///Chinook.db"</span>)
</code></pre>
</li>
<li><p><strong>Clean SQL Queries</strong><br /> This function is very important. In many of our clients’ projects, the SQL query generated by the LLM contains unnecessary symbols, text, backticks, and similar artifacts, which cause an error when the query is executed. We therefore need a function to strip out this extra text.</p>
</li>
</ol>
<pre><code class="lang-python"><span class="hljs-keyword">import</span> re

<span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">clean_sql_query</span>(<span class="hljs-params">text: str</span>) -&gt; str:</span>
    <span class="hljs-string">"""
    Clean SQL query by removing code block syntax, various SQL tags, backticks,
    prefixes, and unnecessary whitespace while preserving the core SQL query.

    Args:
        text (str): Raw SQL query text that may contain code blocks, tags, and backticks

    Returns:
        str: Cleaned SQL query
    """</span>
    <span class="hljs-comment"># Step 1: Remove code block syntax and any SQL-related tags</span>
    <span class="hljs-comment"># This handles variations like ```sql, ```SQL, ```SQLQuery, etc.</span>
    block_pattern = <span class="hljs-string">r"```(?:sql|SQL|SQLQuery|mysql|postgresql)?\s*(.*?)\s*```"</span>
    text = re.sub(block_pattern, <span class="hljs-string">r"\1"</span>, text, flags=re.DOTALL)

    <span class="hljs-comment"># Step 2: Handle "SQLQuery:" prefix and similar variations</span>
    <span class="hljs-comment"># This will match patterns like "SQLQuery:", "SQL Query:", "MySQL:", etc.</span>
    prefix_pattern = <span class="hljs-string">r"^(?:SQL\s*Query|SQLQuery|MySQL|PostgreSQL|SQL)\s*:\s*"</span>
    text = re.sub(prefix_pattern, <span class="hljs-string">""</span>, text, flags=re.IGNORECASE)

    <span class="hljs-comment"># Step 3: Extract the first SQL statement if there's random text after it</span>
    <span class="hljs-comment"># Look for a complete SQL statement ending with semicolon</span>
    sql_statement_pattern = <span class="hljs-string">r"(SELECT.*?;)"</span>
    sql_match = re.search(sql_statement_pattern, text, flags=re.IGNORECASE | re.DOTALL)
    <span class="hljs-keyword">if</span> sql_match:
        text = sql_match.group(<span class="hljs-number">1</span>)

    <span class="hljs-comment"># Step 4: Remove backticks around identifiers</span>
    text = re.sub(<span class="hljs-string">r'`([^`]*)`'</span>, <span class="hljs-string">r'\1'</span>, text)

    <span class="hljs-comment"># Step 5: Normalize whitespace</span>
    <span class="hljs-comment"># Replace multiple spaces with single space</span>
    text = re.sub(<span class="hljs-string">r'\s+'</span>, <span class="hljs-string">' '</span>, text)

    <span class="hljs-comment"># Step 6: Preserve newlines for main SQL keywords to maintain readability</span>
    keywords = [<span class="hljs-string">'SELECT'</span>, <span class="hljs-string">'FROM'</span>, <span class="hljs-string">'WHERE'</span>, <span class="hljs-string">'GROUP BY'</span>, <span class="hljs-string">'HAVING'</span>, <span class="hljs-string">'ORDER BY'</span>,
               <span class="hljs-string">'LIMIT'</span>, <span class="hljs-string">'JOIN'</span>, <span class="hljs-string">'LEFT JOIN'</span>, <span class="hljs-string">'RIGHT JOIN'</span>, <span class="hljs-string">'INNER JOIN'</span>,
               <span class="hljs-string">'OUTER JOIN'</span>, <span class="hljs-string">'UNION'</span>, <span class="hljs-string">'VALUES'</span>, <span class="hljs-string">'INSERT'</span>, <span class="hljs-string">'UPDATE'</span>, <span class="hljs-string">'DELETE'</span>]

    <span class="hljs-comment"># Case-insensitive replacement for keywords</span>
    pattern = <span class="hljs-string">'|'</span>.join(<span class="hljs-string">r'\b{}\b'</span>.format(k) <span class="hljs-keyword">for</span> k <span class="hljs-keyword">in</span> keywords)
    text = re.sub(<span class="hljs-string">f'(<span class="hljs-subst">{pattern}</span>)'</span>, <span class="hljs-string">r'\n\1'</span>, text, flags=re.IGNORECASE)

    <span class="hljs-comment"># Step 7: Final cleanup</span>
    <span class="hljs-comment"># Remove leading/trailing whitespace and extra newlines</span>
    text = text.strip()
    text = re.sub(<span class="hljs-string">r'\n\s*\n'</span>, <span class="hljs-string">'\n'</span>, text)

    <span class="hljs-keyword">return</span> text
</code></pre>
<blockquote>
<p>We created this function to clean up the LLM-generated SQL query output. If you have a better one, feel free to share it with everyone in the comments 🙂</p>
</blockquote>
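<p>To see the first two cleanup steps in action, here is a small standalone demo on a hypothetical raw LLM output (the example string is made up; the regexes mirror Steps 1 and 2 of <code>clean_sql_query</code>):</p>

```python
import re

# Hypothetical raw LLM output: a fenced code block plus a "SQLQuery:" prefix
raw = "SQLQuery: ```sql\nSELECT COUNT(*) FROM Employee;\n```"

# Step 1: strip the fenced code block, keeping only its contents
text = re.sub(r"```(?:sql|SQL|SQLQuery|mysql|postgresql)?\s*(.*?)\s*```",
              r"\1", raw, flags=re.DOTALL)
# Step 2: strip the leading "SQLQuery:"-style prefix
text = re.sub(r"^(?:SQL\s*Query|SQLQuery|MySQL|PostgreSQL|SQL)\s*:\s*",
              "", text, flags=re.IGNORECASE)
print(text)  # SELECT COUNT(*) FROM Employee;
```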
<ol start="4">
<li><p><strong>Create the NL2SQL Tool</strong><br /> Define a tool that generates and executes SQL queries:</p>
<pre><code class="lang-python"> <span class="hljs-keyword">from</span> langchain.chains <span class="hljs-keyword">import</span> create_sql_query_chain
 <span class="hljs-keyword">from</span> langchain_community.tools.sql_database.tool <span class="hljs-keyword">import</span> QuerySQLDataBaseTool
 <span class="hljs-keyword">from</span> operator <span class="hljs-keyword">import</span> itemgetter
 <span class="hljs-keyword">import</span> re
 <span class="hljs-keyword">from</span> langchain_core.output_parsers <span class="hljs-keyword">import</span> StrOutputParser
 <span class="hljs-keyword">from</span> langchain_core.prompts <span class="hljs-keyword">import</span> PromptTemplate
 <span class="hljs-keyword">from</span> langchain_core.runnables <span class="hljs-keyword">import</span> RunnablePassthrough, RunnableLambda

 <span class="hljs-class"><span class="hljs-keyword">class</span> <span class="hljs-title">SQLToolSchema</span>(<span class="hljs-params">BaseModel</span>):</span>
     question: str

<span class="hljs-meta"> @tool(args_schema=SQLToolSchema)</span>
 <span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">nl2sql_tool</span>(<span class="hljs-params">question</span>):</span>
   <span class="hljs-string">"""Tool to Generate and Execute SQL Query to answer User Questions related to chinook DB"""</span>
   print(<span class="hljs-string">"INSIDE NL2SQL TOOL"</span>)
   execute_query = QuerySQLDataBaseTool(db=db)
   write_query = create_sql_query_chain(llm, db)

   chain = (
       RunnablePassthrough.assign(query=write_query | RunnableLambda(clean_sql_query)).assign(
           result=itemgetter(<span class="hljs-string">"query"</span>) | execute_query
       )
   )

   response = chain.invoke({<span class="hljs-string">"question"</span>: question})
   <span class="hljs-keyword">return</span> response[<span class="hljs-string">'result'</span>]
</code></pre>
</li>
<li><p><strong>Test the Tool</strong><br /> Use a sample query to verify functionality:</p>
<pre><code class="lang-python"> question = <span class="hljs-string">"How many employees are there?"</span>
 result = nl2sql_tool.invoke({<span class="hljs-string">"question"</span>: question})
 print(<span class="hljs-string">f"Answer: <span class="hljs-subst">{result}</span>"</span>)
</code></pre>
<pre><code class="lang-python"> <span class="hljs-comment"># output</span>
 INSIDE NL2SQL TOOL
 Question: How many employees are there?
 Answer: [(<span class="hljs-number">8</span>,)]
</code></pre>
</li>
</ol>
<h2 id="heading-combining-the-tools"><strong>Combining the Tools</strong></h2>
<p>To create a fully functional LangGraph AI agent, you need to bind the WebSearch, RAG, and NL2SQL tools to the LLM. You then get a search agent, a RAG agent, and an SQL agent, all in one.</p>
<p>Here’s how to combine the tools:</p>
<pre><code class="lang-python">tools = [web_search_tool, retriever_tool, nl2sql_tool]
llm_with_tools = llm.bind_tools(tools)
</code></pre>
<p>With this configuration, your LLM can invoke the appropriate tool based on the user’s query, ensuring dynamic and context-aware responses.</p>
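<p>Under the hood, a tools-bound LLM responds with a tool name and arguments, and the runtime dispatches to the matching function. Here is a rough pure-Python sketch of that dispatch; the stub tools and the hard-coded <code>tool_call</code> are illustrative, not real model output:</p>

```python
# Stub tools standing in for the real web_search_tool, retriever_tool, nl2sql_tool
def web_search_tool(query: str) -> str:
    return f"web results for {query!r}"

def retriever_tool(question: str) -> str:
    return f"docs about {question!r}"

def nl2sql_tool(question: str) -> str:
    return "[(8,)]"

TOOLS = {f.__name__: f for f in (web_search_tool, retriever_tool, nl2sql_tool)}

# Pretend the LLM emitted this tool call for a database question
tool_call = {"name": "nl2sql_tool", "args": {"question": "How many employees are there?"}}
result = TOOLS[tool_call["name"]](**tool_call["args"])
print(result)  # [(8,)]
```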
<h2 id="heading-building-the-langgraph"><strong>Building the LangGraph</strong></h2>
<p>LangGraph enables you to define a stateful workflow for your AI agent. By structuring nodes and edges, you can define how the agent processes user inputs and transitions between tools.</p>
<h3 id="heading-steps-to-build-the-langgraph"><strong>Steps to Build the LangGraph</strong></h3>
<ol>
<li><p><strong>Define the State</strong>: Create a <code>State</code> dictionary to manage the agent’s inputs and outputs.</p>
<pre><code class="lang-python"> <span class="hljs-keyword">from</span> typing <span class="hljs-keyword">import</span> Annotated
 <span class="hljs-keyword">from</span> typing_extensions <span class="hljs-keyword">import</span> TypedDict
 <span class="hljs-keyword">from</span> langgraph.graph <span class="hljs-keyword">import</span> StateGraph
 <span class="hljs-keyword">from</span> langgraph.graph.message <span class="hljs-keyword">import</span> add_messages

 <span class="hljs-class"><span class="hljs-keyword">class</span> <span class="hljs-title">State</span>(<span class="hljs-params">TypedDict</span>):</span>
     messages: Annotated[list, add_messages]
</code></pre>
</li>
<li><p><strong>Add Nodes</strong>: Add nodes for the chatbot and tools to handle user queries and invoke the tools.</p>
<pre><code class="lang-python"> <span class="hljs-keyword">from</span> langgraph.prebuilt <span class="hljs-keyword">import</span> ToolNode

 <span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">chatbot</span>(<span class="hljs-params">state: State</span>):</span>
     <span class="hljs-keyword">return</span> {<span class="hljs-string">"messages"</span>: [llm_with_tools.invoke(state[<span class="hljs-string">"messages"</span>])]}

 graph_builder = StateGraph(State)
 graph_builder.add_node(<span class="hljs-string">"chatbot"</span>, chatbot)

 tool_node = ToolNode(tools=[web_search_tool, retriever_tool, nl2sql_tool])
 graph_builder.add_node(<span class="hljs-string">"tools"</span>, tool_node)
</code></pre>
</li>
<li><p><strong>Define Edges</strong>: Use conditional edges to determine when the agent should switch between nodes.</p>
<pre><code class="lang-python"> <span class="hljs-keyword">from</span> langgraph.prebuilt <span class="hljs-keyword">import</span> tools_condition

 graph_builder.add_conditional_edges(<span class="hljs-string">"chatbot"</span>, tools_condition)
 graph_builder.add_edge(<span class="hljs-string">"tools"</span>, <span class="hljs-string">"chatbot"</span>)
 graph_builder.set_entry_point(<span class="hljs-string">"chatbot"</span>)
</code></pre>
</li>
<li><p><strong>Compile the Graph</strong>: Finalize the graph for execution.</p>
<pre><code class="lang-python"> graph = graph_builder.compile()
</code></pre>
</li>
</ol>
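<p>Conceptually, the compiled graph runs a chatbot → tools → chatbot loop until the model stops requesting tools. Here is a pure-Python sketch of that control flow under simplifying assumptions (<code>fake_llm</code> and <code>fake_tool</code> are made-up stand-ins; in LangGraph the routing is handled by <code>tools_condition</code>):</p>

```python
def fake_llm(messages):
    # Stand-in for the model: request a tool first, answer once a tool result exists
    if not any(role == "tool" for role, _ in messages):
        return ("assistant", {"tool_call": "nl2sql_tool", "args": "count employees"})
    return ("assistant", "There are 8 employees.")

def fake_tool(args):
    return ("tool", "[(8,)]")

def run(user_input):
    messages = [("user", user_input)]
    while True:
        role, content = fake_llm(messages)               # "chatbot" node
        messages.append((role, content))
        if isinstance(content, dict):                    # tools_condition: tool requested?
            messages.append(fake_tool(content["args"]))  # "tools" node
            continue                                     # edge back to "chatbot"
        return content                                   # END

print(run("How many employees are there?"))  # There are 8 employees.
```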
<h2 id="heading-testing-the-ai-agent"><strong>Testing the AI Agent</strong></h2>
<p>Once the LangGraph is set up, you can test the agent by simulating user inputs. This ensures the tools and workflows are functioning as expected.</p>
<h3 id="heading-interactive-testing"><strong>Interactive Testing</strong></h3>
<p>Run the following code to test your AI agent interactively:</p>
<pre><code class="lang-python">config = {<span class="hljs-string">"configurable"</span>: {<span class="hljs-string">"thread_id"</span>: <span class="hljs-string">"1"</span>}}

<span class="hljs-keyword">while</span> <span class="hljs-literal">True</span>:
    user_input = input(<span class="hljs-string">"User: "</span>)
    <span class="hljs-keyword">if</span> user_input.lower() <span class="hljs-keyword">in</span> [<span class="hljs-string">"quit"</span>, <span class="hljs-string">"exit"</span>, <span class="hljs-string">"q"</span>]:
        print(<span class="hljs-string">"Goodbye!"</span>)
        <span class="hljs-keyword">break</span>

    <span class="hljs-keyword">for</span> event <span class="hljs-keyword">in</span> graph.stream({<span class="hljs-string">"messages"</span>: [(<span class="hljs-string">"user"</span>, user_input)]}, config):
        <span class="hljs-keyword">for</span> value <span class="hljs-keyword">in</span> event.values():
            print(<span class="hljs-string">"Assistant:"</span>, value[<span class="hljs-string">"messages"</span>][<span class="hljs-number">-1</span>].content)
</code></pre>
<p>You can provide queries like:</p>
<ul>
<li><p>You can ask questions based on your knowledge base (Trigger Retriever or RAG tool)</p>
</li>
<li><p>"What is the current weather in Delhi?" (Trigger Websearch tool)</p>
</li>
<li><p>"Tell me the name of any two employees from the database" (Trigger NL2SQL tool)</p>
</li>
</ul>
<p>The AI agent will invoke the appropriate tool to generate responses.</p>
<h3 id="heading-visualizing-the-langgraph"><strong>Visualizing the LangGraph</strong></h3>
<p>Visualization helps you understand the workflow of your AI agent and how it transitions between nodes and tools.</p>
<pre><code class="lang-python"><span class="hljs-keyword">from</span> IPython.display <span class="hljs-keyword">import</span> Image, display

<span class="hljs-keyword">try</span>:
    display(Image(graph.get_graph().draw_mermaid_png()))
<span class="hljs-keyword">except</span> Exception:
    print(<span class="hljs-string">"Error generating graph visualization."</span>)
</code></pre>
<p>The generated diagram will showcase nodes (<code>chatbot</code>, <code>tools</code>) and the transitions between them, providing a clear overview of your AI agent’s workflow.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1732629457981/85bce952-5850-45c4-86cf-20964c6ab41a.png" alt class="image--center mx-auto" /></p>
<div data-node-type="callout">
<div data-node-type="callout-emoji">💡</div>
<div data-node-type="callout-text">Check out the full code on our <a target="_self" href="https://github.com/PradipNichite/FutureSmart-AI-Blog/tree/main/Langgraph%20Agent%20with%20websearch%2C%20RAG%2C%20NL2SQL">GitHub</a></div>
</div>

<h2 id="heading-improvements">Improvements</h2>
<ol>
<li><p>You can implement methods like contextual retrieval and reranking to improve the performance of the Agentic RAG system.</p>
</li>
<li><p>You can try implementing few-shot examples, dynamic table selection, a custom SQL query generation chain, and similar techniques to improve the SQL Agent tool. We recommend watching this <a target="_blank" href="https://www.youtube.com/watch?v=fss6CrmQU2Y">Complete Natural Language to SQL video</a> for a better understanding.</p>
</li>
<li><p>Moreover, you can always improve the agent by refining the graph structure, writing better prompts, or adding human-in-the-loop review.</p>
</li>
</ol>
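<p>The reranking mentioned above can be sketched with a deliberately simple keyword-overlap score. This is our own toy heuristic for illustration only; a production system would typically use a cross-encoder or a dedicated reranking model instead:</p>

```python
def keyword_overlap_score(query, doc):
    """Toy relevance score: fraction of query words that appear in the document."""
    q_words = set(query.lower().split())
    d_words = set(doc.lower().split())
    return len(q_words & d_words) / len(q_words) if q_words else 0.0

def rerank(query, docs, top_k=2):
    """Re-order retrieved documents by the toy score and keep the top_k."""
    return sorted(docs, key=lambda d: keyword_overlap_score(query, d), reverse=True)[:top_k]

# Invented sample documents
docs = [
    "Our pricing page lists all subscription tiers.",
    "LangGraph agents route queries to tools.",
    "Weather data comes from the web search tool.",
]
print(rerank("which tools do LangGraph agents use", docs, top_k=1))
```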
<h2 id="heading-conclusion">Conclusion</h2>
<p>This concludes our step-by-step guide to creating a custom AI agent using LangGraph with <strong>Web Search, RAG, and NL2SQL</strong> tools. By following this tutorial, you've built an AI agent capable of performing diverse tasks such as retrieving real-time data, answering questions based on document-based knowledge, and executing SQL queries directly from natural language commands. This hands-on approach has equipped you with practical knowledge of integrating multiple tools into a stateful AI workflow using LangGraph.</p>
<p>If you found this guide useful and want to explore more advanced techniques, don’t forget to check out our other <a target="_blank" href="https://blog.futuresmart.ai/">tutorials</a>. At FutureSmart AI, we help businesses develop state-of-the-art AI solutions tailored to their needs. For inquiries, feel free to reach out to us at <a target="_blank" href="mailto:contact@futuresmart.ai">contact@futuresmart.ai</a>.</p>
<p>For real-world examples of our work, take a look at our <a target="_blank" href="https://www.futuresmart.ai/case-studies"><strong>case studies</strong></a>, where we showcase the practical value of our expertise.</p>
]]></content:encoded></item><item><title><![CDATA[Building RAG Applications Without LangChain or LlamaIndex]]></title><description><![CDATA[Introduction
In recent months, Retrieval Augmented Generation (RAG) has emerged as a powerful pattern for enhancing Large Language Models (LLMs) with private or domain-specific knowledge. While frameworks like LangChain and LlamaIndex have made RAG i...]]></description><link>https://blog.futuresmart.ai/building-rag-applications-without-langchain-or-llamaindex</link><guid isPermaLink="true">https://blog.futuresmart.ai/building-rag-applications-without-langchain-or-llamaindex</guid><category><![CDATA[vanila rag]]></category><category><![CDATA[RAG ]]></category><dc:creator><![CDATA[Sakalya Mitra]]></dc:creator><pubDate>Tue, 29 Oct 2024 11:07:59 GMT</pubDate><enclosure url="https://cdn.hashnode.com/res/hashnode/image/upload/v1730201915871/908b2207-aee1-4689-9c61-90f2c4075470.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h2 id="heading-introduction">Introduction</h2>
<p>In recent months, Retrieval Augmented Generation (RAG) has emerged as a powerful pattern for enhancing Large Language Models (LLMs) with private or domain-specific knowledge. While frameworks like LangChain and LlamaIndex have made RAG implementation more accessible, they can sometimes feel like black boxes, making debugging and customization challenging.</p>
<h3 id="heading-why-build-rag-from-scratch">Why Build RAG From Scratch?</h3>
<p>When I created a tutorial on building RAG applications with LangChain, many developers reached out with a common challenge: debugging LangChain applications was becoming increasingly difficult. The layers of abstraction, while convenient, were obscuring the underlying mechanics of how RAG actually works.</p>
<p>Consider these common pain points with framework-based RAG implementations:</p>
<ul>
<li><p><strong>Debugging Complexity</strong>: When something goes wrong, tracing the issue through multiple layers of framework abstraction can be time-consuming</p>
</li>
<li><p><strong>Documentation Overhead</strong>: Understanding framework-specific concepts often requires navigating extensive documentation, taking focus away from core RAG principles</p>
</li>
<li><p><strong>Limited Control</strong>: Framework abstractions can make it harder to customize specific components or optimize for your use case</p>
</li>
<li><p><strong>Version Dependencies</strong>: Framework updates can introduce breaking changes or compatibility issues</p>
</li>
</ul>
<h3 id="heading-a-framework-free-approach">A Framework-Free Approach</h3>
<p>In this guide, we'll build a complete RAG application from scratch using only fundamental libraries:</p>
<pre><code class="lang-python"><span class="hljs-comment"># Core dependencies</span>
chromadb        <span class="hljs-comment"># Vector database for document storage</span>
openai          <span class="hljs-comment"># LLM API access</span>
pypdf2          <span class="hljs-comment"># PDF document processing</span>
python-docx     <span class="hljs-comment"># Word document processing</span>
sentence-transformers  <span class="hljs-comment"># Text embeddings</span>
</code></pre>
<p>Our implementation will include all essential RAG components:</p>
<ol>
<li><p><strong>Document Processing</strong>: Handle multiple document formats (PDF, DOCX, TXT)</p>
</li>
<li><p><strong>Vector Storage</strong>: Implement semantic search using ChromaDB</p>
</li>
<li><p><strong>LLM Integration</strong>: Direct interaction with OpenAI's API</p>
</li>
<li><p><strong>Conversational Memory</strong>: Support for follow-up questions and context</p>
</li>
</ol>
<h3 id="heading-what-makes-this-approach-different">What Makes This Approach Different?</h3>
<p>By building without frameworks, you'll gain:</p>
<ul>
<li><p><strong>Deep Understanding</strong>: See how each component of RAG works and interacts</p>
</li>
<li><p><strong>Complete Control</strong>: Customize any part of the pipeline to suit your needs</p>
</li>
<li><p><strong>Simplified Debugging</strong>: Trace issues directly to their source</p>
</li>
<li><p><strong>Easy Maintenance</strong>: No framework-specific knowledge required</p>
</li>
</ul>
<p>Here's a high-level overview of our architecture:</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1730201310565/29bf7af6-9e58-46c1-b384-6ce8dd3e175f.png" alt class="image--center mx-auto" /></p>
<p>This architecture shows how we'll:</p>
<ul>
<li><p>Process and index documents into our knowledge base</p>
</li>
<li><p>Handle user queries through semantic search</p>
</li>
<li><p>Maintain conversation history</p>
</li>
<li><p>Generate contextual responses</p>
</li>
</ul>
<h2 id="heading-document-processing-and-indexing">Document Processing and Indexing</h2>
<p>The foundation of any RAG system is its ability to process and index documents effectively. In this section, we'll build a robust document processing pipeline that can handle multiple file formats and prepare documents for semantic search.</p>
<h3 id="heading-document-loading-utilities">Document Loading Utilities</h3>
<p>First, let's create utilities to handle different document formats. Our implementation supports PDF, DOCX, and plain text files:</p>
<pre><code class="lang-python"><span class="hljs-keyword">import</span> docx
<span class="hljs-keyword">import</span> PyPDF2
<span class="hljs-keyword">import</span> os

<span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">read_text_file</span>(<span class="hljs-params">file_path: str</span>):</span>
    <span class="hljs-string">"""Read content from a text file"""</span>
    <span class="hljs-keyword">with</span> open(file_path, <span class="hljs-string">'r'</span>, encoding=<span class="hljs-string">'utf-8'</span>) <span class="hljs-keyword">as</span> file:
        <span class="hljs-keyword">return</span> file.read()

<span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">read_pdf_file</span>(<span class="hljs-params">file_path: str</span>):</span>
    <span class="hljs-string">"""Read content from a PDF file"""</span>
    text = <span class="hljs-string">""</span>
    <span class="hljs-keyword">with</span> open(file_path, <span class="hljs-string">'rb'</span>) <span class="hljs-keyword">as</span> file:
        pdf_reader = PyPDF2.PdfReader(file)
        <span class="hljs-keyword">for</span> page <span class="hljs-keyword">in</span> pdf_reader.pages:
            text += page.extract_text() + <span class="hljs-string">"\n"</span>
    <span class="hljs-keyword">return</span> text

<span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">read_docx_file</span>(<span class="hljs-params">file_path: str</span>):</span>
    <span class="hljs-string">"""Read content from a Word document"""</span>
    doc = docx.Document(file_path)
    <span class="hljs-keyword">return</span> <span class="hljs-string">"\n"</span>.join([paragraph.text <span class="hljs-keyword">for</span> paragraph <span class="hljs-keyword">in</span> doc.paragraphs])
</code></pre>
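<p>For the plain-text path, here is a quick self-contained check (it repeats <code>read_text_file</code> so the snippet runs on its own, and writes a throwaway temp file as invented input):</p>

```python
import os
import tempfile

def read_text_file(file_path):
    """Read content from a text file (same logic as above)."""
    with open(file_path, "r", encoding="utf-8") as f:
        return f.read()

# Write a throwaway file, read it back, then clean up
with tempfile.NamedTemporaryFile("w", suffix=".txt", delete=False, encoding="utf-8") as tmp:
    tmp.write("hello rag")
    path = tmp.name

content = read_text_file(path)
print(content)  # hello rag
os.remove(path)
```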
<p>We then create a unified interface for document reading:</p>
<pre><code class="lang-python"><span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">read_document</span>(<span class="hljs-params">file_path: str</span>):</span>
    <span class="hljs-string">"""Read document content based on file extension"""</span>
    _, file_extension = os.path.splitext(file_path)
    file_extension = file_extension.lower()

    <span class="hljs-keyword">if</span> file_extension == <span class="hljs-string">'.txt'</span>:
        <span class="hljs-keyword">return</span> read_text_file(file_path)
    <span class="hljs-keyword">elif</span> file_extension == <span class="hljs-string">'.pdf'</span>:
        <span class="hljs-keyword">return</span> read_pdf_file(file_path)
    <span class="hljs-keyword">elif</span> file_extension == <span class="hljs-string">'.docx'</span>:
        <span class="hljs-keyword">return</span> read_docx_file(file_path)
    <span class="hljs-keyword">else</span>:
        <span class="hljs-keyword">raise</span> ValueError(<span class="hljs-string">f"Unsupported file format: <span class="hljs-subst">{file_extension}</span>"</span>)
</code></pre>
<h3 id="heading-text-chunking-strategy">Text Chunking Strategy</h3>
<p>Once we have the raw text, we need to split it into manageable chunks. This is crucial for two reasons:</p>
<ol>
<li><p>More precise semantic search results</p>
</li>
<li><p>Staying within LLM context windows</p>
</li>
</ol>
<p>Here's our implementation of a sentence-aware text chunker:</p>
<pre><code class="lang-python"><span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">split_text</span>(<span class="hljs-params">text: str, chunk_size: int = <span class="hljs-number">500</span></span>):</span>
    <span class="hljs-string">"""Split text into chunks while preserving sentence boundaries"""</span>
    sentences = text.replace(<span class="hljs-string">'\n'</span>, <span class="hljs-string">' '</span>).split(<span class="hljs-string">'. '</span>)
    chunks = []
    current_chunk = []
    current_size = <span class="hljs-number">0</span>

    <span class="hljs-keyword">for</span> sentence <span class="hljs-keyword">in</span> sentences:
        sentence = sentence.strip()
        <span class="hljs-keyword">if</span> <span class="hljs-keyword">not</span> sentence:
            <span class="hljs-keyword">continue</span>

        <span class="hljs-comment"># Ensure proper sentence ending</span>
        <span class="hljs-keyword">if</span> <span class="hljs-keyword">not</span> sentence.endswith(<span class="hljs-string">'.'</span>):
            sentence += <span class="hljs-string">'.'</span>

        sentence_size = len(sentence)

        <span class="hljs-comment"># Check if adding this sentence would exceed chunk size</span>
        <span class="hljs-keyword">if</span> current_size + sentence_size &gt; chunk_size <span class="hljs-keyword">and</span> current_chunk:
            chunks.append(<span class="hljs-string">' '</span>.join(current_chunk))
            current_chunk = [sentence]
            current_size = sentence_size
        <span class="hljs-keyword">else</span>:
            current_chunk.append(sentence)
            current_size += sentence_size

    <span class="hljs-comment"># Add the last chunk if it exists</span>
    <span class="hljs-keyword">if</span> current_chunk:
        chunks.append(<span class="hljs-string">' '</span>.join(current_chunk))

    <span class="hljs-keyword">return</span> chunks
</code></pre>
<p>Key features of our chunking strategy:</p>
<ul>
<li><p>Preserves sentence boundaries</p>
</li>
<li><p>Configurable chunk size (default 500 characters)</p>
</li>
<li><p>Handles varying sentence lengths</p>
</li>
<li><p>Maintains readability of chunks</p>
</li>
</ul>
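<p>A common alternative to sentence-aware chunking is fixed-size chunking with overlap, so that information near a boundary appears in both neighboring chunks. A minimal sketch (<code>split_text_with_overlap</code> and its parameters are our own addition, not part of the pipeline above):</p>

```python
def split_text_with_overlap(text, chunk_size=500, overlap=50):
    """Fixed-size character chunks; consecutive chunks share `overlap` characters."""
    if overlap >= chunk_size:
        raise ValueError("overlap must be smaller than chunk_size")
    chunks = []
    start = 0
    while start < len(text):
        chunks.append(text[start:start + chunk_size])
        start += chunk_size - overlap
    return chunks

# Invented input: a 200-character string
chunks = split_text_with_overlap("abcdefghij" * 20, chunk_size=100, overlap=20)
print(len(chunks), len(chunks[0]))  # 3 100
```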
<h2 id="heading-setting-up-chromadb">Setting Up ChromaDB</h2>
<p>First, let's initialize ChromaDB with sentence transformers for embedding:</p>
<pre><code class="lang-python"><span class="hljs-keyword">import</span> chromadb
<span class="hljs-keyword">from</span> chromadb.utils <span class="hljs-keyword">import</span> embedding_functions

<span class="hljs-comment"># Initialize ChromaDB client with persistence</span>
client = chromadb.PersistentClient(path=<span class="hljs-string">"chroma_db"</span>)

<span class="hljs-comment"># Configure sentence transformer embeddings</span>
sentence_transformer_ef = embedding_functions.SentenceTransformerEmbeddingFunction(
    model_name=<span class="hljs-string">"all-MiniLM-L6-v2"</span>
)

<span class="hljs-comment"># Create or get existing collection</span>
collection = client.get_or_create_collection(
    name=<span class="hljs-string">"documents_collection"</span>,
    embedding_function=sentence_transformer_ef
)
</code></pre>
<p>Key components explained:</p>
<ul>
<li><p><code>PersistentClient</code>: Stores embeddings on disk for data persistence</p>
</li>
<li><p><code>all-MiniLM-L6-v2</code>: A lightweight but effective embedding model</p>
</li>
<li><p><code>get_or_create_collection</code>: Handles both initial creation and subsequent access</p>
</li>
</ul>
<h2 id="heading-inserting-data-into-chromadb">Inserting Data into ChromaDB</h2>
<p>Now we'll create a complete pipeline that processes documents and prepares them for insertion into ChromaDB:</p>
<pre><code class="lang-python"><span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">process_document</span>(<span class="hljs-params">file_path: str</span>):</span>
    <span class="hljs-string">"""Process a single document and prepare it for ChromaDB"""</span>
    <span class="hljs-keyword">try</span>:
        <span class="hljs-comment"># Read the document</span>
        content = read_document(file_path)

        <span class="hljs-comment"># Split into chunks</span>
        chunks = split_text(content)

        <span class="hljs-comment"># Prepare metadata</span>
        file_name = os.path.basename(file_path)
        metadatas = [{<span class="hljs-string">"source"</span>: file_name, <span class="hljs-string">"chunk"</span>: i} <span class="hljs-keyword">for</span> i <span class="hljs-keyword">in</span> range(len(chunks))]
        ids = [<span class="hljs-string">f"<span class="hljs-subst">{file_name}</span>_chunk_<span class="hljs-subst">{i}</span>"</span> <span class="hljs-keyword">for</span> i <span class="hljs-keyword">in</span> range(len(chunks))]

        <span class="hljs-keyword">return</span> ids, chunks, metadatas
    <span class="hljs-keyword">except</span> Exception <span class="hljs-keyword">as</span> e:
        print(<span class="hljs-string">f"Error processing <span class="hljs-subst">{file_path}</span>: <span class="hljs-subst">{str(e)}</span>"</span>)
        <span class="hljs-keyword">return</span> [], [], []
</code></pre>
<p>The <code>process_document</code> function:</p>
<ol>
<li><p>Reads the document content</p>
</li>
<li><p>Splits it into chunks</p>
</li>
<li><p>Generates unique IDs for each chunk</p>
</li>
<li><p>Creates metadata including source file and chunk number</p>
</li>
</ol>
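<p>Run on invented sample data, the ID and metadata scheme looks like this (the file path and chunk texts are made up):</p>

```python
import os

# Hypothetical inputs standing in for a processed document
file_path = "docs/company_history.docx"
chunks = ["GreenGrow was founded in 2010.", "It started in Portland, Oregon."]

# Same ID/metadata scheme as process_document above
file_name = os.path.basename(file_path)
metadatas = [{"source": file_name, "chunk": i} for i in range(len(chunks))]
ids = [f"{file_name}_chunk_{i}" for i in range(len(chunks))]

print(ids)  # ['company_history.docx_chunk_0', 'company_history.docx_chunk_1']
```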
<h3 id="heading-batch-processing-for-multiple-documents">Batch Processing for Multiple Documents</h3>
<p>To handle multiple documents efficiently:</p>
<pre><code class="lang-python"><span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">add_to_collection</span>(<span class="hljs-params">collection, ids, texts, metadatas</span>):</span>
    <span class="hljs-string">"""Add documents to collection in batches"""</span>
    <span class="hljs-keyword">if</span> <span class="hljs-keyword">not</span> texts:
        <span class="hljs-keyword">return</span>

    batch_size = <span class="hljs-number">100</span>
    <span class="hljs-keyword">for</span> i <span class="hljs-keyword">in</span> range(<span class="hljs-number">0</span>, len(texts), batch_size):
        end_idx = min(i + batch_size, len(texts))
        collection.add(
            documents=texts[i:end_idx],
            metadatas=metadatas[i:end_idx],
            ids=ids[i:end_idx]
        )

<span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">process_and_add_documents</span>(<span class="hljs-params">collection, folder_path: str</span>):</span>
    <span class="hljs-string">"""Process all documents in a folder and add to collection"""</span>
    files = [os.path.join(folder_path, file) 
             <span class="hljs-keyword">for</span> file <span class="hljs-keyword">in</span> os.listdir(folder_path) 
             <span class="hljs-keyword">if</span> os.path.isfile(os.path.join(folder_path, file))]

    <span class="hljs-keyword">for</span> file_path <span class="hljs-keyword">in</span> files:
        print(<span class="hljs-string">f"Processing <span class="hljs-subst">{os.path.basename(file_path)}</span>..."</span>)
        ids, texts, metadatas = process_document(file_path)
        add_to_collection(collection, ids, texts, metadatas)
        print(<span class="hljs-string">f"Added <span class="hljs-subst">{len(texts)}</span> chunks to collection"</span>)
</code></pre>
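<p>In isolation, the batching pattern produces index windows like these (the list length of 250 is invented):</p>

```python
# Stand-alone illustration of the slicing windows used by add_to_collection
texts = [f"chunk-{i}" for i in range(250)]
batch_size = 100

batches = []
for i in range(0, len(texts), batch_size):
    end_idx = min(i + batch_size, len(texts))
    batches.append((i, end_idx))

print(batches)  # [(0, 100), (100, 200), (200, 250)]
```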
<h3 id="heading-usage-example">Usage Example</h3>
<p>Here's how to use the complete document processing pipeline:</p>
<pre><code class="lang-python"><span class="hljs-comment"># Initialize ChromaDB collection (we'll cover this in detail in the next section)</span>
collection = client.get_or_create_collection(
    name=<span class="hljs-string">"documents_collection"</span>,
    embedding_function=sentence_transformer_ef
)

<span class="hljs-comment"># Process and add documents from a folder</span>
folder_path = <span class="hljs-string">"/docs"</span>
process_and_add_documents(collection, folder_path)
</code></pre>
<p>Example output:</p>
<pre><code class="lang-plaintext">Processing GreenGrow's EcoHarvest System_ A Revolution in Farming.pdf...
Added 6 chunks to collection
Processing Company_ QuantumNext Systems.docx...
Added 2 chunks to collection
Processing GreenGrow Innovations_ Company History.docx...
Added 5 chunks to collection
Processing Company_ TechWave Innovations.docx...
Added 1 chunks to collection
Processing Company_ GreenFields BioTech.docx...
Added 2 chunks to collection
</code></pre>
<h2 id="heading-implementing-semantic-search">Implementing Semantic Search</h2>
<p>Now for the core functionality - semantic search to retrieve relevant documents:</p>
<pre><code class="lang-python"><span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">semantic_search</span>(<span class="hljs-params">collection, query: str, n_results: int = <span class="hljs-number">2</span></span>):</span>
    <span class="hljs-string">"""Perform semantic search on the collection"""</span>
    results = collection.query(
        query_texts=[query],
        n_results=n_results
    )
    <span class="hljs-keyword">return</span> results

<span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">get_context_with_sources</span>(<span class="hljs-params">results</span>):</span>
    <span class="hljs-string">"""Extract context and source information from search results"""</span>
    <span class="hljs-comment"># Combine document chunks into a single context</span>
    context = <span class="hljs-string">"\n\n"</span>.join(results[<span class="hljs-string">'documents'</span>][<span class="hljs-number">0</span>])

    <span class="hljs-comment"># Format sources with metadata</span>
    sources = [
        <span class="hljs-string">f"<span class="hljs-subst">{meta[<span class="hljs-string">'source'</span>]}</span> (chunk <span class="hljs-subst">{meta[<span class="hljs-string">'chunk'</span>]}</span>)"</span> 
        <span class="hljs-keyword">for</span> meta <span class="hljs-keyword">in</span> results[<span class="hljs-string">'metadatas'</span>][<span class="hljs-number">0</span>]
    ]

    <span class="hljs-keyword">return</span> context, sources
</code></pre>
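<p>To see what these helpers extract, you can mock a ChromaDB-style results dict (the sample data is invented) and apply the same logic inline:</p>

```python
# Mocked results in the shape ChromaDB returns: outer lists are per-query
results = {
    "documents": [["GreenGrow was founded in 2010.", "It remains mission-driven."]],
    "metadatas": [[
        {"source": "history.docx", "chunk": 0},
        {"source": "history.docx", "chunk": 4},
    ]],
}

# Same extraction logic as get_context_with_sources
context = "\n\n".join(results["documents"][0])
sources = [f"{m['source']} (chunk {m['chunk']})" for m in results["metadatas"][0]]

print(sources)  # ['history.docx (chunk 0)', 'history.docx (chunk 4)']
```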
<p>Example search usage:</p>
<pre><code class="lang-python"><span class="hljs-comment"># Perform a search</span>
query = <span class="hljs-string">"When was GreenGrow Innovations founded?"</span>
results = semantic_search(collection, query)
results
</code></pre>
<pre><code class="lang-plaintext">{'ids': [['GreenGrow Innovations_ Company History.docx_chunk_0',
   'GreenGrow Innovations_ Company History.docx_chunk_4']],
 'embeddings': None,
 'documents': [['GreenGrow Innovations was founded in 2010 by Sarah Chen...',
   'Despite its growth, GreenGrow remains committed to its original mission of ...']],
 'uris': None,
 'data': None,
 'metadatas': [[{'chunk': 0,
    'source': 'GreenGrow Innovations_ Company History.docx'},
   {'chunk': 4, 'source': 'GreenGrow Innovations_ Company History.docx'}]],
 'distances': [[0.3241303612288011, 0.5925477286632909]],
 'included': [&lt;IncludeEnum.distances: 'distances'&gt;,
  &lt;IncludeEnum.documents: 'documents'&gt;,
  &lt;IncludeEnum.metadatas: 'metadatas'&gt;]}
</code></pre>
<h3 id="heading-understanding-search-results">Understanding Search Results</h3>
<p>ChromaDB search results contain several key components:</p>
<pre><code class="lang-python"><span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">print_search_results</span>(<span class="hljs-params">results</span>):</span>
    <span class="hljs-string">"""Print formatted search results"""</span>
    print(<span class="hljs-string">"\nSearch Results:\n"</span> + <span class="hljs-string">"-"</span> * <span class="hljs-number">50</span>)

    <span class="hljs-keyword">for</span> i <span class="hljs-keyword">in</span> range(len(results[<span class="hljs-string">'documents'</span>][<span class="hljs-number">0</span>])):
        doc = results[<span class="hljs-string">'documents'</span>][<span class="hljs-number">0</span>][i]
        meta = results[<span class="hljs-string">'metadatas'</span>][<span class="hljs-number">0</span>][i]
        distance = results[<span class="hljs-string">'distances'</span>][<span class="hljs-number">0</span>][i]

        print(<span class="hljs-string">f"\nResult <span class="hljs-subst">{i + <span class="hljs-number">1</span>}</span>"</span>)
        print(<span class="hljs-string">f"Source: <span class="hljs-subst">{meta[<span class="hljs-string">'source'</span>]}</span>, Chunk <span class="hljs-subst">{meta[<span class="hljs-string">'chunk'</span>]}</span>"</span>)
        print(<span class="hljs-string">f"Distance: <span class="hljs-subst">{distance}</span>"</span>)
        print(<span class="hljs-string">f"Content: <span class="hljs-subst">{doc}</span>\n"</span>)
</code></pre>
<pre><code class="lang-plaintext">Search Results:
--------------------------------------------------

Result 1:
Source: GreenGrow Innovations_ Company History.docx, Chunk 0
Content: GreenGrow Innovations was founded in 2010 by Sarah Chen and Michael Rodriguez,
two agricultural engineers with a passion for sustainable farming. The company
started in a small garage in Portland, Oregon, with a simple mission: to make
farming more environmentally friendly and efficient. In its early days,
GreenGrow focused on developing smart irrigation systems that could
significantly reduce water usage in agriculture.

Result 2:
Source: GreenGrow Innovations_ Company History.docx, Chunk 4
Content: Despite its growth, GreenGrow remains committed to its original mission of
promoting sustainable farming practices. The company regularly partners with
universities and research institutions to advance the field of agricultural
technology and hosts annual conferences to share knowledge with farmers and
other industry professionals.

Result 3:
Source: GreenGrow Innovations_ Company History.docx, Chunk 3
Content: This system caught the attention of large-scale farmers across the United
States, propelling GreenGrow to national prominence. Today, GreenGrow
Innovations employs over 200 people and has expanded its operations to include
offices in California and Iowa. The company continues to focus on developing
sustainable agricultural technologies, with ongoing projects in vertical
farming, drought-resistant crop development, and AI-powered farm management
systems.
</code></pre>
<p>The results include:</p>
<ul>
<li><p><code>documents</code>: The actual text chunks</p>
</li>
<li><p><code>metadatas</code>: Associated metadata (source, chunk number)</p>
</li>
<li><p><code>distances</code>: Similarity scores (lower is better)</p>
</li>
</ul>
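<p>A lower distance means the chunk's embedding sits closer to the query's embedding. As a toy illustration with made-up 3-dimensional vectors (real embeddings have hundreds of dimensions), Euclidean distance, the family ChromaDB collections use by default, behaves like this:</p>

```python
import math

def l2_distance(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

# Invented embeddings: close_doc points roughly the same way as the query
query_vec = [1.0, 0.0, 0.0]
close_doc = [0.9, 0.1, 0.0]
far_doc = [0.0, 0.2, 1.0]

print(l2_distance(query_vec, close_doc) < l2_distance(query_vec, far_doc))  # True
```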
<h2 id="heading-openai-integration">OpenAI Integration</h2>
<p>Now that we have our document retrieval system in place, let's integrate OpenAI's API to create the generation part of our RAG system. We'll focus on crafting effective prompts and managing the conversation flow.</p>
<h3 id="heading-setting-up-openai">Setting Up OpenAI</h3>
<p>First, let's configure the OpenAI client:</p>
<pre><code class="lang-python"><span class="hljs-keyword">import</span> os
<span class="hljs-keyword">from</span> openai <span class="hljs-keyword">import</span> OpenAI

<span class="hljs-comment"># Set your API key first: OpenAI() reads OPENAI_API_KEY when the client is created</span>
os.environ[<span class="hljs-string">"OPENAI_API_KEY"</span>] = <span class="hljs-string">"your-api-key-here"</span>

<span class="hljs-comment"># Initialize OpenAI client (note: this reuses the name `client`,</span>
<span class="hljs-comment"># shadowing the ChromaDB client created earlier)</span>
client = OpenAI()
</code></pre>
<h3 id="heading-prompt-engineering">Prompt Engineering</h3>
<p>The effectiveness of RAG heavily depends on how we structure our prompts. Let's create a modular prompt system:</p>
<pre><code class="lang-python"><span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">get_prompt</span>(<span class="hljs-params">context: str, conversation_history: str, query: str</span>):</span>
    <span class="hljs-string">"""Generate a prompt combining context, history, and query"""</span>
    prompt = <span class="hljs-string">f"""Based on the following context and conversation history, 
    please provide a relevant and contextual response. If the answer cannot 
    be derived from the context, only use the conversation history or say 
    "I cannot answer this based on the provided information."

    Context from documents:
    <span class="hljs-subst">{context}</span>

    Previous conversation:
    <span class="hljs-subst">{conversation_history}</span>

    Human: <span class="hljs-subst">{query}</span>

    Assistant:"""</span>

    <span class="hljs-keyword">return</span> prompt
</code></pre>
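<p><code>get_prompt</code> expects <code>conversation_history</code> as plain text. One possible way to flatten a list of stored message dicts into that string (<code>format_history</code> is our own helper name, not part of the article's code):</p>

```python
def format_history(messages, max_messages=5):
    """Render the last `max_messages` messages as 'Role: content' lines."""
    recent = messages[-max_messages:]
    return "\n".join(f"{m['role'].capitalize()}: {m['content']}" for m in recent)

# Invented message history in the same shape the memory section stores
history = [
    {"role": "user", "content": "When was GreenGrow founded?"},
    {"role": "assistant", "content": "GreenGrow was founded in 2010."},
]
print(format_history(history))
```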
<h3 id="heading-response-generation">Response Generation</h3>
<p>Let's implement the core response generation function:</p>
<pre><code class="lang-python"><span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">generate_response</span>(<span class="hljs-params">query: str, context: str, conversation_history: str = <span class="hljs-string">""</span></span>):</span>
    <span class="hljs-string">"""Generate a response using OpenAI with conversation history"""</span>
    prompt = get_prompt(context, conversation_history, query)

    <span class="hljs-keyword">try</span>:
        response = client.chat.completions.create(
            model=<span class="hljs-string">"gpt-4"</span>,  <span class="hljs-comment"># or gpt-3.5-turbo for lower cost</span>
            messages=[
                {<span class="hljs-string">"role"</span>: <span class="hljs-string">"system"</span>, <span class="hljs-string">"content"</span>: <span class="hljs-string">"You are a helpful assistant that answers questions based on the provided context."</span>},
                {<span class="hljs-string">"role"</span>: <span class="hljs-string">"user"</span>, <span class="hljs-string">"content"</span>: prompt}
            ],
            temperature=<span class="hljs-number">0</span>,  <span class="hljs-comment"># Lower temperature for more focused responses</span>
            max_tokens=<span class="hljs-number">500</span>
        )
        <span class="hljs-keyword">return</span> response.choices[<span class="hljs-number">0</span>].message.content
    <span class="hljs-keyword">except</span> Exception <span class="hljs-keyword">as</span> e:
        <span class="hljs-keyword">return</span> <span class="hljs-string">f"Error generating response: <span class="hljs-subst">{str(e)}</span>"</span>
</code></pre>
<h2 id="heading-perform-rag-query">Perform RAG query</h2>
<pre><code class="lang-python"><span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">rag_query</span>(<span class="hljs-params">collection, query: str, n_chunks: int = <span class="hljs-number">2</span></span>):</span>
    <span class="hljs-string">"""Perform RAG query: retrieve relevant chunks and generate answer"""</span>
    <span class="hljs-comment"># Get relevant chunks</span>
    results = semantic_search(collection, query, n_chunks)
    context, sources = get_context_with_sources(results)

    <span class="hljs-comment"># Generate response</span>
    response = generate_response(query, context)

    <span class="hljs-keyword">return</span> response, sources
</code></pre>
<pre><code class="lang-python">query = <span class="hljs-string">"When was GreenGrow Innovations founded?"</span>
response, sources = rag_query(collection, query)

<span class="hljs-comment"># Print results</span>
print(<span class="hljs-string">"\nQuery:"</span>, query)
print(<span class="hljs-string">"\nAnswer:"</span>, response)
print(<span class="hljs-string">"\nSources used:"</span>)
<span class="hljs-keyword">for</span> source <span class="hljs-keyword">in</span> sources:
    print(<span class="hljs-string">f"- <span class="hljs-subst">{source}</span>"</span>)
</code></pre>
<pre><code class="lang-plaintext">Query: When was GreenGrow Innovations founded?

Answer: GreenGrow Innovations was founded in 2010.

Sources used:
- GreenGrow Innovations_ Company History.docx (chunk 0)
- GreenGrow Innovations_ Company History.docx (chunk 4)
</code></pre>
<h2 id="heading-building-conversational-memory">Building Conversational Memory</h2>
<p>Conversational memory is crucial for RAG applications to handle follow-up questions and maintain context across interactions. We'll implement a robust system that manages conversation history and enables natural dialogue.</p>
<h3 id="heading-session-management">Session Management</h3>
<p>First, let's implement basic session management to handle multiple conversations:</p>
<pre><code class="lang-python"><span class="hljs-keyword">import</span> uuid
<span class="hljs-keyword">from</span> datetime <span class="hljs-keyword">import</span> datetime
<span class="hljs-keyword">import</span> json

<span class="hljs-comment"># In-memory conversation store</span>
conversations = {}

<span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">create_session</span>():</span>
    <span class="hljs-string">"""Create a new conversation session"""</span>
    session_id = str(uuid.uuid4())
    conversations[session_id] = []
    <span class="hljs-keyword">return</span> session_id
</code></pre>
<h3 id="heading-message-management">Message Management</h3>
<p>Let's implement functions to add and retrieve messages:</p>
<pre><code class="lang-python"><span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">add_message</span>(<span class="hljs-params">session_id: str, role: str, content: str</span>):</span>
    <span class="hljs-string">"""Add a message to the conversation history"""</span>
    <span class="hljs-keyword">if</span> session_id <span class="hljs-keyword">not</span> <span class="hljs-keyword">in</span> conversations:
        conversations[session_id] = []

    conversations[session_id].append({
        <span class="hljs-string">"role"</span>: role,
        <span class="hljs-string">"content"</span>: content,
        <span class="hljs-string">"timestamp"</span>: datetime.now().isoformat()
    })

<span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">get_conversation_history</span>(<span class="hljs-params">session_id: str, max_messages: int = None</span>):</span>
    <span class="hljs-string">"""Get conversation history for a session"""</span>
    <span class="hljs-keyword">if</span> session_id <span class="hljs-keyword">not</span> <span class="hljs-keyword">in</span> conversations:
        <span class="hljs-keyword">return</span> []

    history = conversations[session_id]
    <span class="hljs-keyword">if</span> max_messages:
        history = history[-max_messages:]

    <span class="hljs-keyword">return</span> history
</code></pre>
<h3 id="heading-formatting-conversation-history">Formatting Conversation History</h3>
<p>Create a formatter for the conversation history:</p>
<pre><code class="lang-python"><span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">format_history_for_prompt</span>(<span class="hljs-params">session_id: str, max_messages: int = <span class="hljs-number">5</span></span>):</span>
    <span class="hljs-string">"""Format conversation history for inclusion in prompts"""</span>
    history = get_conversation_history(session_id, max_messages)
    formatted_history = <span class="hljs-string">""</span>

    <span class="hljs-keyword">for</span> msg <span class="hljs-keyword">in</span> history:
        role = <span class="hljs-string">"Human"</span> <span class="hljs-keyword">if</span> msg[<span class="hljs-string">"role"</span>] == <span class="hljs-string">"user"</span> <span class="hljs-keyword">else</span> <span class="hljs-string">"Assistant"</span>
        formatted_history += <span class="hljs-string">f"<span class="hljs-subst">{role}</span>: <span class="hljs-subst">{msg[<span class="hljs-string">'content'</span>]}</span>\n\n"</span>

    <span class="hljs-keyword">return</span> formatted_history.strip()
</code></pre>
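<p>Putting the three helpers together, here is a quick standalone walk-through of a session. The helper definitions are repeated (slightly condensed) so the snippet runs on its own:</p>

```python
import uuid
from datetime import datetime

# In-memory conversation store, same structure as above
conversations = {}

def create_session():
    """Create a new conversation session and return its id."""
    session_id = str(uuid.uuid4())
    conversations[session_id] = []
    return session_id

def add_message(session_id: str, role: str, content: str):
    """Append a timestamped message to the session history."""
    conversations.setdefault(session_id, []).append({
        "role": role,
        "content": content,
        "timestamp": datetime.now().isoformat(),
    })

def format_history_for_prompt(session_id: str, max_messages: int = 5):
    """Render the last max_messages as Human/Assistant turns."""
    history = conversations.get(session_id, [])[-max_messages:]
    return "\n\n".join(
        f"{'Human' if m['role'] == 'user' else 'Assistant'}: {m['content']}"
        for m in history
    )

sid = create_session()
add_message(sid, "user", "When was GreenGrow Innovations founded?")
add_message(sid, "assistant", "GreenGrow Innovations was founded in 2010.")
print(format_history_for_prompt(sid))
# Human: When was GreenGrow Innovations founded?
#
# Assistant: GreenGrow Innovations was founded in 2010.
```

<p>Note that <code>max_messages</code> acts as a sliding window, so very long conversations do not blow up the prompt size.</p>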
<h3 id="heading-query-contextualization">Query Contextualization</h3>
<p>For follow-up questions, we need to contextualize queries based on conversation history:</p>
<p>First question: When was GreenGrow Innovations founded?</p>
<p>Follow-up: Where is it headquartered?</p>
<p>Contextualized query: Where is GreenGrow Innovations headquartered?</p>
<pre><code class="lang-python"><span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">contextualize_query</span>(<span class="hljs-params">query: str, conversation_history: str, client: OpenAI</span>):</span>
    <span class="hljs-string">"""Convert follow-up questions into standalone queries"""</span>
    contextualize_prompt = <span class="hljs-string">"""Given a chat history and the latest user question 
    which might reference context in the chat history, formulate a standalone 
    question which can be understood without the chat history. Do NOT answer 
    the question, just reformulate it if needed and otherwise return it as is."""</span>

    <span class="hljs-keyword">try</span>:
        completion = client.chat.completions.create(
            model=<span class="hljs-string">"gpt-4"</span>,
            messages=[
                {<span class="hljs-string">"role"</span>: <span class="hljs-string">"system"</span>, <span class="hljs-string">"content"</span>: contextualize_prompt},
                {<span class="hljs-string">"role"</span>: <span class="hljs-string">"user"</span>, <span class="hljs-string">"content"</span>: <span class="hljs-string">f"Chat history:\n<span class="hljs-subst">{conversation_history}</span>\n\nQuestion:\n<span class="hljs-subst">{query}</span>"</span>}
            ]
        )
        <span class="hljs-keyword">return</span> completion.choices[<span class="hljs-number">0</span>].message.content
    <span class="hljs-keyword">except</span> Exception <span class="hljs-keyword">as</span> e:
        print(<span class="hljs-string">f"Error contextualizing query: <span class="hljs-subst">{str(e)}</span>"</span>)
        <span class="hljs-keyword">return</span> query  <span class="hljs-comment"># Fallback to original query</span>
</code></pre>
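<p>To see what the rewrite step is asked to do without making an API call, here is a toy stand-in that substitutes the pronoun "it" with the most recent multi-word capitalized entity from the history. The <code>toy_contextualize</code> helper is purely illustrative; in practice the reformulation is delegated to the LLM prompt above:</p>

```python
import re

def toy_contextualize(query: str, conversation_history: str) -> str:
    """Crude pronoun resolution: replace a standalone 'it' with the last
    entity mentioned in the history. Illustration only -- the real
    implementation hands this reasoning to the LLM."""
    # Multi-word capitalized spans, e.g. "GreenGrow Innovations"
    entities = re.findall(r"[A-Z][A-Za-z]+(?: [A-Z][A-Za-z]+)+", conversation_history)
    if entities and re.search(r"\bit\b", query, re.IGNORECASE):
        return re.sub(r"\bit\b", entities[-1], query, count=1, flags=re.IGNORECASE)
    return query

history = ("Human: When was GreenGrow Innovations founded?\n\n"
           "Assistant: GreenGrow Innovations was founded in 2010.")
print(toy_contextualize("Where is it headquartered?", history))
# Where is GreenGrow Innovations headquartered?
```

<p>The LLM version handles far more than pronouns (ellipsis, implicit topics, paraphrase), which is why the production code uses a model call rather than string matching.</p>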
<h2 id="heading-combining-rag-components">Combining RAG Components</h2>
<pre><code class="lang-python"><span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">get_prompt</span>(<span class="hljs-params">context, conversation_history, query</span>):</span>
  prompt = <span class="hljs-string">f"""Based on the following context and conversation history, please provide a relevant and contextual response.
    If the answer cannot be derived from the context, use only the conversation history, or say "I cannot answer this based on the provided information."

    Context from documents:
    <span class="hljs-subst">{context}</span>

    Previous conversation:
    <span class="hljs-subst">{conversation_history}</span>

    Human: <span class="hljs-subst">{query}</span>

    Assistant:"""</span>
  <span class="hljs-keyword">return</span> prompt
</code></pre>
<pre><code class="lang-python"><span class="hljs-comment"># Updated generate response function with conversation history also passed for Chatbot Memory</span>
<span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">generate_response</span>(<span class="hljs-params">query: str, context: str, conversation_history: str = <span class="hljs-string">""</span></span>):</span>
    <span class="hljs-string">"""Generate a response using OpenAI with conversation history"""</span>
    prompt = get_prompt(context, conversation_history, query)
    <span class="hljs-comment"># print(prompt)</span>

    response = client.chat.completions.create(
        model=<span class="hljs-string">"gpt-4o"</span>,
        messages=[
            {<span class="hljs-string">"role"</span>: <span class="hljs-string">"system"</span>, <span class="hljs-string">"content"</span>: prompt}
        ],
        temperature=<span class="hljs-number">0</span>,
        max_tokens=<span class="hljs-number">500</span>
    )

    <span class="hljs-keyword">return</span> response.choices[<span class="hljs-number">0</span>].message.content
</code></pre>
<p>Now let's create the main RAG query function that combines everything:</p>
<pre><code class="lang-python"><span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">conversational_rag_query</span>(<span class="hljs-params">
    collection,
    query: str,
    session_id: str,
    n_chunks: int = <span class="hljs-number">3</span>
</span>):</span>
    <span class="hljs-string">"""Perform RAG query with conversation history"""</span>
    <span class="hljs-comment"># Get conversation history</span>
    conversation_history = format_history_for_prompt(session_id)

    <span class="hljs-comment"># Handle follow-up questions</span>
    query = contextualize_query(query, conversation_history, client)
    print(<span class="hljs-string">"Contextualized Query:"</span>, query)

    <span class="hljs-comment"># Get relevant chunks</span>
    context, sources = get_context_with_sources(
        semantic_search(collection, query, n_chunks)
    )
    print(<span class="hljs-string">"Context:"</span>, context)
    print(<span class="hljs-string">"Sources:"</span>, sources)


    response = generate_response(query, context, conversation_history)

    <span class="hljs-comment"># Add to conversation history</span>
    add_message(session_id, <span class="hljs-string">"user"</span>, query)
    add_message(session_id, <span class="hljs-string">"assistant"</span>, response)

    <span class="hljs-keyword">return</span> response, sources
</code></pre>
<h3 id="heading-example-usage">Example Usage</h3>
<p>Here's how to use the complete system:</p>
<pre><code class="lang-python"><span class="hljs-comment"># Create a new conversation session</span>
session_id = create_session()

<span class="hljs-comment"># First question</span>
query = <span class="hljs-string">"When was GreenGrow Innovations founded?"</span>
response, sources = conversational_rag_query(
            collection,
            query,
            session_id
)
print(response)
</code></pre>
<p>Example output:</p>
<pre><code class="lang-plaintext">Contextualized Query: When did GreenGrow Innovations come into existence?
Context: GreenGrow Innovations was founded in 2010 by .....
Sources: ['GreenGrow Innovations_ Company History.docx (chunk 0)', 'GreenGrow Innovations_ Company History.docx (chunk 3)', 'GreenGrow Innovations_ Company History.docx (chunk 4)']
GreenGrow Innovations was founded in 2010.
</code></pre>
<pre><code class="lang-python">query = <span class="hljs-string">"Where is it located?"</span>
response, sources = conversational_rag_query(
            collection,
            query,
            session_id
)
print(response)
</code></pre>
<pre><code class="lang-plaintext">Contextualized Query: What is the location of GreenGrow Innovations?
Context: GreenGrow Innovations was founded in 2010 by Sarah Chen and Michael Rodriguez, two agricultural engineers with a passion for sustainable farming. The company started in a small garage in Portland, Oregon, with a simple mission......
Sources: ['GreenGrow Innovations_ Company History.docx (chunk 0)', 'GreenGrow Innovations_ Company History.docx (chunk 3)', 'GreenGrow Innovations_ Company History.docx (chunk 4)']
GreenGrow Innovations started in Portland, Oregon, and has since expanded its operations to include offices in California and Iowa.
</code></pre>
<h2 id="heading-conclusion">Conclusion</h2>
<p>This tutorial showed how to build RAG without LangChain or LlamaIndex when you need direct control over your implementation: processing documents, performing semantic search, and handling multi-turn conversations using just ChromaDB and OpenAI's API. All the code is production-ready and debuggable – no black boxes.</p>
<h3 id="heading-explore-how-we-can-help-your-business"><strong>Explore How We Can Help Your Business:</strong></h3>
<ul>
<li><p><strong>Case Studies</strong>: Visit <a target="_blank" href="https://www.futuresmart.ai/case-studies"><strong>FutureSmart AI Case Studies</strong></a> to see real-world applications of our NLP solutions</p>
</li>
<li><p><strong>Contact Us</strong>: Have a project in mind? Reach out at <a target="_blank" href="mailto:contact@futuresmart.ai"><strong>contact@futuresmart.ai</strong></a></p>
</li>
</ul>
<p>Let us help you transform your business with custom AI solutions tailored to your specific needs.</p>
<p>Code: <a target="_blank" href="https://github.com/PradipNichite/Youtube-Tutorials/tree/main/Vanilla%20RAG">https://github.com/PradipNichite/Youtube-Tutorials/tree/main/Vanilla%20RAG</a></p>
<div class="embed-wrapper"><div class="embed-loading"><div class="loadingRow"></div><div class="loadingRow"></div></div><a class="embed-card" href="https://youtu.be/8sSHg1034r0">https://youtu.be/8sSHg1034r0</a></div>
]]></content:encoded></item><item><title><![CDATA[OpenAI Swarm: A Hands-On Introduction to Multi-Agent Orchestration]]></title><description><![CDATA[Introduction to OpenAI Swarm
Multi-agent orchestration is a trending topic in the field of large language models (LLMs). Instead of relying on a single, all-encompassing LLM, multi-agent systems employ a team of specialized agents, each designed to e...]]></description><link>https://blog.futuresmart.ai/openai-swarm-a-hands-on-introduction</link><guid isPermaLink="true">https://blog.futuresmart.ai/openai-swarm-a-hands-on-introduction</guid><category><![CDATA[open ai swarm]]></category><category><![CDATA[AI]]></category><category><![CDATA[openai]]></category><category><![CDATA[ai agents]]></category><dc:creator><![CDATA[Rounak Show]]></dc:creator><pubDate>Sat, 26 Oct 2024 06:17:31 GMT</pubDate><enclosure url="https://cdn.hashnode.com/res/hashnode/image/upload/v1729582347844/2f26aa27-33b2-4f32-bcc2-2e7ab00860eb.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h3 id="heading-introduction-to-openai-swarm">Introduction to OpenAI Swarm</h3>
<p>Multi-agent orchestration is a trending topic in the field of large language models (LLMs). <strong>Instead of relying on a single, all-encompassing LLM, multi-agent systems employ a team of specialized agents, each designed to excel at a particular task.</strong> This approach allows for more complex and nuanced problem-solving, as agents can collaborate, share information, and leverage their individual strengths. <a target="_blank" href="https://github.com/openai/swarm">OpenAI Swarm</a> is an experimental framework designed to make multi-agent orchestration more accessible and user-friendly. <strong>Swarm is built on a practical, lightweight approach, prioritizing ease of use and clear, intuitive interactions between agents.</strong></p>
<p>This blog post will provide a hands-on introduction to OpenAI Swarm, with a strong emphasis on practical examples using Python code. You will learn how to set up Swarm, create agents, implement handoffs, and build a simple <strong>multi-agent system</strong> along with <a target="_blank" href="https://blog.futuresmart.ai/harnessing-langchain-and-google-apis">LangChain</a> integration. By the end, you'll have a solid understanding of how to leverage Swarm to build your own multi-agent LLM applications.</p>
<h3 id="heading-setting-up-openai-swarm">Setting Up OpenAI Swarm</h3>
<p>To get started with OpenAI Swarm, you first need to install it. Make sure you have Python 3.10 or later installed on your system. You can install Swarm directly from the GitHub repository using the following command:</p>
<pre><code class="lang-bash">pip install git+ssh://git@github.com/openai/swarm.git
</code></pre>
<p>Or:</p>
<pre><code class="lang-bash">pip install git+https://github.com/openai/swarm.git
</code></pre>
<p>Next, you need to set up your OpenAI API key. This key allows Swarm to access OpenAI’s language models and other services. You can find your API key on the <a target="_blank" href="https://platform.openai.com/docs/api-reference/authentication">OpenAI website</a>. Once you have your key, you can set it as an environment variable:</p>
<pre><code class="lang-python"><span class="hljs-keyword">import</span> os
os.environ[<span class="hljs-string">'OPENAI_API_KEY'</span>] = <span class="hljs-string">'YOUR_OPENAI_API_KEY'</span>
</code></pre>
<p>Replace ‘YOUR_OPENAI_API_KEY’ with your actual API key. With Swarm installed and your API key set up, you are ready to start building multi-agent systems!</p>
<h3 id="heading-understanding-agents-and-handoffs-in-openai-swarm">Understanding Agents and Handoffs in OpenAI Swarm</h3>
<p>In OpenAI Swarm, <strong>agents are the core building blocks of a multi-agent system</strong>. They encapsulate a set of instructions, functions, and the ability to hand off execution to other agents. <strong>Think of agents as specialized units, each responsible for a specific aspect of a larger task</strong>. For example, in a travel planning system, you could have separate agents for booking flights, hotels, and transportation.</p>
<h3 id="heading-creating-agents-and-handoffs-in-python">Creating Agents and Handoffs in Python</h3>
<p>You can create agents in Python using the <code>Agent</code> class from the <code>swarm</code> module. Here’s a basic example:</p>
<pre><code class="lang-python"><span class="hljs-keyword">from</span> swarm <span class="hljs-keyword">import</span> Swarm, Agent

<span class="hljs-comment"># creating handoffs functions</span>
<span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">handoff_to_weather_agent</span>():</span>
    <span class="hljs-string">"""Transfer to the weather agent for weather queries."""</span>
    print(<span class="hljs-string">"Handing off to Weather Agent"</span>)
    <span class="hljs-keyword">return</span> weather_agent

<span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">handoff_to_math_agent</span>():</span>
    <span class="hljs-string">"""Transfer to the math agent for mathematical queries."""</span>
    print(<span class="hljs-string">"Handing off to Math Agent"</span>)
    <span class="hljs-keyword">return</span> math_agent

<span class="hljs-comment"># Initialize the agents with specific roles</span>
math_agent = Agent(
    name=<span class="hljs-string">"Math Agent"</span>,
    instructions=<span class="hljs-string">"You handle only mathematical queries."</span>,
    functions=[handoff_to_weather_agent]
)

weather_agent = Agent(
    name=<span class="hljs-string">"Weather Agent"</span>,
    instructions=<span class="hljs-string">"You handle only weather-related queries."</span>,
    functions=[handoff_to_math_agent]
)
</code></pre>
<p>In this example, two agents (<code>math_agent</code> and <code>weather_agent</code>) are created. Each agent has instructions specifying the types of queries they handle. Additionally, each agent has a handoff function, allowing them to delegate queries outside their area of expertise.</p>
<h3 id="heading-handoffs-between-agents">Handoffs Between Agents</h3>
<p>Let’s see how these agents transfer control using the handoff mechanism. Here’s how the process works for mathematical and weather-related queries:</p>
<pre><code class="lang-python"><span class="hljs-comment"># Initialize the Swarm client</span>
client = Swarm()

<span class="hljs-comment"># Test handoff by asking a math question to the weather agent</span>
messages = [{<span class="hljs-string">"role"</span>: <span class="hljs-string">"user"</span>, <span class="hljs-string">"content"</span>: <span class="hljs-string">"What is 2+2?"</span>}]
handoff_response = client.run(agent=weather_agent, messages=messages)
print(handoff_response.messages[<span class="hljs-number">-1</span>][<span class="hljs-string">"content"</span>])

<span class="hljs-comment"># Response:</span>
<span class="hljs-comment"># Handing off to Math Agent</span>
<span class="hljs-comment"># The answer to 2 + 2 is 4.</span>
</code></pre>
<p>In this example, we intentionally send a math query (<code>"What is 2+2?"</code>) to the <code>weather_agent</code>. The <code>weather_agent</code> detects this and hands off control to the <code>math_agent</code>, which provides the correct answer.</p>
<h3 id="heading-handling-a-weather-query">Handling a Weather Query</h3>
<p>Now, let’s send a weather-related query to the <code>math_agent</code> and observe the handoff:</p>
<pre><code class="lang-python">messages = [{<span class="hljs-string">"role"</span>: <span class="hljs-string">"user"</span>, <span class="hljs-string">"content"</span>: <span class="hljs-string">"How is the weather in Canada in December?"</span>}]
response = client.run(agent=math_agent, messages=messages)
print(response.messages[<span class="hljs-number">-1</span>][<span class="hljs-string">"content"</span>])

<span class="hljs-comment"># Response:</span>
<span class="hljs-comment"># Handing off to Weather Agent</span>
<span class="hljs-comment"># Canada generally experiences cold weather during December, with varying conditions depending on the region. Coastal areas like Vancouver may have milder temperatures, while areas like Toronto, Ottawa, and Montreal see cooler, snowy conditions. Northern regions experience extreme cold. Checking a reliable weather service will provide the most accurate information.</span>
</code></pre>
<p>Here, the <code>math_agent</code> receives a weather-related query and hands it off to the <code>weather_agent</code>, which provides an overview of Canada’s typical December weather.</p>
<p>This example illustrates how agents can seamlessly collaborate to handle different types of queries, ensuring a flexible, adaptive multi-agent system.</p>
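<p>Conceptually, the handoff pattern that Swarm automates can be sketched in a few lines of plain Python. The names below are toy ones and there are no LLM calls; in Swarm itself, the model decides when to invoke the handoff function:</p>

```python
class ToyAgent:
    """Minimal illustration of agent handoff: a function returning
    another agent causes the runner to re-dispatch the query."""

    def __init__(self, name, can_handle, answer, handoff_to=None):
        self.name = name
        self.can_handle = can_handle   # predicate: is this query in my domain?
        self.answer = answer           # how this agent replies
        self.handoff_to = handoff_to   # callable returning the other agent

    def run(self, query):
        if self.can_handle(query):
            return f"[{self.name}] {self.answer(query)}"
        # Not my domain: hand off, like returning an Agent from a Swarm function
        print(f"Handing off from {self.name}")
        return self.handoff_to().run(query)

math_agent = ToyAgent(
    "Math Agent",
    can_handle=lambda q: any(ch.isdigit() for ch in q),
    answer=lambda q: "computing the result...",
    handoff_to=lambda: weather_agent,   # lazy reference, defined below
)
weather_agent = ToyAgent(
    "Weather Agent",
    can_handle=lambda q: "weather" in q.lower(),
    answer=lambda q: "check a forecast service",
    handoff_to=lambda: math_agent,
)

print(weather_agent.run("What is 2+2?"))
# Handing off from Weather Agent
# [Math Agent] computing the result...
```

<p>The difference in Swarm is that routing is not a hard-coded predicate: the LLM reads each agent's instructions and chooses whether to answer or to call a handoff function.</p>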
<h3 id="heading-creating-and-orchestrating-rag-and-nl2sql-agents-with-openai-swarm">Creating and Orchestrating RAG and NL2SQL Agents with OpenAI Swarm</h3>
<p>In this section, we will demonstrate how to set up two key agents: a <strong>RAG Agent</strong> for retrieval-augmented generation (RAG) and an <strong>NL2SQL Agent</strong> for querying a SQL database. These agents will collaborate to handle user queries based on their nature, either by retrieving relevant information from documents or by generating SQL queries to fetch results from a database. Finally, we will orchestrate the agents using a <strong>Central Agent</strong> that determines which agent should handle the query.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1730350299698/b08da49f-c529-45ed-a458-d301eec66fb8.png" alt class="image--center mx-auto" /></p>
<h4 id="heading-step-1-installing-the-required-packages">Step 1: Installing the Required Packages</h4>
<p>Before getting started, make sure you have installed the necessary Python packages. You can install them using the following command:</p>
<pre><code class="lang-bash">pip install langchain langchain-chroma langchain-openai langchain-community pypdf sentence-transformers
</code></pre>
<h4 id="heading-step-2-loading-and-splitting-documents-for-the-rag-agent">Step 2: Loading and Splitting Documents for the RAG Agent</h4>
<p>To create a RAG agent, you first need to load documents that contain the knowledge base. In this example, we load <code>.pdf</code> and <code>.docx</code> files from a folder and split them into smaller chunks for efficient retrieval.</p>
<pre><code class="lang-python"><span class="hljs-keyword">from</span> langchain_community.document_loaders <span class="hljs-keyword">import</span> PyPDFLoader, Docx2txtLoader
<span class="hljs-keyword">from</span> langchain_text_splitters <span class="hljs-keyword">import</span> RecursiveCharacterTextSplitter
<span class="hljs-keyword">from</span> langchain_core.documents <span class="hljs-keyword">import</span> Document
<span class="hljs-keyword">import</span> os
<span class="hljs-keyword">from</span> typing <span class="hljs-keyword">import</span> List

<span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">load_documents</span>(<span class="hljs-params">folder_path: str</span>) -&gt; List[Document]:</span>
    documents = []
    <span class="hljs-keyword">for</span> filename <span class="hljs-keyword">in</span> os.listdir(folder_path):
        file_path = os.path.join(folder_path, filename)
        <span class="hljs-keyword">if</span> filename.endswith(<span class="hljs-string">'.pdf'</span>):
            loader = PyPDFLoader(file_path)
        <span class="hljs-keyword">elif</span> filename.endswith(<span class="hljs-string">'.docx'</span>):
            loader = Docx2txtLoader(file_path)
        <span class="hljs-keyword">else</span>:
            print(<span class="hljs-string">f"Unsupported file type: <span class="hljs-subst">{filename}</span>"</span>)
            <span class="hljs-keyword">continue</span>
        documents.extend(loader.load())
    <span class="hljs-keyword">return</span> documents

<span class="hljs-comment"># Load documents from a folder</span>
folder_path = <span class="hljs-string">"/content/docs"</span>
documents = load_documents(folder_path)
print(<span class="hljs-string">f"Loaded <span class="hljs-subst">{len(documents)}</span> documents from the folder."</span>)

<span class="hljs-comment"># Split documents into chunks</span>
text_splitter = RecursiveCharacterTextSplitter(chunk_size=<span class="hljs-number">1000</span>, chunk_overlap=<span class="hljs-number">200</span>)
splits = text_splitter.split_documents(documents)
print(<span class="hljs-string">f"Split the documents into <span class="hljs-subst">{len(splits)}</span> chunks."</span>)
</code></pre>
<h4 id="heading-step-3-creating-and-persisting-a-vector-store">Step 3: Creating and Persisting a Vector Store</h4>
<p>Next, we create a vector store using the <code>SentenceTransformerEmbeddings</code> to embed the document chunks for efficient retrieval.</p>
<pre><code class="lang-python"><span class="hljs-keyword">from</span> langchain_community.embeddings.sentence_transformer <span class="hljs-keyword">import</span> SentenceTransformerEmbeddings
<span class="hljs-keyword">from</span> langchain_chroma <span class="hljs-keyword">import</span> Chroma

embedding_function = SentenceTransformerEmbeddings(model_name=<span class="hljs-string">"all-MiniLM-L6-v2"</span>)
collection_name = <span class="hljs-string">"my_collection"</span>

vectorstore = Chroma.from_documents(
    collection_name=collection_name,
    documents=splits,
    embedding=embedding_function,
    persist_directory=<span class="hljs-string">"./chroma_db"</span>
)
print(<span class="hljs-string">"Vector store created and persisted to './chroma_db'"</span>)
</code></pre>
<h4 id="heading-step-4-creating-the-rag-agent">Step 4: Creating the RAG Agent</h4>
<p>We now set up the <strong>RAG Agent</strong>, which retrieves relevant document chunks and generates answers to user queries based on the retrieved context. Here's how the retrieval and generation are done:</p>
<pre><code class="lang-python"><span class="hljs-keyword">from</span> langchain_core.prompts <span class="hljs-keyword">import</span> ChatPromptTemplate
<span class="hljs-keyword">from</span> langchain.schema.runnable <span class="hljs-keyword">import</span> RunnablePassthrough
<span class="hljs-keyword">from</span> langchain_core.output_parsers <span class="hljs-keyword">import</span> StrOutputParser
<span class="hljs-keyword">from</span> langchain_openai <span class="hljs-keyword">import</span> ChatOpenAI

llm = ChatOpenAI(model=<span class="hljs-string">"gpt-4o-mini"</span>)
<span class="hljs-comment"># Expose the vector store as a retriever (used in rag_chain below)</span>
retriever = vectorstore.as_retriever()

<span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">retrieve_and_generate</span>(<span class="hljs-params">question</span>):</span>
    print(<span class="hljs-string">"Calling retrieve_and_generate"</span>)
    template = <span class="hljs-string">"""Answer the question based only on the following context:
    {context}
    Question: {question}
    Answer: """</span>

    prompt = ChatPromptTemplate.from_template(template)

    <span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">docs2str</span>(<span class="hljs-params">docs</span>):</span>
        <span class="hljs-keyword">return</span> <span class="hljs-string">"\n\n"</span>.join(doc.page_content <span class="hljs-keyword">for</span> doc <span class="hljs-keyword">in</span> docs)

    rag_chain = (
        {<span class="hljs-string">"context"</span>: retriever | docs2str, <span class="hljs-string">"question"</span>: RunnablePassthrough()}
        | prompt
        | llm
        | StrOutputParser()
    )

    response = rag_chain.invoke(question)
    <span class="hljs-keyword">return</span> response
</code></pre>
<blockquote>
<p>If you want to dive even deeper into practical implementations of Langchain and retrieval-augmented generation (RAG), don’t miss our detailed video tutorial on <strong>Langchain RAG Course: From Basics to Production-Ready RAG Chatbot</strong>. It provides step-by-step guidance for taking a Langchain project from concept to production.</p>
<div class="embed-wrapper"><div class="embed-loading"><div class="loadingRow"></div><div class="loadingRow"></div></div><a class="embed-card" href="https://youtu.be/38aMTXY2usU?si=PF1ltIPCXeNzn3Nv">https://youtu.be/38aMTXY2usU?si=PF1ltIPCXeNzn3Nv</a></div>
<p> </p>
</blockquote>
<h4 id="heading-step-5-setting-up-the-nl2sql-agent">Step 5: Setting up the NL2SQL Agent</h4>
<p>Next, we create an <strong>NL2SQL Agent</strong> that handles natural language queries and converts them into SQL queries, executing them on a database.</p>
<ol>
<li><p>Download and load the Chinook SQLite database:</p>
<pre><code class="lang-bash"> !wget https://github.com/lerocha/chinook-database/raw/master/ChinookDatabase/DataSources/Chinook_Sqlite.sqlite
 !mv Chinook_Sqlite.sqlite Chinook.db
</code></pre>
</li>
<li><p><strong>Connect to the Database</strong>: Use the <code>SQLDatabase</code> utility from <code>langchain_community</code> to create a connection to <code>Chinook.db</code>.</p>
<pre><code class="lang-python"> <span class="hljs-keyword">from</span> langchain_community.utilities <span class="hljs-keyword">import</span> SQLDatabase
 db = SQLDatabase.from_uri(<span class="hljs-string">"sqlite:///Chinook.db"</span>)
</code></pre>
</li>
<li><p><strong>Setup Environment Variables</strong> for <a target="_blank" href="https://blog.futuresmart.ai/guide-to-langsmith">Langsmith Tracing</a></p>
<pre><code class="lang-python"> <span class="hljs-keyword">import</span> os
 os.environ[<span class="hljs-string">"LANGCHAIN_TRACING_V2"</span>] = <span class="hljs-string">"true"</span>
 os.environ[<span class="hljs-string">"LANGCHAIN_API_KEY"</span>] = <span class="hljs-string">"your_langchain_api_key"</span>
 os.environ[<span class="hljs-string">"LANGCHAIN_PROJECT"</span>] = <span class="hljs-string">"openai-swarm"</span>
</code></pre>
</li>
<li><p><strong>Clean SQL Query Function:</strong> Define a helper function to clean up SQL queries by removing unnecessary markdown syntax. This is useful when you need to process queries embedded in code blocks.</p>
<pre><code class="lang-python"> <span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">clean_sql_query</span>(<span class="hljs-params">markdown_query</span>):</span>
     <span class="hljs-comment"># Split the query into lines</span>
     lines = markdown_query.strip().split(<span class="hljs-string">'\n'</span>)

     <span class="hljs-comment"># Remove markdown syntax lines</span>
     cleaned_lines = []
     <span class="hljs-keyword">for</span> line <span class="hljs-keyword">in</span> lines:
         <span class="hljs-comment"># Skip lines that only contain backticks and optional language identifier</span>
         <span class="hljs-keyword">if</span> line.strip().startswith(<span class="hljs-string">'```'</span>) <span class="hljs-keyword">or</span> line.strip() == <span class="hljs-string">'sql'</span>:
             <span class="hljs-keyword">continue</span>
         cleaned_lines.append(line)

     <span class="hljs-comment"># Join the remaining lines and clean up extra whitespace</span>
     cleaned_query = <span class="hljs-string">' '</span>.join(cleaned_lines).strip()

     <span class="hljs-comment"># Remove any remaining backticks</span>
     cleaned_query = cleaned_query.replace(<span class="hljs-string">'`'</span>, <span class="hljs-string">''</span>)

     <span class="hljs-comment"># Ensure semicolon at the end if not present</span>
     <span class="hljs-keyword">if</span> <span class="hljs-keyword">not</span> cleaned_query.strip().endswith(<span class="hljs-string">';'</span>):
         cleaned_query += <span class="hljs-string">';'</span>

     <span class="hljs-keyword">return</span> cleaned_query

 <span class="hljs-comment"># Example usage</span>
 markdown_query = <span class="hljs-string">'''```sql
 SELECT * FROM table;
 ```'''</span>

 cleaned_query = clean_sql_query(markdown_query)
 print(cleaned_query)

 <span class="hljs-comment"># Cleaned Query</span>
 <span class="hljs-comment"># SELECT * FROM table;</span>
</code></pre>
</li>
<li><p><strong>Set up the SQL Query Chain:</strong> Now, define the core logic of your SQL generation and execution process. This involves connecting LangChain components and defining a template to format the response based on the query results.</p>
<pre><code class="lang-python"> <span class="hljs-keyword">from</span> langchain_core.prompts <span class="hljs-keyword">import</span> ChatPromptTemplate
 <span class="hljs-keyword">from</span> langchain.chains <span class="hljs-keyword">import</span> create_sql_query_chain
 <span class="hljs-keyword">from</span> langchain_community.tools.sql_database.tool <span class="hljs-keyword">import</span> QuerySQLDataBaseTool
 <span class="hljs-keyword">from</span> operator <span class="hljs-keyword">import</span> itemgetter
 <span class="hljs-keyword">import</span> re
 <span class="hljs-keyword">from</span> langchain_core.output_parsers <span class="hljs-keyword">import</span> StrOutputParser
 <span class="hljs-keyword">from</span> langchain_core.prompts <span class="hljs-keyword">import</span> PromptTemplate
 <span class="hljs-keyword">from</span> langchain_core.runnables <span class="hljs-keyword">import</span> RunnablePassthrough, RunnableLambda
 <span class="hljs-keyword">from</span> langchain_openai <span class="hljs-keyword">import</span> ChatOpenAI

 sql_prompt = ChatPromptTemplate.from_messages(
     [
        (<span class="hljs-string">"system"</span>, <span class="hljs-string">"You are a SQLite expert. Given an input question, create a syntactically correct SQL query to run, unless otherwise specified.\n\nHere is the relevant table info: {table_info}\n\nUse at most {top_k} rows"</span>),
         (<span class="hljs-string">"human"</span>, <span class="hljs-string">"{input}"</span>),
     ]
 )
 llm = ChatOpenAI(model=<span class="hljs-string">"gpt-4o"</span>)

 <span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">sql_response_gen</span>(<span class="hljs-params">question</span>):</span>
   print(<span class="hljs-string">"Calling sql_response_gen"</span>)
   <span class="hljs-comment"># remove_code_block_syntax = lambda text: re.sub(r"```(sql|)\s*(.*?)\s*```", r"\2", text, flags=re.DOTALL)</span>
   execute_query = QuerySQLDataBaseTool(db=db)
   write_query = create_sql_query_chain(llm, db,sql_prompt)

   answer_prompt = PromptTemplate.from_template(
       <span class="hljs-string">"""Given the following user question, corresponding SQL query, and SQL result, answer the user question.

   Question: {question}
   SQL Query: {query}
   SQL Result: {result}
   Answer: """</span>
   )

   chain = (
       RunnablePassthrough.assign(query=write_query | RunnableLambda(clean_sql_query)).assign(
           result=itemgetter(<span class="hljs-string">"query"</span>) | execute_query
       )
       | answer_prompt
       | llm
       | StrOutputParser()
   )

   response = chain.invoke({<span class="hljs-string">"question"</span>: question})
   <span class="hljs-keyword">return</span> response
</code></pre>
</li>
</ol>
<p>Let’s try the NL2SQL function</p>
<pre><code class="lang-python">question = <span class="hljs-string">"How many customers are there?"</span>
result = sql_response_gen(question)

print(<span class="hljs-string">f"Question: <span class="hljs-subst">{question}</span>"</span>)
print(<span class="hljs-string">f"Answer: <span class="hljs-subst">{result}</span>"</span>)

<span class="hljs-comment">## Calling sql_response_gen</span>
<span class="hljs-comment">## Question: How many customers are there?</span>
<span class="hljs-comment">## Answer: There are 59 customers.</span>
</code></pre>
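<p>As an aside, the commented-out regex inside <code>sql_response_gen</code> hints at a more compact way to strip the markdown fences. The sketch below is my own variant (not from the original notebook) and assumes the model returns at most one fenced block:</p>

```python
import re

def clean_sql_query_re(markdown_query: str) -> str:
    """Regex-based alternative to clean_sql_query (illustrative sketch)."""
    # Drop the triple-backtick fences (with optional "sql" tag), keep the query inside
    query = re.sub(r"`{3}(?:sql)?\s*(.*?)\s*`{3}", r"\1", markdown_query, flags=re.DOTALL)
    # Remove stray backticks and collapse all whitespace onto one line
    query = " ".join(query.replace("`", "").split())
    # Ensure a trailing semicolon, matching the original helper's behaviour
    return query if query.endswith(";") else query + ";"

print(clean_sql_query_re("```sql\nSELECT * FROM table;\n```"))
# SELECT * FROM table;
```

<p>Either helper can be plugged into the chain via <code>RunnableLambda</code>; the line-based version is easier to extend, while the regex version is shorter.</p>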
<blockquote>
<p>If you want to watch a detailed video on building NL2SQL chatbots, then watch this video:</p>
<div class="embed-wrapper"><div class="embed-loading"><div class="loadingRow"></div><div class="loadingRow"></div></div><a class="embed-card" href="https://www.youtube.com/watch?v=fss6CrmQU2Y&amp;t=1s">https://www.youtube.com/watch?v=fss6CrmQU2Y&amp;t=1s</a></div>
<p> </p>
</blockquote>
<h4 id="heading-step-6-orchestrating-the-agents-with-a-central-agent">Step 6: Orchestrating the Agents with a Central Agent</h4>
<p>Now that we have both the RAG and NL2SQL tools, we can create the RAG and NL2SQL agents, as well as the <strong>Central Agent</strong> that determines which agent should handle the user's query:</p>
<pre><code class="lang-python"><span class="hljs-keyword">from</span> swarm <span class="hljs-keyword">import</span> Swarm, Agent

<span class="hljs-comment"># Define the RAG and NL2SQL agents</span>
rag_agent = Agent(
    name=<span class="hljs-string">"RAG Agent"</span>,
    instructions=<span class="hljs-string">"You retrieve relevant information from the company's knowledge base and generate responses to general queries about the company."</span>,
    functions=[retrieve_and_generate]
)

nl2sql_agent = Agent(
    name=<span class="hljs-string">"NL2SQL Agent"</span>,
    instructions=<span class="hljs-string">"You handle database queries."</span>,
    functions=[sql_response_gen]
)

<span class="hljs-comment"># Define the Central Agent</span>
central_agent = Agent(
    name=<span class="hljs-string">"Central Agent"</span>,
    instructions=<span class="hljs-string">"Determine if the query is about general company information (RAG) or a database query (NL2SQL), and route the query accordingly."</span>
)

<span class="hljs-comment"># Define handoff functions</span>
<span class="hljs-comment"># Note: the docstring must be the first statement in the function body</span>
<span class="hljs-comment"># so that Swarm can expose it as the tool description.</span>
<span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">transfer_to_nl2sql</span>():</span>
    <span class="hljs-string">"""Transfer the task to the NL2SQL Agent for database queries."""</span>
    print(<span class="hljs-string">"Handing off to the NL2SQL Agent."</span>)
    <span class="hljs-keyword">return</span> nl2sql_agent

<span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">transfer_to_rag</span>():</span>
    <span class="hljs-string">"""Transfer the task to the RAG Agent for general queries."""</span>
    print(<span class="hljs-string">"Handing off to the RAG agent."</span>)
    <span class="hljs-keyword">return</span> rag_agent

<span class="hljs-comment"># Attach the handoff functions to the central agent</span>
central_agent.functions = [transfer_to_nl2sql, transfer_to_rag]
</code></pre>
<h4 id="heading-step-7-running-the-central-agent">Step 7: Running the Central Agent</h4>
<p>Finally, we test the <strong>Central Agent</strong> with different user queries to see how it delegates tasks to the appropriate agent:</p>
<pre><code class="lang-python">client = Swarm()

<span class="hljs-comment"># Example 1: Asking about the company</span>
print(<span class="hljs-string">"\n--- Example 1: Asking about the company ---"</span>)
messages = [{<span class="hljs-string">"role"</span>: <span class="hljs-string">"user"</span>, <span class="hljs-string">"content"</span>: <span class="hljs-string">"What does Futuresmart AI offer?"</span>}]
response = client.run(agent=central_agent, messages=messages)
<span class="hljs-keyword">if</span> isinstance(response, Agent):
    selected_agent = response
    result = selected_agent.functions
    print(result)
<span class="hljs-keyword">else</span>:
    print(response.messages[<span class="hljs-number">-1</span>][<span class="hljs-string">"content"</span>])

<span class="hljs-comment"># Example 2: SQL query about employees</span>
print(<span class="hljs-string">"\n--- Example 2: Asking from the SQL DB ---"</span>)
messages = [{<span class="hljs-string">"role"</span>: <span class="hljs-string">"user"</span>, <span class="hljs-string">"content"</span>: <span class="hljs-string">"How many employees are there in the database?"</span>}]
response = client.run(agent=central_agent, messages=messages)
<span class="hljs-keyword">if</span> isinstance(response, Agent):
    selected_agent = response
    result = selected_agent.functions
    print(result)
<span class="hljs-keyword">else</span>:
    print(response.messages[<span class="hljs-number">-1</span>][<span class="hljs-string">"content"</span>])
</code></pre>
<p>Output:</p>
<pre><code class="lang-python">--- Example <span class="hljs-number">1</span>: Asking about the company ---
Handing off to the RAG agent.
Calling retrieve_and_generate
FutureSmart AI offers a range of services including customized speech-to-text services, Natural Language Processing (NLP) solutions, text classification, <span class="hljs-keyword">and</span> the creation of custom chatbots. These services are designed to enhance productivity, accessibility, decision-making processes, <span class="hljs-keyword">and</span> operational efficiency through advanced technologies <span class="hljs-keyword">and</span> tailored solutions.

--- Example <span class="hljs-number">2</span>: Asking <span class="hljs-keyword">from</span> the SQL DB ---
Handing off to the NL2SQL Agent.
Calling sql_response_gen
There are <span class="hljs-number">8</span> employees <span class="hljs-keyword">in</span> the database.
</code></pre>
<p>Find all the Python Code Here: <a target="_blank" href="https://github.com/PradipNichite/Youtube-Tutorials/tree/main/OpenAI%20Swarm%20Tutorial">OpenAI Swarm Notebook</a></p>
<h3 id="heading-conclusion">Conclusion</h3>
<p><strong>OpenAI Swarm</strong> represents a valuable contribution to the growing field of multi-agent LLM frameworks. Its lightweight design and focus on user-friendliness make it a great starting point for developers exploring multi-agent orchestration. Swarm simplifies the process of building multi-agent systems with its straightforward approach to defining agents, their capabilities, and their interactions. The handoff mechanism, as illustrated in our previous examples, enables seamless transitions between agents, facilitating complex workflows.</p>
<p>However, it's crucial to recognize that Swarm is still in its experimental stages and primarily serves educational purposes. This means it may not possess the robust features or extensive functionalities of more established frameworks like LangChain or CrewAI. For instance, Swarm's reliance solely on the OpenAI API may limit its flexibility and integration capabilities. Moreover, the absence of built-in memory management can make it harder to build personalized agents.</p>
<p>If you found this guide helpful and you're looking to learn more then don’t forget to <a target="_blank" href="https://www.youtube.com/@AIDemosVideos">follow us</a>.</p>
<p>If you're looking to further enhance your skills in multi-agent app development with a more robust framework, I recommend checking out our <a target="_blank" href="https://blog.futuresmart.ai/langgraph-tutorial-for-beginners"><strong>LangGraph Tutorial for Beginners</strong></a></p>
<p>At <strong>FutureSmart AI</strong>, we specialize in helping companies build cutting-edge AI solutions similar to the ones discussed in this blog. To explore how we can assist your business, feel free to reach out to us at <a target="_blank" href="mailto:contact@futuresmart.ai"><strong>contact@futuresmart.ai</strong></a>.</p>
<p>For real-world examples of our work, take a look at our <a target="_blank" href="https://www.futuresmart.ai/case-studies">case studies</a>, where we showcase the practical value of our expertise.</p>
]]></content:encoded></item><item><title><![CDATA[How to Build LangGraph Agent with Long-term Memory]]></title><description><![CDATA[Introduction
The future of AI depends on solving a key issue with large language models (LLMs): their inability to retain long-term information during extended conversations. This limitation can lead to inaccurate responses in fields like healthcare ...]]></description><link>https://blog.futuresmart.ai/how-to-build-langgraph-agent-with-long-term-memory</link><guid isPermaLink="true">https://blog.futuresmart.ai/how-to-build-langgraph-agent-with-long-term-memory</guid><category><![CDATA[aimemory]]></category><category><![CDATA[langgraph]]></category><category><![CDATA[memory-management]]></category><category><![CDATA[conversational-ai]]></category><category><![CDATA[Machine Learning]]></category><dc:creator><![CDATA[Shreyas Dhaware]]></dc:creator><pubDate>Thu, 24 Oct 2024 09:48:49 GMT</pubDate><enclosure url="https://cdn.hashnode.com/res/hashnode/image/upload/v1729571068722/7f896b46-e7d0-41a9-af66-20d06844ab1d.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h1 id="heading-introduction"><strong>Introduction</strong></h1>
<p>The future of AI depends on solving a key issue with large language models (LLMs): their inability to retain long-term information during extended conversations. This limitation can lead to inaccurate responses in fields like healthcare and customer support, where continuous context is crucial. Stateful or "memory-augmented" models offer a solution by storing and recalling information over time, enabling more seamless, context-aware interactions. These models provide richer insights, making them ideal for complex, data-driven environments requiring sustained attention and accuracy.</p>
<p>In the context of AI systems, <strong>short-term memory</strong> and <strong>long-term memory</strong> refer to how information is stored and retrieved across different interactions:</p>
<ol>
<li><p><strong>Short-term memory</strong>: It captures information from a single conversation or interaction thread. This memory is typically used to maintain context within a session and allows AI to recall recent actions, queries, and responses. In LangGraph, short-term memory is managed using a state <code>checkpointer</code>. The <code>checkpointer</code> saves a checkpoint of the graph state at every super-step within a Thread.</p>
</li>
<li><p><strong>Long-term memory</strong>: Long-Term Memory is used to retain information across multiple sessions or conversations. Long-term memory enables an AI to remember details about previous interactions even after a session ends. In LangGraph, long-term memory is persisted using a Memory Store. We can define a store to add Personalized Information about a User across threads.</p>
</li>
</ol>
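<p>Before diving into the real API, the two scopes can be pictured with a plain-Python toy model (this only mimics the idea, not LangGraph's actual classes): the checkpointer keys state by thread, while the store keys memories by a cross-thread namespace:</p>

```python
# Toy model of the two memory scopes (illustration only, not LangGraph's API)
checkpoints = {}  # short-term: conversation state, one entry per thread_id
store = {}        # long-term: facts keyed by (user_id, "memories"), shared across threads

def save_turn(thread_id, message):
    # Checkpointed state is scoped to a single conversation thread
    checkpoints.setdefault(thread_id, []).append(message)

def save_memory(user_id, key, value):
    # Long-term memories live in a namespace that outlives any one thread
    store.setdefault((user_id, "memories"), {})[key] = value

save_turn("thread-1", "Hi! My name is Bob")
save_memory("user-1", "name", "Bob")

# A brand-new thread starts with no short-term context...
assert "thread-2" not in checkpoints
# ...but the long-term store still remembers the user
print(store[("user-1", "memories")]["name"])  # Bob
```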
<p>If you're interested in learning how to build and customize LangGraph agents for specific tasks or workflows, I recommend checking out <a target="_blank" href="https://blog.futuresmart.ai/langgraph-tutorial-for-beginners">LangGraph Tutorial: A Comprehensive Guide for Beginners</a></p>
<h1 id="heading-what-is-long-term-memory-in-langchain">What is Long-Term Memory in LangGraph?</h1>
<p>Long Term Memory persists across different threads, allowing the AI to recall user preferences, instructions, or other important data. LangGraph handles long-term memory by saving it in custom "namespaces," which essentially reference specific sets of data stored as JSON documents. Each memory type is a Python class. This kind of memory can be useful for creating more personalized and adaptive user experiences.</p>
<p>A memory type is associated with a list of attributes such as a namespace, key, and value. The <strong>namespace</strong> helps organize and categorize the memory; namespaces are flexible and can represent anything, not just user-specific information. The <strong>key</strong> is the memory ID, acting as a unique identifier, while the <strong>value</strong> holds the memory content itself.</p>
<pre><code class="lang-python"><span class="hljs-keyword">import</span> uuid
<span class="hljs-keyword">from</span> langgraph.store.memory <span class="hljs-keyword">import</span> InMemoryStore

in_memory_store = InMemoryStore()

user_id = <span class="hljs-string">"1"</span>
namespace_for_memory = (user_id, <span class="hljs-string">"memories"</span>)

memory_id = str(uuid.uuid4())
memory = {<span class="hljs-string">"food_preference"</span> : <span class="hljs-string">"I like pizza"</span>}
in_memory_store.put(namespace_for_memory, memory_id, memory)
</code></pre>
<pre><code class="lang-python">memories = in_memory_store.search(namespace_for_memory)
memories[<span class="hljs-number">-1</span>].dict()
{<span class="hljs-string">'value'</span>: {<span class="hljs-string">'food_preference'</span>: <span class="hljs-string">'I like pizza'</span>},
 <span class="hljs-string">'key'</span>: <span class="hljs-string">'07e0caf4-1631-47b7-b15f-65515d4c1843'</span>,
 <span class="hljs-string">'namespace'</span>: [<span class="hljs-string">'1'</span>, <span class="hljs-string">'memories'</span>],
 <span class="hljs-string">'created_at'</span>: <span class="hljs-string">'2024-10-02T17:22:31.590602+00:00'</span>,
 <span class="hljs-string">'updated_at'</span>: <span class="hljs-string">'2024-10-02T17:22:31.590605+00:00'</span>}
</code></pre>
<p>Explore how to enhance your <a target="_blank" href="https://blog.futuresmart.ai/integrating-mem0-with-langchain">Langchain applications with <strong>Mem0</strong></a>, a memory management system that personalizes AI interactions. This blog highlights Mem0's integration, showcasing its <strong>similarity search feature</strong></p>
<h1 id="heading-setting-up-your-environment">Setting Up Your Environment</h1>
<p>Before you begin, ensure you have the necessary prerequisites:</p>
<ul>
<li><p>Python environment with required libraries.</p>
</li>
<li><p>Access to <a target="_blank" href="https://platform.openai.com/docs/overview"><strong>OpenAI</strong></a> API keys.</p>
</li>
</ul>
<h2 id="heading-install-required-packages"><strong>Install Required Packages</strong></h2>
<p>Before setting up your environment, make sure to install the necessary packages. If you're using a Google Colab notebook, you can run the following command in a cell:</p>
<pre><code class="lang-python">pip install -U langgraph langchain_openai
</code></pre>
<h2 id="heading-setting-up-environment-variables">Setting up environment variables</h2>
<p>Once the packages are installed, set up your environment variables:</p>
<pre><code class="lang-python"><span class="hljs-keyword">import</span> getpass
<span class="hljs-keyword">import</span> os

<span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">_set_env</span>(<span class="hljs-params">var: str</span>):</span>
    <span class="hljs-keyword">if</span> <span class="hljs-keyword">not</span> os.environ.get(var):
        os.environ[var] = getpass.getpass(<span class="hljs-string">f"<span class="hljs-subst">{var}</span>: "</span>)

_set_env(<span class="hljs-string">"OPENAI_API_KEY"</span>)
</code></pre>
<h1 id="heading-building-long-term-memory-example">Building Long-Term Memory Example</h1>
<h2 id="heading-loading-checkpointer-and-storage-long-term-storage">Loading Checkpointer and Storage Long-Term Storage</h2>
<pre><code class="lang-python"><span class="hljs-keyword">from</span> langgraph.store.memory <span class="hljs-keyword">import</span> InMemoryStore

in_memory_store = InMemoryStore()
</code></pre>
<p>LangGraph provides an <code>InMemoryStore</code> to manage long-term memory within conversational agents or workflows. Note that it lives in process memory only: it is not persistent storage, so its contents are lost when the application restarts.</p>
<pre><code class="lang-python"><span class="hljs-keyword">import</span> uuid
<span class="hljs-keyword">from</span> langchain_openai <span class="hljs-keyword">import</span> ChatOpenAI
<span class="hljs-keyword">from</span> langchain_core.runnables <span class="hljs-keyword">import</span> RunnableConfig
<span class="hljs-keyword">from</span> langgraph.graph <span class="hljs-keyword">import</span> StateGraph, MessagesState, START, END
<span class="hljs-keyword">from</span> langgraph.store.base <span class="hljs-keyword">import</span> BaseStore
<span class="hljs-keyword">from</span> typing <span class="hljs-keyword">import</span> Annotated, Optional
<span class="hljs-keyword">from</span> langchain_core.tools <span class="hljs-keyword">import</span> InjectedToolArg, tool
</code></pre>
<ul>
<li><p><code>import uuid</code>:</p>
<ul>
<li>This generates universally unique identifiers (UUIDs), often used to create unique references.</li>
</ul>
</li>
<li><p><code>from langgraph.graph import StateGraph, MessagesState, START, END</code>:</p>
<ul>
<li><p><code>StateGraph</code>: Represents a graph that tracks the state of a conversational agent. It allows for storing and handling the flow of state and data.</p>
</li>
<li><p><code>MessagesState</code>: Handles conversation history as part of the agent's memory and automatically appends every interaction to the state.</p>
</li>
</ul>
</li>
<li><p><code>from langchain_core.tools import InjectedToolArg, tool</code>:</p>
<ul>
<li><p>The <code>InjectedToolArg</code> annotation lets us hide certain parameters, like <code>user_id</code>, <code>config</code> from the model, ensuring they're injected automatically at runtime. This keeps sensitive data secure while simplifying the model's responsibilities.</p>
</li>
<li><p><code>tool</code>: A decorator used to define tools or functions that the agent can call as part of its process.</p>
</li>
</ul>
</li>
</ul>
<h2 id="heading-constructing-a-tool">Constructing a Tool</h2>
<pre><code class="lang-python"><span class="hljs-meta">@tool</span>
<span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">upsert_memory</span>(<span class="hljs-params">
    content: str,
    context: str,
    memory_id: Optional[str] = None,
    *,
    config: Annotated[RunnableConfig, InjectedToolArg],
    store: Annotated[BaseStore, InjectedToolArg],
</span>):</span>
    <span class="hljs-string">"""Upsert a memory in the database.

    If a memory conflicts with an existing one, then just UPDATE the
    existing one by passing in memory_id - don't create two memories
    that are the same. If the user corrects a memory, UPDATE it.

    Args:
        content: The main content of the memory. For example:
            "User expressed interest in learning about French."
        context: Additional context for the memory. For example:
            "This was mentioned while discussing career options in Europe."
        memory_id: ONLY PROVIDE IF UPDATING AN EXISTING MEMORY.
        The memory to overwrite.
    """</span>
    mem_id = memory_id <span class="hljs-keyword">or</span> uuid.uuid4()
    user_id = config[<span class="hljs-string">"configurable"</span>][<span class="hljs-string">"user_id"</span>]
    store.put(
        (<span class="hljs-string">"memories"</span>, user_id),
        key=str(mem_id),
        value={<span class="hljs-string">"content"</span>: content, <span class="hljs-string">"context"</span>: context},
    )
    <span class="hljs-keyword">return</span> <span class="hljs-string">f"Stored memory <span class="hljs-subst">{content}</span>"</span>
</code></pre>
<p>The <code>upsert_memory</code> function either creates a new memory or updates an existing one if it already exists. It allows the user to store memories with two key attributes: <code>content</code> (the main memory) and <code>context</code> (additional information to help frame the memory).</p>
<p>Storing memories as a collection of documents simplifies certain tasks, making each memory more focused and easier to generate while improving recall, since creating new objects for new information is often more effective than trying to update one profile.</p>
<p>For more on profile-based and document-based memory organization, refer to the <a target="_blank" href="https://langchain-ai.github.io/langgraph/concepts/memory/?ref=blog.langchain.dev#representing-memories">official LangGraph documentation</a>. It provides insights into managing complexity in long-term memory updates and balancing efficient recall with accurate information storage.</p>
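<p>To make the trade-off concrete, here is a small illustration (toy data of my own, not LangGraph objects) contrasting a single evolving profile with a collection of focused memory documents:</p>

```python
# 1) Profile style: one document that must be edited in place.
#    Updates get riskier as the profile grows (merge/overwrite mistakes).
profile = {"name": "Bob", "interests": ["tech"]}
profile["interests"].append("French")

# 2) Collection style: each fact is its own small document, simply appended.
#    Recall can then search over many focused records instead of one blob.
collection = [
    {"content": "User expressed interest in learning about French.",
     "context": "Mentioned while discussing career options in Europe."},
]
collection.append({"content": "User's name is Bob.",
                   "context": "Introduced themselves at the start of the chat."})

print(len(collection))  # 2
```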
<h2 id="heading-setting-nodes-for-memory-storage">Setting Nodes for Memory Storage</h2>
<pre><code class="lang-python"><span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">store_memory</span>(<span class="hljs-params">state: MessagesState, config: RunnableConfig, store: BaseStore</span>):</span>
    <span class="hljs-comment"># Extract tool calls from the last message</span>
    tool_calls = state[<span class="hljs-string">"messages"</span>][<span class="hljs-number">-1</span>].tool_calls
    saved_memories=[]
    <span class="hljs-keyword">for</span> tc <span class="hljs-keyword">in</span> tool_calls:
        content = tc[<span class="hljs-string">'args'</span>][<span class="hljs-string">'content'</span>]
        context = tc[<span class="hljs-string">'args'</span>][<span class="hljs-string">'context'</span>]
        saved_memories.append([
            upsert_memory.invoke({<span class="hljs-string">'content'</span>: content, <span class="hljs-string">'context'</span>: context, <span class="hljs-string">'config'</span>:config, <span class="hljs-string">'store'</span>:store})  
        ])
    print(<span class="hljs-string">"saved_memories: "</span>, saved_memories)

    results = [
        {
            <span class="hljs-string">"role"</span>: <span class="hljs-string">"tool"</span>,
            <span class="hljs-string">"content"</span>: mem[<span class="hljs-number">0</span>],
            <span class="hljs-string">"tool_call_id"</span>: tc[<span class="hljs-string">"id"</span>],
        }
        <span class="hljs-keyword">for</span> tc, mem <span class="hljs-keyword">in</span> zip(tool_calls, saved_memories)
    ]
    print(results)
    <span class="hljs-keyword">return</span> {<span class="hljs-string">"messages"</span>: results[<span class="hljs-number">0</span>]}
</code></pre>
<p>In LangGraph, nodes are the key components of a workflow that perform distinct tasks, such as communicating with an LLM, interacting with external tools, or processing data. The <code>store_memory</code> function extracts tool calls from the latest message and uses them to save memories in <code>store</code>. Each memory is saved by invoking the <code>upsert_memory</code> function, which takes <code>content</code> and <code>context</code> as inputs. The confirmation returned by the tool is then wrapped in a tool message, with the role <code>"tool"</code> and the corresponding tool call ID, before being returned to the graph. Returning the stored content this way lets the user see exactly what memory was saved.</p>
<p>Writing memories in the hot path involves saving them while the application is running, offering benefits like real-time updates—making memory immediately available—and greater transparency. This builds trust, since the system explicitly decides what to save. However, there are downsides. The system is multitasking, balancing memory management with its primary task.</p>
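<p>The alternative to hot-path writes is writing memories in the background: answer first, persist later. The sketch below uses a simple queue to illustrate the idea (hypothetical helper names of my own; the graph in this tutorial writes memories in the hot path):</p>

```python
from queue import Queue

memory_queue = Queue()  # candidate memories wait here instead of blocking the reply
saved = []

def respond(user_message: str) -> str:
    # Enqueue the candidate memory rather than writing it synchronously
    memory_queue.put({"content": user_message, "context": "chat"})
    return "answer"  # the user gets a reply without waiting on memory writes

def flush_memories():
    # Runs off the hot path, e.g. after the response or on a background worker
    while not memory_queue.empty():
        saved.append(memory_queue.get())

reply = respond("My name is Bob")
flush_memories()
print(reply, len(saved))  # answer 1
```

<p>The cost of this design is the mirror image of the hot-path approach: the reply stays fast, but the memory is not yet available within the same turn and the system is less transparent about what it decided to save.</p>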
<h3 id="heading-setting-nodes-for-agent">Setting Nodes for Agent</h3>
<pre><code class="lang-python"><span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">call_model</span>(<span class="hljs-params">state: MessagesState, config: RunnableConfig, *, store: BaseStore</span>):</span>
    user_id = config[<span class="hljs-string">"configurable"</span>][<span class="hljs-string">"user_id"</span>]
    namespace = (<span class="hljs-string">"memories"</span>, user_id)
    memories = store.search(namespace)
    info = <span class="hljs-string">"\n"</span>.join(<span class="hljs-string">f"[<span class="hljs-subst">{mem.key}</span>]: <span class="hljs-subst">{mem.value}</span>"</span> <span class="hljs-keyword">for</span> mem <span class="hljs-keyword">in</span> memories)
    <span class="hljs-keyword">if</span> info:
        info = <span class="hljs-string">f"""
    &lt;memories&gt;
    <span class="hljs-subst">{info}</span>
    &lt;/memories&gt;"""</span>

    system_msg = <span class="hljs-string">f'''You are a helpful assistant talking to the user. You must decide whether to store information as memory from list of messages and then answer the user query or directly answer the user query
        User context info: <span class="hljs-subst">{info}</span>'''</span>
    print(<span class="hljs-string">"system_msg:"</span>, system_msg)
    <span class="hljs-comment"># Store new memories if the user asks the model to remember</span>
    last_message = state[<span class="hljs-string">"messages"</span>][<span class="hljs-number">-1</span>]
    print( [{<span class="hljs-string">"type"</span>: <span class="hljs-string">"system"</span>, <span class="hljs-string">"content"</span>: system_msg}] + state[<span class="hljs-string">"messages"</span>])
    response = model.bind_tools([upsert_memory]).invoke(
        [{<span class="hljs-string">"type"</span>: <span class="hljs-string">"system"</span>, <span class="hljs-string">"content"</span>: system_msg}] + state[<span class="hljs-string">"messages"</span>]
    )
    <span class="hljs-keyword">return</span> {<span class="hljs-string">"messages"</span>: response}
</code></pre>
<p>The <code>call_model</code> function retrieves stored memories for a specific user from a storage (<code>store</code>), formats these memories as part of a system message, and uses that information to provide context for generating a response to the user's query. The memories are fetched based on the user ID and are then formatted into a structured XML-like block (<code>&lt;memories&gt;</code> tag) to be included in the system message.</p>
<p>The system message is intended to inform the model of any relevant context from prior interactions to help it determine whether to update the memory or simply respond to the current user query. Finally, the function returns the generated response.</p>
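<p>The prompt-assembly step can be tried in isolation with stand-in data (the <code>FakeItem</code> class below is my own stub that only mimics the shape of items returned by <code>store.search</code>):</p>

```python
from dataclasses import dataclass

@dataclass
class FakeItem:
    # Stub mimicking the key/value shape of items from store.search(namespace)
    key: str
    value: dict

memories = [FakeItem("mem-1", {"content": "User's name is Bob", "context": "intro"})]

# Same formatting logic as call_model: one "[key]: value" line per memory,
# wrapped in a <memories> block when any memories exist
info = "\n".join(f"[{m.key}]: {m.value}" for m in memories)
if info:
    info = f"<memories>\n{info}\n</memories>"

system_msg = f"You are a helpful assistant talking to the user.\nUser context info: {info}"
print(system_msg)
```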
<h2 id="heading-conditional-edge-logic">Conditional Edge Logic</h2>
<pre><code class="lang-python"><span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">route_message</span>(<span class="hljs-params">state: MessagesState</span>):</span>
    <span class="hljs-string">"""Determine the next step based on the presence of tool calls."""</span>
    msg = state[<span class="hljs-string">"messages"</span>][<span class="hljs-number">-1</span>]
    <span class="hljs-keyword">if</span> msg.tool_calls:
        <span class="hljs-comment"># If there are tool calls, we need to store memories</span>
        <span class="hljs-keyword">return</span> <span class="hljs-string">"store_memory"</span>
    <span class="hljs-comment"># Otherwise, finish; user can send the next message</span>
    <span class="hljs-keyword">return</span> END
</code></pre>
<p>The <code>route_message</code> function determines the next action based on the presence of tool calls in the latest message. If tool calls exist, it directs the workflow to store memories by returning "store_memory"; otherwise, it concludes the current process by returning <code>END</code>.</p>
<h2 id="heading-loading-and-compiling-graph">Loading and Compiling Graph</h2>
<pre><code class="lang-python">builder = StateGraph(MessagesState)

builder.add_node(<span class="hljs-string">"call_model"</span>, call_model)
builder.add_node(store_memory)

builder.add_edge(START, <span class="hljs-string">"call_model"</span>)
builder.add_conditional_edges(<span class="hljs-string">"call_model"</span>, route_message, [<span class="hljs-string">"store_memory"</span>, END])
builder.add_edge(<span class="hljs-string">"store_memory"</span>, <span class="hljs-string">"call_model"</span>)

graph = builder.compile(store=in_memory_store)
</code></pre>
<p><code>StateGraph</code> is constructed by adding nodes representing different actions like <code>call_model</code> and <code>store_memory</code>, with conditional routing between them based on the <code>route_message</code> function. After defining the workflow, the graph is compiled into a runnable instance using the provided <code>in_memory_store</code>, enabling the execution of the defined conversational flow. Optionally, a checkpointer <code>MemorySaver</code> can be used for short-term memory handling.</p>
<h2 id="heading-visualize-the-graph"><strong>Visualize the Graph:</strong></h2>
<p>With a few lines of Python, you can render the graph's nodes and edges as a diagram.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1729364267835/e44e3859-4f70-4f43-a55e-1781d8295474.jpeg" alt class="image--center mx-auto" /></p>
<blockquote>
<p>Source: Author</p>
</blockquote>
<h2 id="heading-run-the-graph">Run the graph!</h2>
<pre><code class="lang-python">config = {<span class="hljs-string">"configurable"</span>: {<span class="hljs-string">"thread_id"</span>: <span class="hljs-string">"1"</span>, <span class="hljs-string">"user_id"</span>: <span class="hljs-string">"1"</span>}}
input_message = {<span class="hljs-string">"type"</span>: <span class="hljs-string">"user"</span>, <span class="hljs-string">"content"</span>: <span class="hljs-string">"Hi! My name is Bob. I love keep updated on Latest Tech"</span>}
<span class="hljs-keyword">for</span> chunk <span class="hljs-keyword">in</span> graph.stream({<span class="hljs-string">"messages"</span>: [input_message]}, config, stream_mode=<span class="hljs-string">"values"</span>):
    chunk[<span class="hljs-string">"messages"</span>][<span class="hljs-number">-1</span>].pretty_print()
</code></pre>
<p>This code initiates a conversation by passing the user's input (<code>input_message</code>) and configuration to the compiled <code>graph</code>. The conversation flow is executed in streaming mode, with the chatbot processing the message and outputting a response, which is printed. To improve this, a loop can be added to continuously interact with the user, feeding their inputs into the graph and displaying the chatbot's responses, enabling an ongoing conversation instead of a single exchange.</p>
<p><strong>Output</strong></p>
<pre><code class="lang-yaml"><span class="hljs-string">================================</span> <span class="hljs-string">Human</span> <span class="hljs-string">Message</span> <span class="hljs-string">=================================</span>

<span class="hljs-string">Hi!</span> <span class="hljs-string">My</span> <span class="hljs-string">name</span> <span class="hljs-string">is</span> <span class="hljs-string">Bob.</span> <span class="hljs-string">I</span> <span class="hljs-string">love</span> <span class="hljs-string">keep</span> <span class="hljs-string">updated</span> <span class="hljs-string">on</span> <span class="hljs-string">Latest</span> <span class="hljs-string">Tech</span>
<span class="hljs-attr">system_msg:</span> <span class="hljs-string">You</span> <span class="hljs-string">are</span> <span class="hljs-string">a</span> <span class="hljs-string">helpful</span> <span class="hljs-string">assistant</span> <span class="hljs-string">talking</span> <span class="hljs-string">to</span> <span class="hljs-string">the</span> <span class="hljs-string">user.</span> <span class="hljs-string">You</span> <span class="hljs-string">must</span> <span class="hljs-string">decide</span> <span class="hljs-string">whether</span> <span class="hljs-string">to</span> <span class="hljs-string">store</span> <span class="hljs-string">information</span> <span class="hljs-string">as</span> <span class="hljs-string">memory</span> <span class="hljs-string">from</span> <span class="hljs-string">list</span> <span class="hljs-string">of</span> <span class="hljs-string">messages</span> <span class="hljs-string">and</span> <span class="hljs-string">then</span> <span class="hljs-string">answer</span> <span class="hljs-string">the</span> <span class="hljs-string">user</span> <span class="hljs-string">query</span> <span class="hljs-string">or</span> <span class="hljs-string">directly</span> <span class="hljs-string">answer</span> <span class="hljs-string">the</span> <span class="hljs-string">user</span> <span class="hljs-string">query</span>
        <span class="hljs-attr">User context info:</span> 
<span class="hljs-string">==================================</span> <span class="hljs-string">Ai</span> <span class="hljs-string">Message</span> <span class="hljs-string">==================================</span>
<span class="hljs-attr">Tool Calls:</span>
  <span class="hljs-string">upsert_memory</span> <span class="hljs-string">(call_DEstE5sEOltuEE4lbfcgZPar)</span>
 <span class="hljs-attr">Call ID:</span> <span class="hljs-string">call_DEstE5sEOltuEE4lbfcgZPar</span>
  <span class="hljs-attr">Args:</span>
    <span class="hljs-attr">content:</span> <span class="hljs-string">User's</span> <span class="hljs-string">name</span> <span class="hljs-string">is</span> <span class="hljs-string">Bob</span> <span class="hljs-string">and</span> <span class="hljs-string">he</span> <span class="hljs-string">loves</span> <span class="hljs-string">keeping</span> <span class="hljs-string">updated</span> <span class="hljs-string">on</span> <span class="hljs-string">the</span> <span class="hljs-string">latest</span> <span class="hljs-string">tech.</span>
    <span class="hljs-attr">context:</span> <span class="hljs-string">Bob</span> <span class="hljs-string">introduced</span> <span class="hljs-string">himself</span> <span class="hljs-string">and</span> <span class="hljs-string">shared</span> <span class="hljs-string">his</span> <span class="hljs-string">interest</span> <span class="hljs-string">in</span> <span class="hljs-string">technology.</span>
<span class="hljs-string">=================================</span> <span class="hljs-string">Tool</span> <span class="hljs-string">Message</span> <span class="hljs-string">=================================</span>

<span class="hljs-string">Stored</span> <span class="hljs-string">memory</span> <span class="hljs-string">User's</span> <span class="hljs-string">name</span> <span class="hljs-string">is</span> <span class="hljs-string">Bob</span> <span class="hljs-string">and</span> <span class="hljs-string">he</span> <span class="hljs-string">loves</span> <span class="hljs-string">keeping</span> <span class="hljs-string">updated</span> <span class="hljs-string">on</span> <span class="hljs-string">the</span> <span class="hljs-string">latest</span> <span class="hljs-string">tech.</span>
<span class="hljs-attr">system_msg:</span> <span class="hljs-string">You</span> <span class="hljs-string">are</span> <span class="hljs-string">a</span> <span class="hljs-string">helpful</span> <span class="hljs-string">assistant</span> <span class="hljs-string">talking</span> <span class="hljs-string">to</span> <span class="hljs-string">the</span> <span class="hljs-string">user.</span> <span class="hljs-string">You</span> <span class="hljs-string">must</span> <span class="hljs-string">decide</span> <span class="hljs-string">whether</span> <span class="hljs-string">to</span> <span class="hljs-string">store</span> <span class="hljs-string">information</span> <span class="hljs-string">as</span> <span class="hljs-string">memory</span> <span class="hljs-string">from</span> <span class="hljs-string">list</span> <span class="hljs-string">of</span> <span class="hljs-string">messages</span> <span class="hljs-string">and</span> <span class="hljs-string">then</span> <span class="hljs-string">answer</span> <span class="hljs-string">the</span> <span class="hljs-string">user</span> <span class="hljs-string">query</span> <span class="hljs-string">or</span> <span class="hljs-string">directly</span> <span class="hljs-string">answer</span> <span class="hljs-string">the</span> <span class="hljs-string">user</span> <span class="hljs-string">query</span>
        <span class="hljs-attr">User context info:</span> 
    <span class="hljs-string">&lt;memories&gt;</span>
    [<span class="hljs-string">7a5a3425-4617-4794-9207-0dd7ffe28f05</span>]<span class="hljs-string">:</span> {<span class="hljs-attr">'content':</span> <span class="hljs-string">"User's name is Bob and he loves keeping updated on the latest tech."</span>, <span class="hljs-attr">'context':</span> <span class="hljs-string">'Bob introduced himself and shared his interest in technology.'</span>}
    <span class="hljs-string">&lt;/memories&gt;</span>
<span class="hljs-string">==================================</span> <span class="hljs-string">Ai</span> <span class="hljs-string">Message</span> <span class="hljs-string">==================================</span>

<span class="hljs-string">Hi</span> <span class="hljs-string">Bob!</span> <span class="hljs-string">It's</span> <span class="hljs-string">great</span> <span class="hljs-string">to</span> <span class="hljs-string">meet</span> <span class="hljs-string">someone</span> <span class="hljs-string">who</span> <span class="hljs-string">loves</span> <span class="hljs-string">staying</span> <span class="hljs-string">updated</span> <span class="hljs-string">on</span> <span class="hljs-string">the</span> <span class="hljs-string">latest</span> <span class="hljs-string">tech.</span> <span class="hljs-string">How</span> <span class="hljs-string">can</span> <span class="hljs-string">I</span> <span class="hljs-string">assist</span> <span class="hljs-string">you</span> <span class="hljs-string">today?</span>
</code></pre>
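<p>The snippet above handles a single exchange. To keep the conversation going, the streaming call can be wrapped in a loop. Below is a minimal sketch: it assumes the compiled <code>graph</code> and <code>config</code> objects from earlier, and takes the input function as a parameter so the loop is easy to test or swap out.</p>
<pre><code class="lang-python">def chat_loop(graph, config, get_input=input):
    """Feed user messages into the compiled graph until the user types 'quit'."""
    while True:
        user_text = get_input("You: ")
        if user_text.strip().lower() in {"quit", "exit"}:
            return
        message = {"type": "user", "content": user_text}
        # Stream the graph's state after each step and print the latest message
        for chunk in graph.stream({"messages": [message]}, config, stream_mode="values"):
            chunk["messages"][-1].pretty_print()
</code></pre>
<p>Calling <code>chat_loop(graph, config)</code> then gives an ongoing chat session in the terminal instead of a one-off exchange.</p>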
<h1 id="heading-conclusion">Conclusion</h1>
<p>Integrating long-term memory into AI systems marks a major step forward in enhancing large language model (LLM) applications. Managing long-term memory—whether in real time (hot path) or in the background—balances performance with data retention. Techniques like profile-based and document-based memory organization also improve the system's ability to store and recall relevant information.</p>
<p>By integrating nodes for memory storage, retrieval, and conversational processing, developers can build more context-aware agents capable of updating and refining user information in real-time. The use of conditional edge logic ensures that memory storage and retrieval only occur when needed, balancing performance with accuracy.</p>
<p>To build a reliable LLM agent, leveraging both short-term and long-term memory is crucial. Short-term memory helps maintain session context, while long-term memory allows the agent to recall past interactions, leading to more personalized and accurate responses over time. Combining these ensures more intelligent, responsive, and user-friendly applications.</p>
<h1 id="heading-resources-and-references"><strong>Resources and References</strong></h1>
<ol>
<li><p><a target="_blank" href="https://langchain-ai.github.io/langgraph/">Langgraph Documentation</a></p>
</li>
<li><p><a target="_blank" href="https://github.com/langchain-ai/memory-agent">Langgraph Github</a></p>
</li>
<li><p><a target="_blank" href="https://github.com/PradipNichite/FutureSmart-AI-Blog/blob/main/Langgraph%20Long-Term%20Memory%20Blog/Langgraph%20Long-Term%20Memory.ipynb">Code implementation</a></p>
</li>
</ol>
<p>Follow FutureSmart AI to stay up-to-date with the latest and most fascinating AI-related <a target="_blank" href="https://blog.futuresmart.ai/">blogs</a>.</p>
<p>Looking to catch up on the latest AI tools and applications? Look no further than <a target="_blank" href="https://www.aidemos.com/"><strong>AI Demos</strong></a>. This directory features a wide range of video demonstrations showcasing the latest and most innovative AI technologies.</p>
<h1 id="heading-next-steps-bringing-ai-into-your-business"><strong>Next Steps: Bringing AI into Your Business</strong></h1>
<p>Whether you're looking to integrate cutting-edge NLP models or deploy multimodal AI systems, we're here to support your journey. Reach out to us at <strong>contact@futuresmart.ai</strong> to learn more about how we can help.</p>
<p>Don't forget to check out our <a target="_blank" href="https://www.futuresmart.ai/case-studies">futuresmart.ai/case-studies</a> to see how we've successfully partnered with companies to implement transformative AI solutions.</p>
<p>Let us help you take the next step in your AI journey.</p>
]]></content:encoded></item><item><title><![CDATA[How to Fine-Tune OpenAI GPT-4o Model: A Step-by-Step Guide]]></title><description><![CDATA[Fine-tuning OpenAI models like GPT-4 has become essential for businesses looking to create specialized AI applications. In this comprehensive guide, you'll learn exactly how to fine-tune OpenAI models using Python, complete with working code examples...]]></description><link>https://blog.futuresmart.ai/how-to-fine-tune-openai-gpt-4o-model-a-step-by-step-guide</link><guid isPermaLink="true">https://blog.futuresmart.ai/how-to-fine-tune-openai-gpt-4o-model-a-step-by-step-guide</guid><category><![CDATA[GPT-4o]]></category><category><![CDATA[finetuning]]></category><category><![CDATA[openai]]></category><dc:creator><![CDATA[Pradip Nichite]]></dc:creator><pubDate>Wed, 23 Oct 2024 18:19:57 GMT</pubDate><enclosure url="https://cdn.hashnode.com/res/hashnode/image/upload/v1729706880732/c65d263c-49ea-40f3-b0d8-f381778e5739.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Fine-tuning OpenAI models like GPT-4 has become essential for businesses looking to create specialized AI applications. In this comprehensive guide, you'll learn exactly how to fine-tune OpenAI models using Python, complete with working code examples and real-world applications.</p>
<p>Fine-tuning allows you to improve model performance on specific tasks by training it with your own examples. Instead of relying on complex prompts, you can teach the model to understand your exact requirements through demonstration.</p>
<h2 id="heading-why-consider-fine-tuning">Why Consider Fine-Tuning?</h2>
<p>Three main benefits make fine-tuning particularly valuable:</p>
<ol>
<li><p><strong>Higher Quality Results</strong> When basic prompting isn't enough to get consistent results, fine-tuning helps the model adapt to your specific business use case.</p>
</li>
<li><p><strong>Handling Multiple Examples</strong> Sometimes your use case requires many examples that won't fit in a standard prompt. Fine-tuning lets you incorporate all these examples into the model itself.</p>
</li>
<li><p><strong>Simplified Prompts</strong> After fine-tuning, you can achieve better results with shorter prompts, leading to faster execution times.</p>
</li>
</ol>
<h3 id="heading-real-world-applications">Real-World Applications</h3>
<p>Fine-tuning particularly shines in two common scenarios:</p>
<ol>
<li><p><strong>Style and Format Consistency</strong></p>
<ul>
<li><p>Generating product descriptions with specific tones</p>
</li>
<li><p>Maintaining consistent writing styles</p>
</li>
<li><p>Formatting outputs in particular ways</p>
</li>
</ul>
</li>
<li><p><strong>Reliable Structured Output</strong></p>
<ul>
<li><p>Converting unstructured text into JSON</p>
</li>
<li><p>Extracting specific fields consistently</p>
</li>
<li><p>Ensuring standardized response formats</p>
</li>
</ul>
</li>
</ol>
<p>For example, in our medical data extraction implementation, we'll turn medical reports into structured data:</p>
<pre><code class="lang-python"><span class="hljs-comment"># Example of the structured output we want to achieve</span>
{
    <span class="hljs-string">"patient name"</span>: <span class="hljs-string">"Sarah Johnson"</span>,
    <span class="hljs-string">"age"</span>: <span class="hljs-number">32</span>,
    <span class="hljs-string">"diagnosis"</span>: <span class="hljs-string">"migraine headaches"</span>,
    <span class="hljs-string">"prescribed medication"</span>: <span class="hljs-string">"Sumatriptan"</span>
}
</code></pre>
<h2 id="heading-understanding-the-fine-tuning-process">Understanding the Fine-Tuning Process</h2>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1729703534186/550e238b-a0e1-4135-9ba1-7787c04c608e.png" alt class="image--center mx-auto" /></p>
<p>The process involves several stages:</p>
<ol>
<li><p>File validation</p>
</li>
<li><p>Job queuing</p>
</li>
<li><p>Training initialization</p>
</li>
<li><p>Training progress</p>
</li>
<li><p>Completion or failure</p>
</li>
</ol>
<h2 id="heading-preparing-data-for-fine-tuning-openai-models">Preparing Data for Fine-Tuning OpenAI Models</h2>
<h3 id="heading-understanding-the-data-format">Understanding the Data Format</h3>
<p>Before we start fine-tuning, we need to prepare our data in the correct format. For chat models like GPT-4, the training data should be in JSONL format (JSON Lines), where each line represents a conversation with messages between the system, user, and assistant.</p>
<p>Here's what a single training example looks like:</p>
<pre><code class="lang-json">{
  <span class="hljs-attr">"messages"</span>: [
    {
      <span class="hljs-attr">"role"</span>: <span class="hljs-string">"system"</span>,
      <span class="hljs-attr">"content"</span>: <span class="hljs-string">"Extract Details from medical report"</span>
    },
    {
      <span class="hljs-attr">"role"</span>: <span class="hljs-string">"user"</span>, 
      <span class="hljs-attr">"content"</span>: <span class="hljs-string">"Sarah Johnson, a 32-year-old female, presented to the clinic with complaints of severe migraine headaches occurring 3-4 times per week for the past 3 months. Patient reports throbbing pain on the right side of head, accompanied by photophobia and nausea. No previous history of migraines. Family history positive for migraines (mother). Physical examination revealed normal neurological findings. Blood pressure 118/76 mmHg. Prescribed Sumatriptan 50mg for acute episodes, with instructions to take at onset of symptoms."</span>
    },
    {
      <span class="hljs-attr">"role"</span>: <span class="hljs-string">"assistant"</span>,
      <span class="hljs-attr">"content"</span>: <span class="hljs-string">"{\"patient name\": \"Sarah Johnson\", \"age\": 32, \"diagnosis\": \"migraine headaches\", \"prescribed medication\": \"Sumatriptan\"}"</span>
    }
  ]
}
</code></pre>
<h3 id="heading-converting-csv-to-jsonl-format">Converting CSV to JSONL Format</h3>
<p>Let's write a Python function to convert our medical records CSV into the required JSONL format:</p>
<pre><code class="lang-python"><span class="hljs-keyword">import</span> csv
<span class="hljs-keyword">import</span> json

<span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">convert_csv_to_training_format</span>(<span class="hljs-params">input_csv, output_file</span>):</span>
    system_message = {
        <span class="hljs-string">"role"</span>: <span class="hljs-string">"system"</span>,
        <span class="hljs-string">"content"</span>: <span class="hljs-string">"Extract Details from medical report"</span>
    }

    <span class="hljs-keyword">with</span> open(input_csv, <span class="hljs-string">'r'</span>, encoding=<span class="hljs-string">'utf-8'</span>) <span class="hljs-keyword">as</span> csvfile, \
         open(output_file, <span class="hljs-string">'w'</span>, encoding=<span class="hljs-string">'utf-8'</span>) <span class="hljs-keyword">as</span> outfile:
        reader = csv.reader(csvfile)
        next(reader)  <span class="hljs-comment"># Skip header</span>

        <span class="hljs-keyword">for</span> row <span class="hljs-keyword">in</span> reader:
            medical_report = row[<span class="hljs-number">0</span>]
            extracted_json = row[<span class="hljs-number">1</span>]

            training_example = {
                <span class="hljs-string">"messages"</span>: [
                    system_message,
                    {<span class="hljs-string">"role"</span>: <span class="hljs-string">"user"</span>, <span class="hljs-string">"content"</span>: medical_report},
                    {<span class="hljs-string">"role"</span>: <span class="hljs-string">"assistant"</span>, <span class="hljs-string">"content"</span>: extracted_json}
                ]
            }
            outfile.write(json.dumps(training_example) + <span class="hljs-string">'\n'</span>)
</code></pre>
<h2 id="heading-creating-training-and-validation-sets">Creating Training and Validation Sets</h2>
<p>It's good practice to split your data into training and validation sets. Here's how to prepare both:</p>
<pre><code class="lang-python"><span class="hljs-comment"># Prepare training data</span>
convert_csv_to_training_format(<span class="hljs-string">"medical-records.csv"</span>, <span class="hljs-string">"training_data.jsonl"</span>)

<span class="hljs-comment"># Prepare validation data</span>
convert_csv_to_training_format(<span class="hljs-string">"validation-medical-records.csv"</span>, <span class="hljs-string">"validation_data.jsonl"</span>)
</code></pre>
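<p>Before uploading, it helps to sanity-check the generated files: every line must be valid JSON with the system/user/assistant message structure, or OpenAI's file validation step will reject the job. A small checker (a sketch; the function name is illustrative):</p>
<pre><code class="lang-python">import json

def validate_jsonl(path):
    """Verify each line parses as JSON and has the expected chat roles."""
    with open(path, "r", encoding="utf-8") as f:
        for i, line in enumerate(f, start=1):
            example = json.loads(line)  # raises an error on a malformed line
            roles = [m["role"] for m in example["messages"]]
            assert roles == ["system", "user", "assistant"], f"line {i}: bad roles {roles}"
    return True
</code></pre>
<p>Run it on both <code>training_data.jsonl</code> and <code>validation_data.jsonl</code> before uploading.</p>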
<h3 id="heading-key-components-of-the-training-data">Key Components of the Training Data</h3>
<ol>
<li><p><strong>System Message</strong>: A consistent instruction that sets the context for all examples</p>
</li>
<li><p><strong>User Messages</strong>: The input medical reports</p>
</li>
<li><p><strong>Assistant Messages</strong>: The expected structured JSON output</p>
</li>
<li><p><strong>JSONL Format</strong>: Each line is a complete, valid JSON object</p>
</li>
</ol>
<h3 id="heading-best-practices-for-data-preparation">Best Practices for Data Preparation</h3>
<ol>
<li><p>Keep the system message consistent across all examples</p>
</li>
<li><p>Ensure your JSON is properly formatted in the assistant responses</p>
</li>
<li><p>Include diverse examples to help the model generalize</p>
</li>
<li><p>Use validation data to monitor training progress</p>
</li>
</ol>
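<p>It is also worth checking the size of your dataset before training: OpenAI requires a minimum number of training examples (currently at least 10), and more diverse examples generally help the model generalize. A quick check (a sketch; the function name and threshold are illustrative):</p>
<pre><code class="lang-python">def dataset_stats(path, min_examples=10):
    """Count non-empty JSONL lines and flag datasets below the minimum size."""
    with open(path, "r", encoding="utf-8") as f:
        n = sum(1 for line in f if line.strip())
    return {"examples": n, "large_enough": n >= min_examples}
</code></pre>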
<h2 id="heading-setting-up-and-starting-the-fine-tuning-process">Setting Up and Starting the Fine-Tuning Process</h2>
<h3 id="heading-initial-setup-with-openai">Initial Setup with OpenAI</h3>
<p>First, let's set up the OpenAI client and necessary imports:</p>
<pre><code class="lang-python"><span class="hljs-keyword">from</span> openai <span class="hljs-keyword">import</span> OpenAI
<span class="hljs-keyword">from</span> time <span class="hljs-keyword">import</span> sleep

<span class="hljs-comment"># Initialize OpenAI client (your_api_key is a placeholder for your API key string)</span>
client = OpenAI(api_key=your_api_key)
</code></pre>
<h3 id="heading-step-1-uploading-training-files">Step 1: Uploading Training Files</h3>
<p>Before we can start fine-tuning, we need to upload our prepared data files to OpenAI. Here's the function to handle file uploads:</p>
<pre><code class="lang-python"><span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">upload_training_file</span>(<span class="hljs-params">file_path</span>):</span>
    <span class="hljs-string">"""Upload training file to OpenAI"""</span>
    <span class="hljs-keyword">with</span> open(file_path, <span class="hljs-string">"rb"</span>) <span class="hljs-keyword">as</span> file:
        response = client.files.create(
            file=file,
            purpose=<span class="hljs-string">"fine-tune"</span>
        )
        <span class="hljs-keyword">return</span> response.id

<span class="hljs-comment"># Upload both training and validation files</span>
training_file_id = upload_training_file(<span class="hljs-string">"training_data.jsonl"</span>)
validation_file_id = upload_training_file(<span class="hljs-string">"validation_data.jsonl"</span>)
</code></pre>
<h3 id="heading-step-2-creating-a-fine-tuning-job">Step 2: Creating a Fine-Tuning Job</h3>
<p>Once our files are uploaded, we can create a fine-tuning job:</p>
<pre><code class="lang-python"><span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">create_fine_tuning_job</span>(<span class="hljs-params">training_file_id, validation_file_id=None, model=<span class="hljs-string">"gpt-4o-mini-2024-07-18"</span></span>):</span>
    <span class="hljs-string">"""Create a fine-tuning job"""</span>
    response = client.fine_tuning.jobs.create(
        training_file=training_file_id,
        validation_file=validation_file_id,
        model=model
    )
    <span class="hljs-keyword">return</span> response.id

<span class="hljs-comment"># Start the fine-tuning job</span>
job_id = create_fine_tuning_job(training_file_id, validation_file_id)
</code></pre>
<h3 id="heading-step-3-monitoring-training-progress">Step 3: Monitoring Training Progress</h3>
<p>Fine-tuning isn't instant - we need to monitor the job's progress. Here's a function to track the status:</p>
<pre><code class="lang-python"><span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">monitor_job</span>(<span class="hljs-params">job_id</span>):</span>
    <span class="hljs-string">"""Monitor fine-tuning job progress"""</span>
    <span class="hljs-keyword">while</span> <span class="hljs-literal">True</span>:
        job = client.fine_tuning.jobs.retrieve(job_id)
        print(<span class="hljs-string">f"Status: <span class="hljs-subst">{job.status}</span>"</span>)

        <span class="hljs-keyword">if</span> job.status <span class="hljs-keyword">in</span> [<span class="hljs-string">"succeeded"</span>, <span class="hljs-string">"failed"</span>]:
            <span class="hljs-keyword">return</span> job

        <span class="hljs-comment"># List latest events</span>
        events = client.fine_tuning.jobs.list_events(
            fine_tuning_job_id=job_id,
            limit=<span class="hljs-number">5</span>
        )
        <span class="hljs-keyword">for</span> event <span class="hljs-keyword">in</span> events.data:
            print(<span class="hljs-string">f"Event: <span class="hljs-subst">{event.message}</span>"</span>)

        sleep(<span class="hljs-number">30</span>)  <span class="hljs-comment"># Check every 30 seconds</span>

<span class="hljs-comment"># Monitor the job until completion</span>
job = monitor_job(job_id)
<span class="hljs-keyword">if</span> job.status == <span class="hljs-string">"succeeded"</span>:
    fine_tuned_model = job.fine_tuned_model
    print(<span class="hljs-string">f"Fine-tuned model ID: <span class="hljs-subst">{fine_tuned_model}</span>"</span>)
<span class="hljs-keyword">else</span>:
    print(<span class="hljs-string">"Fine-tuning failed."</span>)
</code></pre>
<h3 id="heading-important-notes">Important Notes:</h3>
<ul>
<li><p>The process typically takes several minutes to complete</p>
</li>
<li><p>Keep track of your fine-tuned model ID for later use</p>
</li>
<li><p>The status updates help monitor training progress</p>
</li>
<li><p>The job either succeeds or fails - no partial completions</p>
</li>
</ul>
<h2 id="heading-testing-and-using-your-fine-tuned-model">Testing and Using Your Fine-Tuned Model</h2>
<h3 id="heading-making-predictions-with-your-model">Making Predictions with Your Model</h3>
<p>Let's test our fine-tuned model with a new medical report. Here's how to use it:</p>
<pre><code class="lang-python"><span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">test_model</span>(<span class="hljs-params">model_id, test_input</span>):</span>
    <span class="hljs-string">"""Test the fine-tuned model"""</span>
    completion = client.chat.completions.create(
        model=model_id,
        messages=[
            {
                <span class="hljs-string">"role"</span>: <span class="hljs-string">"system"</span>,
                <span class="hljs-string">"content"</span>: <span class="hljs-string">"Extract Details from medical report"</span>
            },
            {<span class="hljs-string">"role"</span>: <span class="hljs-string">"user"</span>, <span class="hljs-string">"content"</span>: test_input}
        ]
    )
    <span class="hljs-keyword">return</span> completion.choices[<span class="hljs-number">0</span>].message
</code></pre>
<p>Let's try it with a new medical report:</p>
<pre><code class="lang-python"><span class="hljs-comment"># Test input</span>
test_report = <span class="hljs-string">"""Marcus Wong, a 19-year-old male, presents with severe acne 
on face and upper back present for 1 year. Multiple inflammatory papules 
and nodules noted on examination. Previous trials of over-the-counter 
treatments ineffective. Started on Isotretinoin 40mg daily with monthly 
liver function monitoring."""</span>

<span class="hljs-comment"># Get prediction</span>
result = test_model(fine_tuned_model, test_report)

<span class="hljs-comment"># Parse the JSON response</span>
<span class="hljs-keyword">import</span> json
extracted_data = json.loads(result.content)
print(json.dumps(extracted_data, indent=<span class="hljs-number">2</span>))
</code></pre>
<p>Output:</p>
<pre><code class="lang-json">{
  <span class="hljs-attr">"patient name"</span>: <span class="hljs-string">"Marcus Wong"</span>,
  <span class="hljs-attr">"age"</span>: <span class="hljs-number">19</span>,
  <span class="hljs-attr">"diagnosis"</span>: <span class="hljs-string">"severe acne"</span>,
  <span class="hljs-attr">"prescribed medication"</span>: <span class="hljs-string">"Isotretinoin"</span>
}
</code></pre>
<h3 id="heading-improved-performance-benefits">Improved Performance Benefits</h3>
<p>The fine-tuned model shows several improvements:</p>
<ol>
<li><p><strong>Simpler Prompts</strong>: Notice we don't need complex instructions anymore</p>
</li>
<li><p><strong>Consistent Output</strong>: The model maintains the exact JSON structure we trained it for</p>
</li>
<li><p><strong>Faster Processing</strong>: Shorter prompts mean quicker responses</p>
</li>
<li><p><strong>Reliable Extraction</strong>: Accurately pulls out key information even from complex medical text</p>
</li>
</ol>
<h3 id="heading-best-practices-for-using-the-model">Best Practices for Using the Model</h3>
<ol>
<li><strong>Input Consistency</strong>:</li>
</ol>
<pre><code class="lang-python"><span class="hljs-comment"># Keep system message consistent with training</span>
messages=[
    {<span class="hljs-string">"role"</span>: <span class="hljs-string">"system"</span>, <span class="hljs-string">"content"</span>: <span class="hljs-string">"Extract Details from medical report"</span>},
    {<span class="hljs-string">"role"</span>: <span class="hljs-string">"user"</span>, <span class="hljs-string">"content"</span>: medical_report}
]
</code></pre>
<ol start="2">
<li><strong>Error Handling</strong>:</li>
</ol>
<pre><code class="lang-python"><span class="hljs-keyword">try</span>:
    extracted_data = json.loads(result.content)
<span class="hljs-keyword">except</span> json.JSONDecodeError:
    print(<span class="hljs-string">"Error: Invalid JSON response"</span>)
</code></pre>
<ol start="3">
<li><strong>Validation</strong>:</li>
</ol>
<pre><code class="lang-python">required_fields = [<span class="hljs-string">"patient name"</span>, <span class="hljs-string">"age"</span>, <span class="hljs-string">"diagnosis"</span>, <span class="hljs-string">"prescribed medication"</span>]
missing_fields = [field <span class="hljs-keyword">for</span> field <span class="hljs-keyword">in</span> required_fields <span class="hljs-keyword">if</span> field <span class="hljs-keyword">not</span> <span class="hljs-keyword">in</span> extracted_data]
<span class="hljs-keyword">if</span> missing_fields:
    print(<span class="hljs-string">f"Warning: Missing fields: <span class="hljs-subst">{missing_fields}</span>"</span>)
</code></pre>
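<p>The error handling and field validation above can be combined into one helper that either returns the parsed record or reports what went wrong (a sketch; the function name is illustrative and the field list matches the training format used earlier):</p>
<pre><code class="lang-python">import json

REQUIRED_FIELDS = ["patient name", "age", "diagnosis", "prescribed medication"]

def parse_extraction(raw_content):
    """Parse the model's reply; return (data, missing_fields)."""
    try:
        data = json.loads(raw_content)
    except json.JSONDecodeError:
        return None, list(REQUIRED_FIELDS)  # reply was not valid JSON
    if not isinstance(data, dict):
        return None, list(REQUIRED_FIELDS)  # e.g. the model returned a bare string
    missing = [f for f in REQUIRED_FIELDS if f not in data]
    return data, missing
</code></pre>
<p>For example, <code>parse_extraction(result.content)</code> returns the extracted dictionary together with an empty list when every required field is present.</p>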
<h2 id="heading-conclusion-building-custom-ai-solutions-with-fine-tuning">Conclusion: Building Custom AI Solutions with Fine-Tuning</h2>
<h3 id="heading-real-success-stories-from-futuresmart-ai">Real Success Stories from FutureSmart AI</h3>
<p>At FutureSmart AI, we've successfully implemented fine-tuned models for various client requirements. Here are some of our proven use cases:</p>
<h3 id="heading-1-product-description-generation">1. Product Description Generation</h3>
<p>We fine-tuned a model for precision product description generation, achieving significant improvements in:</p>
<ul>
<li><p>Maintaining consistent brand tone</p>
</li>
<li><p>Following specific style guidelines</p>
</li>
<li><p>Generating market-ready content</p>
</li>
</ul>
<h3 id="heading-2-structured-data-extraction">2. Structured Data Extraction</h3>
<p>Our team developed a reliable JSON output model that:</p>
<ul>
<li><p>Consistently extracts structured data</p>
</li>
<li><p>Reduces the need for fallback mechanisms</p>
</li>
<li><p>Maintains high accuracy across diverse inputs</p>
</li>
</ul>
<h2 id="heading-interested-in-learning-more">Interested in Learning More?</h2>
<p><strong>Explore How We Can Help Your Business:</strong></p>
<ul>
<li><p><strong>Case Studies</strong>: Visit <a target="_blank" href="https://www.futuresmart.ai/case-studies">FutureSmart AI Case Studies</a> to see real-world applications of our NLP solutions</p>
</li>
<li><p><strong>Contact Us</strong>: Have a project in mind? Reach out at <a target="_blank" href="mailto:contact@futuresmart.ai">contact@futuresmart.ai</a></p>
</li>
</ul>
<p>Let us help you transform your business with custom AI solutions tailored to your specific needs.</p>
<p><a target="_blank" href="https://github.com/PradipNichite/Youtube-Tutorials/tree/main/Finetune%20Open%20AI%20GPT-4o">Code and Data</a></p>
<div class="embed-wrapper"><div class="embed-loading"><div class="loadingRow"></div><div class="loadingRow"></div></div><a class="embed-card" href="https://youtu.be/jiYqbEDPw7A">https://youtu.be/jiYqbEDPw7A</a></div>
]]></content:encoded></item><item><title><![CDATA[Building a User-Friendly Interface with Streamlit for Our RAG Chatbot]]></title><description><![CDATA[In this section, we'll explore how we've used Streamlit to create an intuitive front-end for our RAG chatbot. This interface allows users to interact with the chatbot, upload documents, and manage their document collection, all while communicating se...]]></description><link>https://blog.futuresmart.ai/building-a-user-friendly-interface-with-streamlit-for-our-rag-chatbot</link><guid isPermaLink="true">https://blog.futuresmart.ai/building-a-user-friendly-interface-with-streamlit-for-our-rag-chatbot</guid><category><![CDATA[streamlit chatbot]]></category><category><![CDATA[streamlit]]></category><category><![CDATA[rag chatbot]]></category><dc:creator><![CDATA[Pradip Nichite]]></dc:creator><pubDate>Mon, 21 Oct 2024 10:36:32 GMT</pubDate><enclosure url="https://cdn.hashnode.com/res/hashnode/image/upload/v1729507030210/cdbfd8c8-6162-479c-8f97-8ffd81e753d1.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>In this section, we'll explore how we've used Streamlit to create an intuitive front-end for our RAG chatbot. This interface allows users to interact with the chatbot, upload documents, and manage their document collection, all while communicating seamlessly with our FastAPI backend.</p>
<h3 id="heading-1-main-application-structure-streamlitapppyhttpapppy">1. Main Application Structure (streamlit_app.py)</h3>
<pre><code class="lang-python"><span class="hljs-keyword">import</span> streamlit <span class="hljs-keyword">as</span> st
<span class="hljs-keyword">from</span> sidebar <span class="hljs-keyword">import</span> display_sidebar
<span class="hljs-keyword">from</span> chat_interface <span class="hljs-keyword">import</span> display_chat_interface

st.title(<span class="hljs-string">"Langchain RAG Chatbot"</span>)

<span class="hljs-comment"># Initialize session state variables</span>
<span class="hljs-keyword">if</span> <span class="hljs-string">"messages"</span> <span class="hljs-keyword">not</span> <span class="hljs-keyword">in</span> st.session_state:
    st.session_state.messages = []

<span class="hljs-keyword">if</span> <span class="hljs-string">"session_id"</span> <span class="hljs-keyword">not</span> <span class="hljs-keyword">in</span> st.session_state:
    st.session_state.session_id = <span class="hljs-literal">None</span>

<span class="hljs-comment"># Display the sidebar</span>
display_sidebar()

<span class="hljs-comment"># Display the chat interface</span>
display_chat_interface()
</code></pre>
<p>This file serves as the entry point for our Streamlit application. Here's what it does:</p>
<ul>
<li><p>Sets up the main title of the application.</p>
</li>
<li><p>Initializes session state variables:</p>
<ul>
<li><p><code>messages</code>: Stores the chat history.</p>
</li>
<li><p><code>session_id</code>: Keeps track of the current chat session.</p>
</li>
</ul>
</li>
<li><p>Calls functions to display the sidebar and chat interface.</p>
</li>
</ul>
<p>The use of <code>st.session_state</code> is crucial here. It allows us to persist data across reruns of the Streamlit app, ensuring that chat history and session information are maintained.</p>
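<p>To see why this matters, here is a plain-Python sketch of Streamlit's execution model (no Streamlit APIs involved; <code>run_script</code> is a hypothetical stand-in for one rerun of the app):</p>

```python
def run_script(session_state):
    """Stand-in for one Streamlit rerun: the whole script executes from
    scratch, so ordinary variables are recreated on every interaction."""
    messages = []  # plain variable: empty again on every rerun
    if "messages" not in session_state:
        session_state["messages"] = []  # initialized only on the first run
    session_state["messages"].append("hello")
    messages.append("hello")
    return len(session_state["messages"]), len(messages)

state = {}  # Streamlit keeps this dict alive between reruns
for _ in range(3):
    persisted, local = run_script(state)

# persisted == 3 (history accumulated), local == 1 (reset every rerun)
```

Without <code>st.session_state</code>, the chat history would behave like <code>messages</code> above and vanish on every button click or message send.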
<h3 id="heading-2-sidebar-functionality-sidebarpyhttpsidebarpy">2. Sidebar Functionality (sidebar.py)</h3>
<p>The sidebar handles document management and model selection:</p>
<pre><code class="lang-python"><span class="hljs-keyword">import</span> streamlit <span class="hljs-keyword">as</span> st
<span class="hljs-keyword">from</span> api_utils <span class="hljs-keyword">import</span> upload_document, list_documents, delete_document

<span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">display_sidebar</span>():</span>
    <span class="hljs-comment"># Model selection</span>
    model_options = [<span class="hljs-string">"gpt-4o"</span>, <span class="hljs-string">"gpt-4o-mini"</span>]
    st.sidebar.selectbox(<span class="hljs-string">"Select Model"</span>, options=model_options, key=<span class="hljs-string">"model"</span>)

    <span class="hljs-comment"># Document upload</span>
    uploaded_file = st.sidebar.file_uploader(<span class="hljs-string">"Choose a file"</span>, type=[<span class="hljs-string">"pdf"</span>, <span class="hljs-string">"docx"</span>, <span class="hljs-string">"html"</span>])
    <span class="hljs-keyword">if</span> uploaded_file <span class="hljs-keyword">and</span> st.sidebar.button(<span class="hljs-string">"Upload"</span>):
        <span class="hljs-keyword">with</span> st.spinner(<span class="hljs-string">"Uploading..."</span>):
            upload_response = upload_document(uploaded_file)
            <span class="hljs-keyword">if</span> upload_response:
                st.sidebar.success(<span class="hljs-string">f"File uploaded successfully with ID <span class="hljs-subst">{upload_response[<span class="hljs-string">'file_id'</span>]}</span>."</span>)
                st.session_state.documents = list_documents()

    <span class="hljs-comment"># List and delete documents</span>
    st.sidebar.header(<span class="hljs-string">"Uploaded Documents"</span>)
    <span class="hljs-keyword">if</span> st.sidebar.button(<span class="hljs-string">"Refresh Document List"</span>):
        st.session_state.documents = list_documents()

    <span class="hljs-comment"># Display document list and delete functionality</span>
    <span class="hljs-keyword">if</span> <span class="hljs-string">"documents"</span> <span class="hljs-keyword">in</span> st.session_state <span class="hljs-keyword">and</span> st.session_state.documents:
        <span class="hljs-keyword">for</span> doc <span class="hljs-keyword">in</span> st.session_state.documents:
            st.sidebar.text(<span class="hljs-string">f"<span class="hljs-subst">{doc[<span class="hljs-string">'filename'</span>]}</span> (ID: <span class="hljs-subst">{doc[<span class="hljs-string">'id'</span>]}</span>)"</span>)

        selected_file_id = st.sidebar.selectbox(<span class="hljs-string">"Select a document to delete"</span>, 
                                                options=[doc[<span class="hljs-string">'id'</span>] <span class="hljs-keyword">for</span> doc <span class="hljs-keyword">in</span> st.session_state.documents])
        <span class="hljs-keyword">if</span> st.sidebar.button(<span class="hljs-string">"Delete Selected Document"</span>):
            delete_response = delete_document(selected_file_id)
            <span class="hljs-keyword">if</span> delete_response:
                st.sidebar.success(<span class="hljs-string">"Document deleted successfully."</span>)
                st.session_state.documents = list_documents()
</code></pre>
<p>Key features:</p>
<ul>
<li><p><strong>Model Selection</strong>: Users can choose between different language models.</p>
</li>
<li><p><strong>Document Upload</strong>: Allows users to upload PDF, DOCX, or HTML files.</p>
</li>
<li><p><strong>Document Listing</strong>: Displays all uploaded documents with their IDs.</p>
</li>
<li><p><strong>Document Deletion</strong>: Provides an option to delete selected documents.</p>
</li>
</ul>
<p>Each action (upload, list, delete) interacts with our FastAPI backend through functions in <code>api_utils.py</code>.</p>
<h3 id="heading-3-chat-interface-chatinterfacepyhttpinterfacepy">3. Chat Interface (chat_interface.py)</h3>
<p>This component handles the main chat interaction:</p>
<pre><code class="lang-python"><span class="hljs-keyword">import</span> streamlit <span class="hljs-keyword">as</span> st
<span class="hljs-keyword">from</span> api_utils <span class="hljs-keyword">import</span> get_api_response

<span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">display_chat_interface</span>():</span>
    <span class="hljs-comment"># Display chat history</span>
    <span class="hljs-keyword">for</span> message <span class="hljs-keyword">in</span> st.session_state.messages:
        <span class="hljs-keyword">with</span> st.chat_message(message[<span class="hljs-string">"role"</span>]):
            st.markdown(message[<span class="hljs-string">"content"</span>])

    <span class="hljs-comment"># Handle new user input</span>
    <span class="hljs-keyword">if</span> prompt := st.chat_input(<span class="hljs-string">"Query:"</span>):
        st.session_state.messages.append({<span class="hljs-string">"role"</span>: <span class="hljs-string">"user"</span>, <span class="hljs-string">"content"</span>: prompt})
        <span class="hljs-keyword">with</span> st.chat_message(<span class="hljs-string">"user"</span>):
            st.markdown(prompt)

        <span class="hljs-comment"># Get API response</span>
        <span class="hljs-keyword">with</span> st.spinner(<span class="hljs-string">"Generating response..."</span>):
            response = get_api_response(prompt, st.session_state.session_id, st.session_state.model)

            <span class="hljs-keyword">if</span> response:
                st.session_state.session_id = response.get(<span class="hljs-string">'session_id'</span>)
                st.session_state.messages.append({<span class="hljs-string">"role"</span>: <span class="hljs-string">"assistant"</span>, <span class="hljs-string">"content"</span>: response[<span class="hljs-string">'answer'</span>]})

                <span class="hljs-keyword">with</span> st.chat_message(<span class="hljs-string">"assistant"</span>):
                    st.markdown(response[<span class="hljs-string">'answer'</span>])

                <span class="hljs-keyword">with</span> st.expander(<span class="hljs-string">"Details"</span>):
                    st.subheader(<span class="hljs-string">"Generated Answer"</span>)
                    st.code(response[<span class="hljs-string">'answer'</span>])
                    st.subheader(<span class="hljs-string">"Model Used"</span>)
                    st.code(response[<span class="hljs-string">'model'</span>])
                    st.subheader(<span class="hljs-string">"Session ID"</span>)
                    st.code(response[<span class="hljs-string">'session_id'</span>])
            <span class="hljs-keyword">else</span>:
                st.error(<span class="hljs-string">"Failed to get a response from the API. Please try again."</span>)
</code></pre>
<p>Key features:</p>
<ul>
<li><p>Displays the entire chat history.</p>
</li>
<li><p>Handles new user inputs and sends them to the API.</p>
</li>
<li><p>Shows the chatbot's responses in a chat-like interface.</p>
</li>
<li><p>Provides an expandable section with additional details about each response.</p>
</li>
</ul>
<p>The chat interface maintains continuity by using the <code>session_id</code> returned from the API, allowing for context-aware conversations.</p>
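<p>The mechanics of that continuity can be sketched without running a server. Here <code>fake_chat_endpoint</code> is a hypothetical stand-in for the <code>/chat</code> endpoint: it mints a session ID on the first call and echoes it back on later ones:</p>

```python
import uuid
from typing import Optional

def fake_chat_endpoint(question, session_id):
    # Stand-in for POST /chat: mint an id on the first request of a session.
    if session_id is None:
        session_id = str(uuid.uuid4())
    return {"answer": f"echo: {question}", "session_id": session_id}

# Client side, mirroring display_chat_interface():
session_id = None                         # st.session_state.session_id starts as None
first = fake_chat_endpoint("hello", session_id)
session_id = first["session_id"]          # store the id the API returns
second = fake_chat_endpoint("and then?", session_id)

# Both turns now share one session id, so the backend can load the same history.
```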
<h3 id="heading-4-api-utilities-apiutilspyhttputilspy">4. API Utilities (api_utils.py)</h3>
<p>This file contains functions for interacting with our FastAPI backend:</p>
<pre><code class="lang-python"><span class="hljs-keyword">import</span> requests
<span class="hljs-keyword">import</span> streamlit <span class="hljs-keyword">as</span> st

<span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">get_api_response</span>(<span class="hljs-params">question, session_id, model</span>):</span>
    headers = {<span class="hljs-string">'accept'</span>: <span class="hljs-string">'application/json'</span>, <span class="hljs-string">'Content-Type'</span>: <span class="hljs-string">'application/json'</span>}
    data = {<span class="hljs-string">"question"</span>: question, <span class="hljs-string">"model"</span>: model}
    <span class="hljs-keyword">if</span> session_id:
        data[<span class="hljs-string">"session_id"</span>] = session_id

    <span class="hljs-keyword">try</span>:
        response = requests.post(<span class="hljs-string">"http://localhost:8000/chat"</span>, headers=headers, json=data)
        <span class="hljs-keyword">if</span> response.status_code == <span class="hljs-number">200</span>:
            <span class="hljs-keyword">return</span> response.json()
        <span class="hljs-keyword">else</span>:
            st.error(<span class="hljs-string">f"API request failed with status code <span class="hljs-subst">{response.status_code}</span>: <span class="hljs-subst">{response.text}</span>"</span>)
            <span class="hljs-keyword">return</span> <span class="hljs-literal">None</span>
    <span class="hljs-keyword">except</span> Exception <span class="hljs-keyword">as</span> e:
        st.error(<span class="hljs-string">f"An error occurred: <span class="hljs-subst">{str(e)}</span>"</span>)
        <span class="hljs-keyword">return</span> <span class="hljs-literal">None</span>

<span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">upload_document</span>(<span class="hljs-params">file</span>):</span>
    <span class="hljs-keyword">try</span>:
        files = {<span class="hljs-string">"file"</span>: (file.name, file, file.type)}
        response = requests.post(<span class="hljs-string">"http://localhost:8000/upload-doc"</span>, files=files)
        <span class="hljs-keyword">if</span> response.status_code == <span class="hljs-number">200</span>:
            <span class="hljs-keyword">return</span> response.json()
        <span class="hljs-keyword">else</span>:
            st.error(<span class="hljs-string">f"Failed to upload file. Error: <span class="hljs-subst">{response.status_code}</span> - <span class="hljs-subst">{response.text}</span>"</span>)
            <span class="hljs-keyword">return</span> <span class="hljs-literal">None</span>
    <span class="hljs-keyword">except</span> Exception <span class="hljs-keyword">as</span> e:
        st.error(<span class="hljs-string">f"An error occurred while uploading the file: <span class="hljs-subst">{str(e)}</span>"</span>)
        <span class="hljs-keyword">return</span> <span class="hljs-literal">None</span>

<span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">list_documents</span>():</span>
    <span class="hljs-keyword">try</span>:
        response = requests.get(<span class="hljs-string">"http://localhost:8000/list-docs"</span>)
        <span class="hljs-keyword">if</span> response.status_code == <span class="hljs-number">200</span>:
            <span class="hljs-keyword">return</span> response.json()
        <span class="hljs-keyword">else</span>:
            st.error(<span class="hljs-string">f"Failed to fetch document list. Error: <span class="hljs-subst">{response.status_code}</span> - <span class="hljs-subst">{response.text}</span>"</span>)
            <span class="hljs-keyword">return</span> []
    <span class="hljs-keyword">except</span> Exception <span class="hljs-keyword">as</span> e:
        st.error(<span class="hljs-string">f"An error occurred while fetching the document list: <span class="hljs-subst">{str(e)}</span>"</span>)
        <span class="hljs-keyword">return</span> []

<span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">delete_document</span>(<span class="hljs-params">file_id</span>):</span>
    headers = {<span class="hljs-string">'accept'</span>: <span class="hljs-string">'application/json'</span>, <span class="hljs-string">'Content-Type'</span>: <span class="hljs-string">'application/json'</span>}
    data = {<span class="hljs-string">"file_id"</span>: file_id}

    <span class="hljs-keyword">try</span>:
        response = requests.post(<span class="hljs-string">"http://localhost:8000/delete-doc"</span>, headers=headers, json=data)
        <span class="hljs-keyword">if</span> response.status_code == <span class="hljs-number">200</span>:
            <span class="hljs-keyword">return</span> response.json()
        <span class="hljs-keyword">else</span>:
            st.error(<span class="hljs-string">f"Failed to delete document. Error: <span class="hljs-subst">{response.status_code}</span> - <span class="hljs-subst">{response.text}</span>"</span>)
            <span class="hljs-keyword">return</span> <span class="hljs-literal">None</span>
    <span class="hljs-keyword">except</span> Exception <span class="hljs-keyword">as</span> e:
        st.error(<span class="hljs-string">f"An error occurred while deleting the document: <span class="hljs-subst">{str(e)}</span>"</span>)
        <span class="hljs-keyword">return</span> <span class="hljs-literal">None</span>
</code></pre>
<p>These functions handle all communication with our FastAPI backend:</p>
<ul>
<li><p><code>get_api_response</code>: Sends chat queries and receives responses.</p>
</li>
<li><p><code>upload_document</code>: Handles file uploads to the backend.</p>
</li>
<li><p><code>list_documents</code>: Retrieves the list of uploaded documents.</p>
</li>
<li><p><code>delete_document</code>: Sends requests to delete specific documents.</p>
</li>
</ul>
<p>Each function includes error handling and user feedback through Streamlit's <code>st.error()</code> function.</p>
<h3 id="heading-integration-and-data-flow">Integration and Data Flow</h3>
<ol>
<li><p><strong>User Interaction</strong>:</p>
<ul>
<li>Users interact with the Streamlit interface, entering queries or managing documents.</li>
</ul>
</li>
<li><p><strong>Streamlit to FastAPI</strong>:</p>
<ul>
<li><p>User actions trigger API calls to our FastAPI backend.</p>
</li>
<li><p>For example, when a user sends a chat message, <code>get_api_response()</code> is called, which sends a POST request to the <code>/chat</code> endpoint.</p>
</li>
</ul>
</li>
<li><p><strong>FastAPI Processing</strong>:</p>
<ul>
<li>The backend processes these requests, interacting with the RAG system, database, and vector store as needed.</li>
</ul>
</li>
<li><p><strong>Response Handling</strong>:</p>
<ul>
<li><p>Responses from the API are processed and displayed in the Streamlit interface.</p>
</li>
<li><p>For chat, responses are added to the chat history and displayed.</p>
</li>
<li><p>For document operations, success messages or error notifications are shown.</p>
</li>
</ul>
</li>
<li><p><strong>State Management</strong>:</p>
<ul>
<li>Streamlit's session state (<code>st.session_state</code>) is used to maintain chat history and current session information across interactions.</li>
</ul>
</li>
</ol>
<h3 id="heading-key-benefits-of-this-implementation">Key Benefits of This Implementation</h3>
<ol>
<li><p><strong>User-Friendly Interface</strong>: Streamlit provides an intuitive, interactive interface for our RAG chatbot.</p>
</li>
<li><p><strong>Real-Time Interaction</strong>: Users can chat, upload documents, and manage their document collection in real-time.</p>
</li>
<li><p><strong>Seamless Integration</strong>: The Streamlit frontend integrates smoothly with our FastAPI backend.</p>
</li>
<li><p><strong>Stateful Conversations</strong>: The use of session IDs allows for context-aware, stateful conversations.</p>
</li>
<li><p><strong>Flexible Document Management</strong>: Users can easily upload, view, and delete documents, enhancing the RAG system's utility.</p>
</li>
</ol>
<h3 id="heading-conclusion">Conclusion</h3>
<p>This Streamlit implementation creates a user-friendly front-end for our RAG chatbot, effectively bridging the gap between the sophisticated backend we built and the end-user. It demonstrates how Streamlit can be used to rapidly develop interactive web applications that interface with complex AI systems.</p>
<p>The modular structure and clear separation of concerns between the frontend and backend allow for easy maintenance and future enhancements. As the application grows, you can easily add new features, improve the user interface, and scale the backend to handle more users and larger document collections.</p>
<p>This implementation completes our RAG chatbot project, providing an accessible interface for users to leverage the power of retrieval-augmented generation in their interactions with the AI model.</p>
<hr />
<p>In this series, we've walked through the process of building a production-ready Retrieval-Augmented Generation (RAG) chatbot using FastAPI, LangChain, and Streamlit. We've covered everything from the core concepts of RAG systems to implementing a robust backend API and creating an intuitive user interface.</p>
<p>Here's a quick recap of what we've accomplished:</p>
<ol>
<li><p>Built a RAG system using LangChain, integrating language models with document retrieval.</p>
</li>
<li><p>Developed a FastAPI backend to handle chat interactions and document management.</p>
</li>
<li><p>Created a user-friendly Streamlit frontend for seamless interaction with our RAG chatbot.</p>
</li>
</ol>
<p>This project demonstrates the power of combining modern AI technologies with web development frameworks to create sophisticated, yet accessible applications.</p>
<h3 id="heading-additional-resources">Additional Resources</h3>
<p>To help you further understand and implement this RAG chatbot system, I've prepared some additional resources:</p>
<ol>
<li><p><strong>Video Tutorial</strong>: For a comprehensive walkthrough of this entire project, including live coding and explanations, check out my YouTube video:</p>
<p> <a target="_blank" href="https://youtu.be/38aMTXY2usU?si=9XsjXHvtAHbSeP32">Watch the Full RAG Chatbot Tutorial</a></p>
<p> In this video, I cover all three parts of our blog series, demonstrating the implementation details and providing additional insights.</p>
</li>
<li><p><strong>GitHub Repository</strong>: The complete source code for this project is available on GitHub. You can clone, fork, or download the repository to explore the code in detail or use it as a starting point for your own projects:</p>
<p> <a target="_blank" href="https://github.com/PradipNichite/Youtube-Tutorials/tree/main/Langchain%20RAG%20Course%202024">RAG Chatbot GitHub Repository</a></p>
<p> The repository includes all the components we've discussed: the FastAPI backend, Streamlit frontend, and associated utilities.</p>
</li>
</ol>
<p>Thank you for following along with this series. I hope you've found it informative and inspiring for your own AI development journey. Happy coding!</p>
<h3 id="heading-futuresmart-ai-your-partner-in-custom-nlp-solutions">FutureSmart AI: Your Partner in Custom NLP Solutions</h3>
<p>At FutureSmart AI, we specialize in building custom Natural Language Processing (NLP) solutions tailored to your specific needs. Our expertise extends beyond RAG systems to include:</p>
<ul>
<li><p>Natural Language to SQL (NL2SQL) interfaces</p>
</li>
<li><p>Advanced document parsing and analysis</p>
</li>
<li><p>Custom chatbots and conversational AI</p>
</li>
<li><p>And much more in the realm of NLP and AI</p>
</li>
</ul>
<p>We've successfully implemented these technologies for various industries, helping businesses leverage the power of AI to enhance their operations and user experiences.</p>
<h4 id="heading-interested-in-learning-more">Interested in Learning More?</h4>
<ul>
<li><p><strong>Check out our case studies</strong>: <a target="_blank" href="https://www.futuresmart.ai/case-studies">FutureSmart AI Case Studies</a> Explore real-world applications of our NLP solutions and see how they've transformed businesses across different sectors.</p>
</li>
<li><p><strong>Get in touch</strong>: Have a project in mind or want to discuss how NLP can benefit your business? We'd love to hear from you! Contact us at <a target="_blank" href="mailto:contact@futuresmart.ai">contact@futuresmart.ai</a></p>
</li>
</ul>
<p>Whether you're looking to implement a RAG system like the one we've built in this tutorial, or you have more specific NLP needs, our team at FutureSmart AI is here to help turn your AI aspirations into reality.</p>
]]></content:encoded></item><item><title><![CDATA[Building a Production-Ready RAG Chatbot with FastAPI and LangChain]]></title><description><![CDATA[Introduction
In our previous article, we delved into the fundamentals of Retrieval-Augmented Generation (RAG) using LangChain. We explored the core concepts, built a basic RAG system, and demonstrated its capabilities in a Jupyter notebook environmen...]]></description><link>https://blog.futuresmart.ai/building-a-production-ready-rag-chatbot-with-fastapi-and-langchain</link><guid isPermaLink="true">https://blog.futuresmart.ai/building-a-production-ready-rag-chatbot-with-fastapi-and-langchain</guid><category><![CDATA[langchain fastapi]]></category><category><![CDATA[rag chatbot]]></category><category><![CDATA[langchain rag]]></category><category><![CDATA[langchain]]></category><dc:creator><![CDATA[Pradip Nichite]]></dc:creator><pubDate>Mon, 21 Oct 2024 10:34:36 GMT</pubDate><enclosure url="https://cdn.hashnode.com/res/hashnode/image/upload/v1729506787165/fd17ead0-cc1d-42fb-ba0e-67274929b421.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h2 id="heading-introduction">Introduction</h2>
<p>In our previous article, we delved into the fundamentals of Retrieval-Augmented Generation (RAG) using LangChain. We explored the core concepts, built a basic RAG system, and demonstrated its capabilities in a Jupyter notebook environment. While this approach is excellent for prototyping and understanding the underlying mechanics, it's not quite ready for real-world applications.</p>
<p>Today, we're taking the next crucial step: transforming our RAG prototype into a production-ready API. We'll be using FastAPI, a modern, fast (high-performance) web framework for building APIs with Python. FastAPI is particularly well-suited for our needs due to its speed, ease of use, and built-in support for asynchronous programming.</p>
<h3 id="heading-what-well-build">What We'll Build</h3>
<p>In this tutorial, we'll create a robust API that offers the following functionalities:</p>
<ol>
<li><p>A chat endpoint that processes queries using our RAG system</p>
</li>
<li><p>Document upload and indexing capabilities</p>
</li>
<li><p>Ability to list and delete indexed documents</p>
</li>
<li><p>Proper error handling and logging</p>
</li>
</ol>
<p>We'll structure our application in a modular, maintainable way, making it easy to extend and deploy in a production environment.</p>
<h3 id="heading-prerequisites">Prerequisites</h3>
<p>Before we dive in, make sure you have the following:</p>
<ul>
<li><p>Basic understanding of Python and asynchronous programming</p>
</li>
<li><p>Familiarity with RESTful APIs</p>
</li>
<li><p>Knowledge of RAG systems and LangChain (covered in Part 1 of this series)</p>
</li>
<li><p>Python 3.8+ installed on your system</p>
</li>
<li><p>pip for installing required packages</p>
</li>
</ul>
<h2 id="heading-project-setup">Project Setup</h2>
<p>First, let's set up our project environment. Create a new directory for your project and navigate into it:</p>
<pre><code class="lang-bash">mkdir rag-fastapi-project
<span class="hljs-built_in">cd</span> rag-fastapi-project
</code></pre>
<p>Now, let's install the necessary packages. Create a <code>requirements.txt</code> file with the following content:</p>
<pre><code class="lang-plaintext">langchain
langchain-openai
langchain-core
langchain_community
docx2txt
pypdf
langchain_chroma
python-multipart
fastapi
uvicorn
</code></pre>
<p>Install these packages using pip:</p>
<pre><code class="lang-bash">pip install -r requirements.txt
</code></pre>
<p>With our environment set up, we're ready to start building our production-ready RAG chatbot API. In the next section, we'll dive into the project structure and begin implementing our FastAPI application.</p>
<hr />
<h2 id="heading-project-structure-overview">Project Structure Overview</h2>
<p>When transitioning from a prototype to a production-ready application, proper code organization becomes crucial. A well-structured project is easier to maintain, test, and extend. For our RAG chatbot API, we'll use a modular structure that separates concerns and promotes code reusability.</p>
<p>Here's an overview of our project structure:</p>
<pre><code class="lang-plaintext">rag-fastapi-project/
│
├── main.py
├── chroma_utils.py
├── db_utils.py
├── langchain_utils.py
├── pydantic_models.py
├── requirements.txt
└── chroma_db/  (directory for Chroma persistence)
</code></pre>
<p>Let's break down the purpose of each file:</p>
<ol>
<li><p><code>main.py</code>: This is the entry point of our FastAPI application. It defines the API routes and orchestrates the different components of our system.</p>
</li>
<li><p><code>chroma_utils.py</code>: Contains utilities for interacting with the Chroma vector store, including functions for indexing documents and performing similarity searches.</p>
</li>
<li><p><code>db_utils.py</code>: Handles database operations, including storing and retrieving chat history and document metadata.</p>
</li>
<li><p><code>langchain_utils.py</code>: Encapsulates the LangChain-specific logic, such as creating the RAG chain and configuring the language model.</p>
</li>
<li><p><code>pydantic_models.py</code>: Defines Pydantic models for request and response validation, ensuring type safety and clear API contracts.</p>
</li>
<li><p><code>requirements.txt</code>: Lists all the Python packages required for the project.</p>
</li>
</ol>
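<p>To make the role of <code>pydantic_models.py</code> concrete, here is one plausible shape for the models that <code>main.py</code> imports. The field names are inferred from the request payloads the Streamlit frontend sends; the actual definitions in the repository may differ:</p>

```python
from typing import Optional
from pydantic import BaseModel

class QueryInput(BaseModel):
    question: str
    session_id: Optional[str] = None  # omitted on the first turn of a chat
    model: str = "gpt-4o-mini"

class QueryResponse(BaseModel):
    answer: str
    session_id: str
    model: str

# Validation happens at construction time: a missing required field raises
# a ValidationError instead of failing deep inside the RAG chain.
query = QueryInput(question="What does FutureSmart AI build?")
```

FastAPI uses these models to validate incoming JSON automatically and to document the API contract in the generated OpenAPI schema.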
<h3 id="heading-benefits-of-this-structure">Benefits of This Structure</h3>
<ol>
<li><p><strong>Separation of Concerns</strong>: Each file has a specific responsibility, making the code easier to understand and maintain.</p>
</li>
<li><p><strong>Modularity</strong>: Components can be developed and tested independently, facilitating collaboration and reducing the risk of conflicts.</p>
</li>
<li><p><strong>Scalability</strong>: As the project grows, new functionalities can be added by introducing new modules without significantly altering existing code.</p>
</li>
<li><p><strong>Reusability</strong>: Utility functions and models can be easily reused across different parts of the application.</p>
</li>
<li><p><strong>Readability</strong>: With clear file names and separated concerns, new developers can quickly understand the project structure and locate specific functionalities.</p>
</li>
</ol>
<p>This structure follows best practices for FastAPI applications and provides a solid foundation for building our RAG chatbot API. As we progress through the tutorial, we'll dive into each of these files, explaining their contents and how they work together to create our production-ready system.</p>
<h2 id="heading-setting-up-the-fastapi-application">Setting Up the FastAPI Application</h2>
<p>The <code>main.py</code> file is the core of our FastAPI application. It defines our API endpoints and orchestrates the interaction between different components of our system. Let's break down the key elements of this file:</p>
<pre><code class="lang-python"><span class="hljs-keyword">from</span> fastapi <span class="hljs-keyword">import</span> FastAPI, File, UploadFile, HTTPException
<span class="hljs-keyword">from</span> pydantic_models <span class="hljs-keyword">import</span> QueryInput, QueryResponse, DocumentInfo, DeleteFileRequest
<span class="hljs-keyword">from</span> langchain_utils <span class="hljs-keyword">import</span> get_rag_chain
<span class="hljs-keyword">from</span> db_utils <span class="hljs-keyword">import</span> insert_application_logs, get_chat_history, get_all_documents, insert_document_record, delete_document_record
<span class="hljs-keyword">from</span> chroma_utils <span class="hljs-keyword">import</span> index_document_to_chroma, delete_doc_from_chroma
<span class="hljs-keyword">import</span> os
<span class="hljs-keyword">import</span> uuid
<span class="hljs-keyword">import</span> logging
<span class="hljs-keyword">import</span> shutil

<span class="hljs-comment"># Set up logging</span>
logging.basicConfig(filename=<span class="hljs-string">'app.log'</span>, level=logging.INFO)

<span class="hljs-comment"># Initialize FastAPI app</span>
app = FastAPI()
</code></pre>
<p>Here, we import necessary modules and initialize our FastAPI application. We've also set up basic logging to keep track of important events in our application.</p>
<h3 id="heading-defining-api-endpoints">Defining API Endpoints</h3>
<p>Now, let's look at our main API endpoints:</p>
<ol>
<li><strong>Chat Endpoint</strong>:</li>
</ol>
<pre><code class="lang-python"><span class="hljs-meta">@app.post("/chat", response_model=QueryResponse)</span>
<span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">chat</span>(<span class="hljs-params">query_input: QueryInput</span>):</span>
    session_id = query_input.session_id <span class="hljs-keyword">or</span> str(uuid.uuid4())
    logging.info(<span class="hljs-string">f"Session ID: <span class="hljs-subst">{session_id}</span>, User Query: <span class="hljs-subst">{query_input.question}</span>, Model: <span class="hljs-subst">{query_input.model.value}</span>"</span>)

    chat_history = get_chat_history(session_id)
    rag_chain = get_rag_chain(query_input.model.value)
    answer = rag_chain.invoke({
        <span class="hljs-string">"input"</span>: query_input.question,
        <span class="hljs-string">"chat_history"</span>: chat_history
    })[<span class="hljs-string">'answer'</span>]

    insert_application_logs(session_id, query_input.question, answer, query_input.model.value)
    logging.info(<span class="hljs-string">f"Session ID: <span class="hljs-subst">{session_id}</span>, AI Response: <span class="hljs-subst">{answer}</span>"</span>)
    <span class="hljs-keyword">return</span> QueryResponse(answer=answer, session_id=session_id, model=query_input.model)
</code></pre>
<p>This endpoint handles chat interactions. It generates a session ID if not provided, retrieves chat history, invokes the RAG chain to generate a response, logs the interaction, and returns the response.</p>
<ol start="2">
<li><strong>Document Upload Endpoint:</strong></li>
</ol>
<pre><code class="lang-python"><span class="hljs-meta">@app.post("/upload-doc")</span>
<span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">upload_and_index_document</span>(<span class="hljs-params">file: UploadFile = File(<span class="hljs-params">...</span>)</span>):</span>
    allowed_extensions = [<span class="hljs-string">'.pdf'</span>, <span class="hljs-string">'.docx'</span>, <span class="hljs-string">'.html'</span>]
    file_extension = os.path.splitext(file.filename)[<span class="hljs-number">1</span>].lower()

    <span class="hljs-keyword">if</span> file_extension <span class="hljs-keyword">not</span> <span class="hljs-keyword">in</span> allowed_extensions:
        <span class="hljs-keyword">raise</span> HTTPException(status_code=<span class="hljs-number">400</span>, detail=<span class="hljs-string">f"Unsupported file type. Allowed types are: <span class="hljs-subst">{<span class="hljs-string">', '</span>.join(allowed_extensions)}</span>"</span>)

    temp_file_path = <span class="hljs-string">f"temp_<span class="hljs-subst">{file.filename}</span>"</span>

    <span class="hljs-keyword">try</span>:
        <span class="hljs-comment"># Save the uploaded file to a temporary file</span>
        <span class="hljs-keyword">with</span> open(temp_file_path, <span class="hljs-string">"wb"</span>) <span class="hljs-keyword">as</span> buffer:
            shutil.copyfileobj(file.file, buffer)

        file_id = insert_document_record(file.filename)
        success = index_document_to_chroma(temp_file_path, file_id)

        <span class="hljs-keyword">if</span> success:
            <span class="hljs-keyword">return</span> {<span class="hljs-string">"message"</span>: <span class="hljs-string">f"File <span class="hljs-subst">{file.filename}</span> has been successfully uploaded and indexed."</span>, <span class="hljs-string">"file_id"</span>: file_id}
        <span class="hljs-keyword">else</span>:
            delete_document_record(file_id)
            <span class="hljs-keyword">raise</span> HTTPException(status_code=<span class="hljs-number">500</span>, detail=<span class="hljs-string">f"Failed to index <span class="hljs-subst">{file.filename}</span>."</span>)
    <span class="hljs-keyword">finally</span>:
        <span class="hljs-keyword">if</span> os.path.exists(temp_file_path):
            os.remove(temp_file_path)
</code></pre>
<p>This endpoint handles document uploads. It validates the file type, saves the file to a temporary location, records it in the database, and indexes it in Chroma; if indexing fails, the database record is rolled back and a 500 error is returned. The temporary file is removed in the <code>finally</code> block either way.</p>
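<p>The extension check can be exercised on its own. Below is a standalone sketch that mirrors the endpoint's validation logic (the function name is ours, not part of the codebase):</p>

```python
import os

ALLOWED_EXTENSIONS = ['.pdf', '.docx', '.html']

def validate_extension(filename: str) -> bool:
    # Mirror the endpoint's check: compare the lowercased extension to the allow-list
    file_extension = os.path.splitext(filename)[1].lower()
    return file_extension in ALLOWED_EXTENSIONS

print(validate_extension("report.PDF"))   # True: the comparison is case-insensitive
print(validate_extension("notes.txt"))    # False: .txt is not in the allow-list
```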
<ol start="3">
<li><strong>List Documents Endpoint:</strong></li>
</ol>
<pre><code class="lang-python"><span class="hljs-meta">@app.get("/list-docs", response_model=list[DocumentInfo])</span>
<span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">list_documents</span>():</span>
    <span class="hljs-keyword">return</span> get_all_documents()
</code></pre>
<p>This simple endpoint returns a list of all indexed documents.</p>
<ol start="4">
<li><strong>Delete Document Endpoint:</strong></li>
</ol>
<pre><code class="lang-python"><span class="hljs-meta">@app.post("/delete-doc")</span>
<span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">delete_document</span>(<span class="hljs-params">request: DeleteFileRequest</span>):</span>
    chroma_delete_success = delete_doc_from_chroma(request.file_id)

    <span class="hljs-keyword">if</span> chroma_delete_success:
        db_delete_success = delete_document_record(request.file_id)
        <span class="hljs-keyword">if</span> db_delete_success:
            <span class="hljs-keyword">return</span> {<span class="hljs-string">"message"</span>: <span class="hljs-string">f"Successfully deleted document with file_id <span class="hljs-subst">{request.file_id}</span> from the system."</span>}
        <span class="hljs-keyword">else</span>:
            <span class="hljs-keyword">return</span> {<span class="hljs-string">"error"</span>: <span class="hljs-string">f"Deleted from Chroma but failed to delete document with file_id <span class="hljs-subst">{request.file_id}</span> from the database."</span>}
    <span class="hljs-keyword">else</span>:
        <span class="hljs-keyword">return</span> {<span class="hljs-string">"error"</span>: <span class="hljs-string">f"Failed to delete document with file_id <span class="hljs-subst">{request.file_id}</span> from Chroma."</span>}
</code></pre>
<p>This endpoint handles document deletion, removing the document from both Chroma and the database.</p>
<h2 id="heading-data-models-with-pydantic">Data Models with Pydantic</h2>
<p>Pydantic is a data validation library that uses Python type annotations to define data schemas. In our FastAPI application, we use Pydantic models to define the structure of our request and response data. Let's break down the models defined in <code>pydantic_models.py</code>:</p>
<pre><code class="lang-python"><span class="hljs-keyword">from</span> pydantic <span class="hljs-keyword">import</span> BaseModel, Field
<span class="hljs-keyword">from</span> enum <span class="hljs-keyword">import</span> Enum
<span class="hljs-keyword">from</span> datetime <span class="hljs-keyword">import</span> datetime

<span class="hljs-class"><span class="hljs-keyword">class</span> <span class="hljs-title">ModelName</span>(<span class="hljs-params">str, Enum</span>):</span>
    GPT4_O = <span class="hljs-string">"gpt-4o"</span>
    GPT4_O_MINI = <span class="hljs-string">"gpt-4o-mini"</span>

<span class="hljs-class"><span class="hljs-keyword">class</span> <span class="hljs-title">QueryInput</span>(<span class="hljs-params">BaseModel</span>):</span>
    question: str
    session_id: str | <span class="hljs-literal">None</span> = Field(default=<span class="hljs-literal">None</span>)
    model: ModelName = Field(default=ModelName.GPT4_O_MINI)

<span class="hljs-class"><span class="hljs-keyword">class</span> <span class="hljs-title">QueryResponse</span>(<span class="hljs-params">BaseModel</span>):</span>
    answer: str
    session_id: str
    model: ModelName

<span class="hljs-class"><span class="hljs-keyword">class</span> <span class="hljs-title">DocumentInfo</span>(<span class="hljs-params">BaseModel</span>):</span>
    id: int
    filename: str
    upload_timestamp: datetime

<span class="hljs-class"><span class="hljs-keyword">class</span> <span class="hljs-title">DeleteFileRequest</span>(<span class="hljs-params">BaseModel</span>):</span>
    file_id: int
</code></pre>
<p>Let's examine each model and its purpose:</p>
<ol>
<li><p><code>ModelName</code> (Enum):</p>
<ul>
<li><p>This enum defines the available language models for our RAG system.</p>
</li>
<li><p>Using an enum ensures that only valid model names can be used.</p>
</li>
</ul>
</li>
<li><p><code>QueryInput</code>:</p>
<ul>
<li><p>Represents the input for a chat query.</p>
</li>
<li><p><code>question</code>: The user's question (required).</p>
</li>
<li><p><code>session_id</code>: Optional session ID. If not provided, one will be generated.</p>
</li>
<li><p><code>model</code>: The language model to use, defaulting to GPT4_O_MINI.</p>
</li>
</ul>
</li>
<li><p><code>QueryResponse</code>:</p>
<ul>
<li><p>Represents the response to a chat query.</p>
</li>
<li><p><code>answer</code>: The generated answer.</p>
</li>
<li><p><code>session_id</code>: The session ID (useful for continuing conversations).</p>
</li>
<li><p><code>model</code>: The model used to generate the response.</p>
</li>
</ul>
</li>
<li><p><code>DocumentInfo</code>:</p>
<ul>
<li><p>Represents metadata about an indexed document.</p>
</li>
<li><p><code>id</code>: Unique identifier for the document.</p>
</li>
<li><p><code>filename</code>: Name of the uploaded file.</p>
</li>
<li><p><code>upload_timestamp</code>: When the document was uploaded and indexed.</p>
</li>
</ul>
</li>
<li><p><code>DeleteFileRequest</code>:</p>
<ul>
<li><p>Represents a request to delete a document.</p>
</li>
<li><p><code>file_id</code>: The ID of the document to be deleted.</p>
</li>
</ul>
</li>
</ol>
<h3 id="heading-using-pydantic-models-in-fastapi">Using Pydantic Models in FastAPI</h3>
<p>In our <code>main.py</code>, we use these models to define the shape of our request and response data. For example:</p>
<pre><code class="lang-python"><span class="hljs-meta">@app.post("/chat", response_model=QueryResponse)</span>
<span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">chat</span>(<span class="hljs-params">query_input: QueryInput</span>):</span>
    <span class="hljs-comment"># Function implementation</span>
</code></pre>
<p>Here, FastAPI uses <code>QueryInput</code> to validate the incoming request data and <code>QueryResponse</code> to validate and serialize the response. This ensures that our API behaves consistently and provides clear error messages when invalid data is provided.</p>
<h3 id="heading-extensibility">Extensibility</h3>
<p>As our API grows, we can easily extend these models. For instance, if we want to add more metadata to our document info, we can simply add fields to the <code>DocumentInfo</code> model:</p>
<pre><code class="lang-python"><span class="hljs-class"><span class="hljs-keyword">class</span> <span class="hljs-title">DocumentInfo</span>(<span class="hljs-params">BaseModel</span>):</span>
    id: int
    filename: str
    upload_timestamp: datetime
    file_size: int  <span class="hljs-comment"># New field</span>
    content_type: str  <span class="hljs-comment"># New field</span>
</code></pre>
<p>FastAPI and Pydantic will automatically handle the new fields, providing validation and documentation without any changes to our endpoint logic.</p>
<p>By using Pydantic models, we've created a robust foundation for our API, ensuring data integrity and providing clear contracts for our endpoints. This approach significantly reduces the amount of manual validation code we need to write and helps prevent bugs related to incorrect data handling.</p>
<h2 id="heading-managing-documents-and-chat-history">Managing Documents and Chat History</h2>
<p>The <code>db_utils.py</code> file contains functions for interacting with our SQLite database. We use SQLite for its simplicity and ease of setup, making it perfect for prototyping and small to medium-scale applications. Let's break down the key components of this file:</p>
<pre><code class="lang-python"><span class="hljs-keyword">import</span> sqlite3
<span class="hljs-keyword">from</span> datetime <span class="hljs-keyword">import</span> datetime

DB_NAME = <span class="hljs-string">"rag_app.db"</span>

<span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">get_db_connection</span>():</span>
    conn = sqlite3.connect(DB_NAME)
    conn.row_factory = sqlite3.Row
    <span class="hljs-keyword">return</span> conn
</code></pre>
<p>We start by importing the necessary modules and defining our database name. The <code>get_db_connection()</code> function creates a connection to our SQLite database, setting the row factory to <code>sqlite3.Row</code> for easier data access.</p>
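<p>Setting the row factory is what lets the functions further down index rows by column name (e.g. <code>row['user_query']</code>) rather than by position. A quick self-contained demonstration against an in-memory database:</p>

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.row_factory = sqlite3.Row  # rows now behave like dicts keyed by column name

conn.execute("CREATE TABLE logs (session_id TEXT, user_query TEXT)")
conn.execute("INSERT INTO logs VALUES ('s1', 'What is RAG?')")

row = conn.execute("SELECT session_id, user_query FROM logs").fetchone()
print(row["user_query"])  # access by name instead of row[1]: What is RAG?
conn.close()
```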
<h3 id="heading-creating-database-tables">Creating Database Tables</h3>
<pre><code class="lang-python"><span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">create_application_logs</span>():</span>
    conn = get_db_connection()
    conn.execute(<span class="hljs-string">'''CREATE TABLE IF NOT EXISTS application_logs
                    (id INTEGER PRIMARY KEY AUTOINCREMENT,
                     session_id TEXT,
                     user_query TEXT,
                     gpt_response TEXT,
                     model TEXT,
                     created_at TIMESTAMP DEFAULT CURRENT_TIMESTAMP)'''</span>)
    conn.close()

<span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">create_document_store</span>():</span>
    conn = get_db_connection()
    conn.execute(<span class="hljs-string">'''CREATE TABLE IF NOT EXISTS document_store
                    (id INTEGER PRIMARY KEY AUTOINCREMENT,
                     filename TEXT,
                     upload_timestamp TIMESTAMP DEFAULT CURRENT_TIMESTAMP)'''</span>)
    conn.close()
</code></pre>
<p>These functions create our two main tables:</p>
<ol>
<li><p><code>application_logs</code>: Stores chat history and model responses.</p>
</li>
<li><p><code>document_store</code>: Keeps track of uploaded documents.</p>
</li>
</ol>
<h3 id="heading-managing-chat-logs">Managing Chat Logs</h3>
<pre><code class="lang-python"><span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">insert_application_logs</span>(<span class="hljs-params">session_id, user_query, gpt_response, model</span>):</span>
    conn = get_db_connection()
    conn.execute(<span class="hljs-string">'INSERT INTO application_logs (session_id, user_query, gpt_response, model) VALUES (?, ?, ?, ?)'</span>,
                 (session_id, user_query, gpt_response, model))
    conn.commit()
    conn.close()

<span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">get_chat_history</span>(<span class="hljs-params">session_id</span>):</span>
    conn = get_db_connection()
    cursor = conn.cursor()
    cursor.execute(<span class="hljs-string">'SELECT user_query, gpt_response FROM application_logs WHERE session_id = ? ORDER BY created_at'</span>, (session_id,))
    messages = []
    <span class="hljs-keyword">for</span> row <span class="hljs-keyword">in</span> cursor.fetchall():
        messages.extend([
            {<span class="hljs-string">"role"</span>: <span class="hljs-string">"human"</span>, <span class="hljs-string">"content"</span>: row[<span class="hljs-string">'user_query'</span>]},
            {<span class="hljs-string">"role"</span>: <span class="hljs-string">"ai"</span>, <span class="hljs-string">"content"</span>: row[<span class="hljs-string">'gpt_response'</span>]}
        ])
    conn.close()
    <span class="hljs-keyword">return</span> messages
</code></pre>
<p>These functions handle inserting new chat logs and retrieving chat history for a given session. The chat history is formatted to be easily usable by our RAG system.</p>
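<p>The interleaving step is worth seeing in isolation: each stored (query, response) pair becomes two messages, so the returned history alternates human/ai turns. A standalone sketch with hard-coded stand-in rows:</p>

```python
# Hard-coded stand-ins for rows fetched from the application_logs table
rows = [
    ("What is RAG?", "Retrieval-Augmented Generation combines retrieval with an LLM."),
    ("Which vector store do we use?", "Chroma."),
]

messages = []
for user_query, gpt_response in rows:
    # Each stored exchange expands into one human message and one ai message
    messages.extend([
        {"role": "human", "content": user_query},
        {"role": "ai", "content": gpt_response},
    ])

print(len(messages))        # 4: two messages per stored row
print(messages[0]["role"])  # human
print(messages[1]["role"])  # ai
```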
<h3 id="heading-managing-document-records">Managing Document Records</h3>
<pre><code class="lang-python"><span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">insert_document_record</span>(<span class="hljs-params">filename</span>):</span>
    conn = get_db_connection()
    cursor = conn.cursor()
    cursor.execute(<span class="hljs-string">'INSERT INTO document_store (filename) VALUES (?)'</span>, (filename,))
    file_id = cursor.lastrowid
    conn.commit()
    conn.close()
    <span class="hljs-keyword">return</span> file_id

<span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">delete_document_record</span>(<span class="hljs-params">file_id</span>):</span>
    conn = get_db_connection()
    conn.execute(<span class="hljs-string">'DELETE FROM document_store WHERE id = ?'</span>, (file_id,))
    conn.commit()
    conn.close()
    <span class="hljs-keyword">return</span> <span class="hljs-literal">True</span>

<span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">get_all_documents</span>():</span>
    conn = get_db_connection()
    cursor = conn.cursor()
    cursor.execute(<span class="hljs-string">'SELECT id, filename, upload_timestamp FROM document_store ORDER BY upload_timestamp DESC'</span>)
    documents = cursor.fetchall()
    conn.close()
    <span class="hljs-keyword">return</span> [dict(doc) <span class="hljs-keyword">for</span> doc <span class="hljs-keyword">in</span> documents]
</code></pre>
<p>These functions handle CRUD operations for document records:</p>
<ul>
<li><p>Inserting new document records</p>
</li>
<li><p>Deleting document records</p>
</li>
<li><p>Retrieving all document records</p>
</li>
</ul>
<h3 id="heading-initialization">Initialization</h3>
<p>At the end of the file, we initialize our database tables:</p>
<pre><code class="lang-python"><span class="hljs-comment"># Initialize the database tables</span>
create_application_logs()
create_document_store()
</code></pre>
<p>This ensures that our tables are created when the application starts, if they don't already exist.</p>
<p>By centralizing our database operations in <code>db_utils.py</code>, we maintain a clean separation of concerns. Our main application logic doesn't need to worry about the details of database interactions, making the code more modular and easier to maintain.</p>
<p>In a production environment, you might consider using an ORM (Object-Relational Mapping) library like SQLAlchemy for more complex database operations and better scalability. However, for our current needs, this straightforward SQLite implementation serves well.</p>
<h2 id="heading-vector-store-integration">Vector Store Integration</h2>
<p>The <code>chroma_utils.py</code> file contains functions for interacting with the Chroma vector store, which is essential for our RAG system's retrieval capabilities. Let's break down the key components of this file:</p>
<pre><code class="lang-python"><span class="hljs-keyword">from</span> langchain_community.document_loaders <span class="hljs-keyword">import</span> PyPDFLoader, Docx2txtLoader, UnstructuredHTMLLoader
<span class="hljs-keyword">from</span> langchain_text_splitters <span class="hljs-keyword">import</span> RecursiveCharacterTextSplitter
<span class="hljs-keyword">from</span> langchain_openai <span class="hljs-keyword">import</span> OpenAIEmbeddings
<span class="hljs-keyword">from</span> langchain_chroma <span class="hljs-keyword">import</span> Chroma
<span class="hljs-keyword">from</span> typing <span class="hljs-keyword">import</span> List
<span class="hljs-keyword">from</span> langchain_core.documents <span class="hljs-keyword">import</span> Document
<span class="hljs-keyword">import</span> os

<span class="hljs-comment"># Initialize text splitter and embedding function</span>
text_splitter = RecursiveCharacterTextSplitter(chunk_size=<span class="hljs-number">1000</span>, chunk_overlap=<span class="hljs-number">200</span>, length_function=len)
embedding_function = OpenAIEmbeddings()

<span class="hljs-comment"># Initialize Chroma vector store</span>
vectorstore = Chroma(persist_directory=<span class="hljs-string">"./chroma_db"</span>, embedding_function=embedding_function)
</code></pre>
<p>Here, we import necessary modules and initialize our text splitter, embedding function, and Chroma vector store. The <code>RecursiveCharacterTextSplitter</code> is used to split documents into manageable chunks, while <code>OpenAIEmbeddings</code> provides the embedding function for our documents.</p>
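<p>To build intuition for <code>chunk_size</code> and <code>chunk_overlap</code>, here is a deliberately simplified sliding-window splitter. Note that this is <em>not</em> how <code>RecursiveCharacterTextSplitter</code> actually works (it splits hierarchically on separators like paragraphs and sentences before falling back to characters); it only illustrates what the two parameters mean:</p>

```python
def naive_split(text: str, chunk_size: int = 1000, chunk_overlap: int = 200) -> list[str]:
    # Each chunk starts (chunk_size - chunk_overlap) characters after the
    # previous one, so consecutive chunks share chunk_overlap characters.
    step = chunk_size - chunk_overlap
    return [text[i:i + chunk_size] for i in range(0, len(text), step)]

digits = "".join(str(i % 10) for i in range(2500))
chunks = naive_split(digits)
print(len(chunks))                          # 4 chunks: starts at 0, 800, 1600, 2400
print(chunks[0][-200:] == chunks[1][:200])  # True: adjacent chunks overlap
```

The overlap means a sentence that straddles a chunk boundary still appears whole in at least one chunk, which helps retrieval quality.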
<h3 id="heading-document-loading-and-splitting">Document Loading and Splitting</h3>
<pre><code class="lang-python"><span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">load_and_split_document</span>(<span class="hljs-params">file_path: str</span>) -&gt; List[Document]:</span>
    <span class="hljs-keyword">if</span> file_path.endswith(<span class="hljs-string">'.pdf'</span>):
        loader = PyPDFLoader(file_path)
    <span class="hljs-keyword">elif</span> file_path.endswith(<span class="hljs-string">'.docx'</span>):
        loader = Docx2txtLoader(file_path)
    <span class="hljs-keyword">elif</span> file_path.endswith(<span class="hljs-string">'.html'</span>):
        loader = UnstructuredHTMLLoader(file_path)
    <span class="hljs-keyword">else</span>:
        <span class="hljs-keyword">raise</span> ValueError(<span class="hljs-string">f"Unsupported file type: <span class="hljs-subst">{file_path}</span>"</span>)

    documents = loader.load()
    <span class="hljs-keyword">return</span> text_splitter.split_documents(documents)
</code></pre>
<p>This function handles loading different document types (PDF, DOCX, HTML) and splitting them into chunks. It uses the appropriate loader based on the file extension and then applies our text splitter to create manageable document chunks.</p>
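<p>The same dispatch can also be expressed as a lookup table, which makes adding a new file type a one-line change. In this sketch the loader class names stand in as strings, since instantiating the real LangChain loaders requires the packages and actual files:</p>

```python
from pathlib import Path

# Extension-to-loader mapping (loader class names as string stand-ins)
LOADER_BY_EXTENSION = {
    ".pdf": "PyPDFLoader",
    ".docx": "Docx2txtLoader",
    ".html": "UnstructuredHTMLLoader",
}

def pick_loader(file_path: str) -> str:
    # Path.suffix handles the extension; lower() makes the lookup case-insensitive
    extension = Path(file_path).suffix.lower()
    if extension not in LOADER_BY_EXTENSION:
        raise ValueError(f"Unsupported file type: {file_path}")
    return LOADER_BY_EXTENSION[extension]

print(pick_loader("report.pdf"))  # PyPDFLoader
```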
<h3 id="heading-indexing-documents">Indexing Documents</h3>
<pre><code class="lang-python"><span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">index_document_to_chroma</span>(<span class="hljs-params">file_path: str, file_id: int</span>) -&gt; bool:</span>
    <span class="hljs-keyword">try</span>:
        splits = load_and_split_document(file_path)

        <span class="hljs-comment"># Add metadata to each split</span>
        <span class="hljs-keyword">for</span> split <span class="hljs-keyword">in</span> splits:
            split.metadata[<span class="hljs-string">'file_id'</span>] = file_id

        vectorstore.add_documents(splits)
        <span class="hljs-keyword">return</span> <span class="hljs-literal">True</span>
    <span class="hljs-keyword">except</span> Exception <span class="hljs-keyword">as</span> e:
        print(<span class="hljs-string">f"Error indexing document: <span class="hljs-subst">{e}</span>"</span>)
        <span class="hljs-keyword">return</span> <span class="hljs-literal">False</span>
</code></pre>
<p>This function takes a file path and a file ID, loads and splits the document, adds metadata (file ID) to each split, and then adds these document chunks to our Chroma vector store. The metadata allows us to link vector store entries back to our database records.</p>
<h3 id="heading-deleting-documents">Deleting Documents</h3>
<pre><code class="lang-python"><span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">delete_doc_from_chroma</span>(<span class="hljs-params">file_id: int</span>):</span>
    <span class="hljs-keyword">try</span>:
        docs = vectorstore.get(where={<span class="hljs-string">"file_id"</span>: file_id})
        print(<span class="hljs-string">f"Found <span class="hljs-subst">{len(docs[<span class="hljs-string">'ids'</span>])}</span> document chunks for file_id <span class="hljs-subst">{file_id}</span>"</span>)

        vectorstore._collection.delete(where={<span class="hljs-string">"file_id"</span>: file_id})
        print(<span class="hljs-string">f"Deleted all documents with file_id <span class="hljs-subst">{file_id}</span>"</span>)

        <span class="hljs-keyword">return</span> <span class="hljs-literal">True</span>
    <span class="hljs-keyword">except</span> Exception <span class="hljs-keyword">as</span> e:
        print(<span class="hljs-string">f"Error deleting document with file_id <span class="hljs-subst">{file_id}</span> from Chroma: <span class="hljs-subst">{str(e)}</span>"</span>)
        <span class="hljs-keyword">return</span> <span class="hljs-literal">False</span>
</code></pre>
<p>This function deletes all document chunks associated with a given file ID from the Chroma vector store. It first retrieves the documents to confirm their existence, then performs the deletion.</p>
<h3 id="heading-integration-with-rag-system">Integration with RAG System</h3>
<p>While not explicitly shown in this file, the Chroma vector store is crucial for our RAG system's retrieval step. In <code>langchain_utils.py</code>, we use this vector store to create a retriever:</p>
<pre><code class="lang-python">retriever = vectorstore.as_retriever(search_kwargs={<span class="hljs-string">"k"</span>: <span class="hljs-number">2</span>})
</code></pre>
<p>This retriever is then used in our RAG chain to fetch relevant document chunks based on the user's query.</p>
<p>By centralizing our vector store operations in <code>chroma_utils.py</code>, we maintain a clean separation of concerns and make it easier to swap out or upgrade our vector store implementation in the future if needed.</p>
<h2 id="heading-langchain-rag-implementation">LangChain RAG Implementation</h2>
<p>The <code>langchain_utils.py</code> file is where we implement the core of our Retrieval-Augmented Generation (RAG) system using LangChain. This file sets up the language model, retriever, and the RAG chain. Let's break down its key components:</p>
<pre><code class="lang-python"><span class="hljs-keyword">from</span> langchain_openai <span class="hljs-keyword">import</span> ChatOpenAI
<span class="hljs-keyword">from</span> langchain_core.output_parsers <span class="hljs-keyword">import</span> StrOutputParser
<span class="hljs-keyword">from</span> langchain_core.prompts <span class="hljs-keyword">import</span> ChatPromptTemplate, MessagesPlaceholder
<span class="hljs-keyword">from</span> langchain.chains <span class="hljs-keyword">import</span> create_history_aware_retriever, create_retrieval_chain
<span class="hljs-keyword">from</span> langchain.chains.combine_documents <span class="hljs-keyword">import</span> create_stuff_documents_chain
<span class="hljs-keyword">from</span> typing <span class="hljs-keyword">import</span> List
<span class="hljs-keyword">from</span> langchain_core.documents <span class="hljs-keyword">import</span> Document
<span class="hljs-keyword">import</span> os
<span class="hljs-keyword">from</span> chroma_utils <span class="hljs-keyword">import</span> vectorstore

retriever = vectorstore.as_retriever(search_kwargs={<span class="hljs-string">"k"</span>: <span class="hljs-number">2</span>})

output_parser = StrOutputParser()
</code></pre>
<p>Here, we import necessary LangChain components and set up our retriever using the Chroma vectorstore we created earlier. We also initialize a string output parser for processing the language model's output.</p>
<h3 id="heading-setting-up-prompts">Setting Up Prompts</h3>
<pre><code class="lang-python">contextualize_q_system_prompt = (
    <span class="hljs-string">"Given a chat history and the latest user question "</span>
    <span class="hljs-string">"which might reference context in the chat history, "</span>
    <span class="hljs-string">"formulate a standalone question which can be understood "</span>
    <span class="hljs-string">"without the chat history. Do NOT answer the question, "</span>
    <span class="hljs-string">"just reformulate it if needed and otherwise return it as is."</span>
)

contextualize_q_prompt = ChatPromptTemplate.from_messages([
    (<span class="hljs-string">"system"</span>, contextualize_q_system_prompt),
    MessagesPlaceholder(<span class="hljs-string">"chat_history"</span>),
    (<span class="hljs-string">"human"</span>, <span class="hljs-string">"{input}"</span>),
])

qa_prompt = ChatPromptTemplate.from_messages([
    (<span class="hljs-string">"system"</span>, <span class="hljs-string">"You are a helpful AI assistant. Use the following context to answer the user's question."</span>),
    (<span class="hljs-string">"system"</span>, <span class="hljs-string">"Context: {context}"</span>),
    MessagesPlaceholder(variable_name=<span class="hljs-string">"chat_history"</span>),
    (<span class="hljs-string">"human"</span>, <span class="hljs-string">"{input}"</span>)
])
</code></pre>
<p>We define two main prompts:</p>
<ol>
<li><p><code>contextualize_q_prompt</code>: Used to reformulate the user's question based on chat history.</p>
</li>
<li><p><code>qa_prompt</code>: Used to generate the final answer based on the retrieved context and chat history.</p>
</li>
</ol>
<h3 id="heading-creating-the-rag-chain">Creating the RAG Chain</h3>
<pre><code class="lang-python"><span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">get_rag_chain</span>(<span class="hljs-params">model=<span class="hljs-string">"gpt-4o-mini"</span></span>):</span>
    llm = ChatOpenAI(model=model)
    history_aware_retriever = create_history_aware_retriever(llm, retriever, contextualize_q_prompt)
    question_answer_chain = create_stuff_documents_chain(llm, qa_prompt)
    rag_chain = create_retrieval_chain(history_aware_retriever, question_answer_chain)    
    <span class="hljs-keyword">return</span> rag_chain
</code></pre>
<p>This function creates our RAG chain:</p>
<ol>
<li><p>It initializes the language model (<code>ChatOpenAI</code>) with the specified model name.</p>
</li>
<li><p>Creates a history-aware retriever that reformulates the user's question using the chat history before querying the vector store.</p>
</li>
<li><p>Sets up a question-answering chain that stuffs the retrieved documents into the prompt to generate an answer.</p>
</li>
<li><p>Finally, it creates the full RAG chain by combining the retriever and question-answering chain.</p>
</li>
</ol>
<h3 id="heading-integration-with-main-application">Integration with Main Application</h3>
<p>In our <code>main.py</code>, we use this RAG chain in the chat endpoint:</p>
<pre><code class="lang-python"><span class="hljs-meta">@app.post("/chat", response_model=QueryResponse)</span>
<span class="hljs-function"><span class="hljs-keyword">def</span> <span class="hljs-title">chat</span>(<span class="hljs-params">query_input: QueryInput</span>):</span>
    <span class="hljs-comment"># ... (other code)</span>
    rag_chain = get_rag_chain(query_input.model.value)
    answer = rag_chain.invoke({
        <span class="hljs-string">"input"</span>: query_input.question,
        <span class="hljs-string">"chat_history"</span>: chat_history
    })[<span class="hljs-string">'answer'</span>]
    <span class="hljs-comment"># ... (rest of the function)</span>
</code></pre>
<p>This shows how the RAG chain is instantiated with the user-specified model and invoked with the user's question and chat history.</p>
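<p>The <code>chat_history</code> passed to <code>rag_chain.invoke</code> has to be a list of prior messages. As a hypothetical sketch (the real application loads these rows through its database utilities, not shown here), logged question/answer pairs can be flattened into the <code>("human", ...)</code> / <code>("ai", ...)</code> tuple form that LangChain accepts for message history:</p>

```python
# Illustrative helper: turn logged (question, answer) rows into the
# alternating human/ai message list that MessagesPlaceholder("chat_history")
# expands into. Names are assumptions, not the tutorial's actual code.
def build_chat_history(logged_turns):
    history = []
    for question, answer in logged_turns:
        history.append(("human", question))
        history.append(("ai", answer))
    return history
```

<p>Each prior turn contributes one human and one AI message, so the prompt sees the conversation in order.</p>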
<p>By centralizing our LangChain logic in <code>langchain_utils.py</code>, we maintain a clean separation of concerns and make it easier to modify or extend our RAG system in the future. This modular approach allows us to easily experiment with different models, retrievers, or chain structures without affecting the rest of the application.</p>
<h2 id="heading-conclusion">Conclusion</h2>
<p>Throughout this tutorial, we've walked through the process of building a production-ready Retrieval-Augmented Generation (RAG) chatbot using FastAPI and LangChain. Let's recap what we've accomplished and discuss some key takeaways and potential next steps.</p>
<h3 id="heading-what-weve-built">What We've Built</h3>
<ol>
<li><p><strong>FastAPI Application</strong> (<code>main.py</code>): We created a robust API with endpoints for chat interactions, document management, and system information.</p>
</li>
<li><p><strong>Data Models</strong> (<code>pydantic_models.py</code>): We defined clear, type-safe models for our API's requests and responses.</p>
</li>
<li><p><strong>Database Utilities</strong> (<code>db_utils.py</code>): We implemented SQLite database operations for managing chat logs and document metadata.</p>
</li>
<li><p><strong>Vector Store Integration</strong> (<code>chroma_utils.py</code>): We set up document indexing and retrieval using the Chroma vector store.</p>
</li>
<li><p><strong>LangChain RAG Implementation</strong> (<code>langchain_utils.py</code>): We created a flexible, history-aware RAG chain using LangChain components.</p>
</li>
</ol>
<p>This architecture allows for a scalable, maintainable, and extensible RAG system that can be deployed in a production environment.</p>
<h3 id="heading-potential-improvements-and-extensions">Potential Improvements and Extensions</h3>
<ol>
<li><p><strong>Authentication and Authorization</strong>: Implement user authentication to secure the API and enable user-specific document access.</p>
</li>
<li><p><strong>Asynchronous Processing</strong>: Convert synchronous operations to asynchronous for better performance, especially for document processing.</p>
</li>
<li><p><strong>Advanced Retrieval Techniques</strong>: Experiment with techniques like hybrid search or re-ranking to improve retrieval quality.</p>
</li>
<li><p><strong>Monitoring and Logging</strong>: Implement comprehensive logging and monitoring for better observability in production.</p>
</li>
<li><p><strong>Scalability</strong>: Consider distributed architectures for handling larger document collections and higher request volumes.</p>
</li>
<li><p><strong>Fine-tuning</strong>: Explore fine-tuning the language model on domain-specific data for improved performance.</p>
</li>
<li><p><strong>UI Integration</strong>: Develop a user interface (e.g., a web application or chat interface) to interact with the API.</p>
</li>
<li><p><strong>Containerization</strong>: Package the application using Docker for easier deployment and scaling.</p>
</li>
<li><p><strong>Testing</strong>: Implement comprehensive unit and integration tests to ensure system reliability.</p>
</li>
<li><p><strong>Caching</strong>: Introduce caching mechanisms to improve response times for frequent queries.</p>
</li>
</ol>
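<p>As one way to prototype the caching idea above, here is a minimal in-memory TTL cache. This is a sketch, not part of the tutorial's codebase: a real deployment would likely use Redis or similar, and because answers depend on the chat history, a production cache would need a richer key than the <code>(model, question)</code> pair assumed here.</p>

```python
import time

# Minimal time-to-live cache: each entry expires ttl_seconds after it is set.
class TTLCache:
    def __init__(self, ttl_seconds=300):
        self.ttl = ttl_seconds
        self.store = {}  # key -> (expiry_timestamp, value)

    def get(self, key):
        entry = self.store.get(key)
        if entry is None:
            return None
        expiry, value = entry
        if time.time() > expiry:
            # Entry is stale; drop it and report a miss.
            del self.store[key]
            return None
        return value

    def set(self, key, value):
        self.store[key] = (time.time() + self.ttl, value)
```

<p>In the chat endpoint, a hit on <code>cache.get((model, question))</code> could then skip the LLM call entirely for repeated stateless queries.</p>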
<h3 id="heading-final-thoughts">Final Thoughts</h3>
<p>Building a production-ready RAG chatbot involves more than just connecting a language model to a document store. It requires careful consideration of data flow, error handling, scalability, and user experience. The system we've built provides a solid foundation that can be adapted and extended to meet specific business needs.</p>
<p>As AI and natural language processing technologies continue to evolve, systems like this will become increasingly important for creating intelligent, context-aware applications. By understanding the principles and components of RAG systems, you're well-equipped to build and improve upon this technology in your own projects.</p>
<p>Remember, the key to a successful RAG system lies not just in the individual components, but in how they work together to create a seamless, intelligent interaction. Continual testing, monitoring, and refinement based on real-world usage will be crucial to ensuring the long-term success and effectiveness of your RAG chatbot.</p>
<h3 id="heading-additional-resources">Additional Resources</h3>
<p>To help you further understand and implement this RAG chatbot system, I've prepared some additional resources:</p>
<ol>
<li><p><strong>Video Tutorial</strong>: For a comprehensive walkthrough of this entire project, including live coding and explanations, check out my YouTube video:</p>
<p> <a target="_blank" href="https://youtu.be/38aMTXY2usU?si=9XsjXHvtAHbSeP32">Watch the Full RAG Chatbot Tutorial</a></p>
<p> In this video, I cover all three parts of our blog series, demonstrating the implementation details and providing additional insights.</p>
</li>
<li><p><strong>GitHub Repository</strong>: The complete source code for this project is available on GitHub. You can clone, fork, or download the repository to explore the code in detail or use it as a starting point for your own projects:</p>
<p> <a target="_blank" href="https://github.com/PradipNichite/Youtube-Tutorials/tree/main/Langchain%20RAG%20Course%202024">RAG Chatbot GitHub Repository</a></p>
<p> The repository includes all the components we've discussed: the FastAPI backend, Streamlit frontend, and associated utilities.</p>
</li>
</ol>
<h3 id="heading-futuresmart-ai-your-partner-in-custom-nlp-solutions">FutureSmart AI: Your Partner in Custom NLP Solutions</h3>
<p>At FutureSmart AI, we specialize in building custom Natural Language Processing (NLP) solutions tailored to your specific needs. Our expertise extends beyond RAG systems to include:</p>
<ul>
<li><p>Natural Language to SQL (NL2SQL) interfaces</p>
</li>
<li><p>Advanced document parsing and analysis</p>
</li>
<li><p>Custom chatbots and conversational AI</p>
</li>
<li><p>And much more in the realm of NLP and AI</p>
</li>
</ul>
<p>We've successfully implemented these technologies for various industries, helping businesses leverage the power of AI to enhance their operations and user experiences.</p>
<h4 id="heading-interested-in-learning-more">Interested in Learning More?</h4>
<ul>
<li><p><strong>Check out our case studies</strong>: <a target="_blank" href="https://www.futuresmart.ai/case-studies">FutureSmart AI Case Studies</a>. Explore real-world applications of our NLP solutions and see how they've transformed businesses across different sectors.</p>
</li>
<li><p><strong>Get in touch</strong>: Have a project in mind or want to discuss how NLP can benefit your business? We'd love to hear from you! Contact us at <a target="_blank" href="mailto:contact@futuresmart.ai">contact@futuresmart.ai</a></p>
</li>
</ul>
<p>Whether you're looking to implement a RAG system like the one we've built in this tutorial, or you have more specific NLP needs, our team at FutureSmart AI is here to help turn your AI aspirations into reality.</p>
]]></content:encoded></item></channel></rss>