Expert Langchain Developer for Facebook Business Page Scraper Using Open-Source Llama

**Job Title:** Expert Langchain Developer for Facebook Business Page Scraper Using Open-Source Llama

**Description:**
I am seeking an experienced Langchain developer to create a **web application** that scrapes Facebook business pages and extracts **100% of available data fields**. This application will enrich an external database of 22 million leads, each containing company name, address, phone number, and sometimes a website.

This project **must use a free, open-source Llama model** for the LLM within Langchain. **Applicants must be experts in building Langchain scrapers** and provide proof of prior projects they have successfully completed.

The project will involve the following:

### **Project Scope**

1. **Langchain Scraper with Open-Source Llama Model**
– Develop a highly efficient Langchain-based scraper utilizing the **free, open-source Llama model** as its LLM.
– Use **ScraperAPI** or similar **free scraping tools** to optimize and scale scraping efforts.
– Scrape Facebook business pages and extract all available data fields.

2. **Web Application Playground**
– Create a **UI Playground** to test the scraper with a single lead by inputting details (company name, address, phone, and website).
– The playground must be **password protected**, with:
– A **temporary default password** for development.
– Ability to change the password via the UI.
– **Encrypted password storage** for enhanced security.

3. **API Development with Key Management**
– Develop an **API** to handle high-volume requests (100,000+ API calls daily).
– Include a robust **API key management system** with the ability to:
– Generate, manage, and revoke API keys.
– Ensure the API does not function without a valid key.
– API output must be in **JSON format**, containing all scraped fields from the Facebook business pages.
– Include a comprehensive **API documentation page**, detailing endpoints, request parameters, response formats, error codes, examples, and how to use API keys.

4. **Hosting and Deployment**
– I will provide an **Amazon Lightsail server** with IP, credentials, and access keys for deployment.
– I will also set up the required **subdomain** for the application.

5. **Future Expandability**
– While this project focuses on Facebook business pages, future scopes will include scraping additional sources for business information, such as:
– **Emails**
– **LinkedIn profiles**
– **WhatsApp numbers**
– **Google PPC listings**
– **Other online platforms** where business information can be found.

### **Technical Requirements**

– **LLM:** Must use the free, open-source Llama model for Langchain workflows.
– **Web Scraping Tools:**
– **ScraperAPI** or other **free, scalable scraping tools** for managing requests and data extraction.
– **Programming Languages:**
– Required for **Langchain and Web Scraping:**
– **Python** (preferred for Langchain and overall development).
– **JavaScript** (for handling browser-based scraping with tools like Puppeteer or Playwright).
– **HTML/CSS** (for scraping and understanding DOM structures).
– **Node.js** (optional, but useful for backend and advanced scraping scenarios).
– Knowledge of **API integration and key management** is essential.
– **Authentication:** Temporary password with the ability to change it, using encrypted storage.
– **Database:** No database needed for now; scraped data will be output via API and playground.
– **API Security:** API must be key-protected, ensuring no access without a valid API key.
– **API Output:** JSON format with all available fields from Facebook business pages.
– **API Documentation:** Clear, well-written documentation for developers integrating with the API.
– **Scalability:** API must handle large-scale scraping efficiently (100k+ API calls daily).
– **Security:** Ensure secure API calls and password-protected playground.

### **Deliverables**

1. Fully functional Langchain scraper integrated with the Llama model.
2. Password-protected playground with encrypted password storage and a mechanism to change passwords.
3. API endpoint with JSON output, capable of handling high-volume requests.
4. Comprehensive **API key management system**, ensuring only authorized usage.
5. Detailed **API documentation page** for developers.
6. Deployment on the Amazon Lightsail server provided.
7. Documentation for setup, usage, and deployment.

### **Qualifications**

– Proven expertise in **Langchain scrapers** (must provide proof of prior successful projects).
– Experience with **open-source Llama models** and Langchain workflows.
– Strong knowledge of building secure, scalable APIs with key management and clear documentation.
– Familiarity with web scraping using **ScraperAPI**, **Selenium**, or other tools, and **Python, JavaScript, HTML/CSS**, or tools like Puppeteer or Playwright.
– Experience deploying applications on **Amazon Lightsail**.

### **Budget and Timeline**

– **Budget:** Taking project offers ranging from **$500 to $1,500**.
– **Timeline:** Completion within **5 days of signing the contract**.

**Note:** If you are awarded this project, you will also be committing to all follow-up projects to upgrade and enhance this scraper in the future.

### **Next Steps**

If you’re interested, please Apply and answer the questions:

Share the Post:

Related Posts

High Quality Palm Oil

RBD Palm Olein CP6 (Refined, Bleached, and Deodorized) is a high-quality, liquid fraction of palm oil obtained through the fractionation

Read More