🚀 NetSuite Web Crawler

This project is a Python-based web crawler that logs into NetSuite, navigates to specific sections (like Custom Records), and extracts links for analysis. It supports 2FA authentication, security question handling, and headless mode for automation.

📌 Features

✅ Automated Login (Email & Password)
✅ Handles 2FA (Console Input in Headless Mode)
✅ Answers Security Questions Automatically
✅ Crawls NetSuite Pages & Extracts Links
✅ Supports Headless Mode for Automation
✅ Navigates Directly to Custom Records
✅ Scrapes User Role Permissions across Transactions, Reports, Lists, Setup and Custom Record sections
✅ Extracts Workflow Actions for any record type
✅ Scrapes Custom List Values into CSV
✅ Exports Scraped Data to CSV
✅ Handles Multi‐Page Role Lists with Pagination

⚙️ Installation & Setup

1️⃣ Clone the Repository

git clone https://github.com/Joewus12345/netsuite_crawler.git
cd netsuite_crawler

2️⃣ Set Up a Virtual Environment (Recommended)

python -m venv venv
source venv/bin/activate  # On macOS/Linux
venv\Scripts\activate     # On Windows

3️⃣ Install Dependencies

pip install -r requirements.txt

4️⃣ Set Up config.py

Create a config.py file in the root directory and add your NetSuite credentials:

# config.py

NETSUITE_URL = "https://your-netsuite-url.com"
NETSUITE_EMAIL = "[email protected]"
NETSUITE_PASSWORD = "your-password"
SECURITY_ANSWER = "your-security-question-answer"
HEADLESS_MODE = False  # Change to True to run without opening a browser

Note: The NetSuite administrator login URL can vary between companies or accounts. Replace https://your-netsuite-url.com with the correct admin URL for your environment.

🚀 Running the Crawler

Choose one or more scrapers to run using the --scrapers flag. Scrapers run sequentially after a single login.

Available scrapers:

crawler
workflows
user-roles
list-values

Default Output Files

Each scraper saves its results to a CSV file in the project root:

list-values → list_values.csv, containing custom list IDs, names, and their associated values.
user-roles → user_role_permissions.csv, capturing each role's permissions across transactions, reports, lists, and setup categories.
workflows → workflow_actions.csv, listing workflow names, record types, and their associated actions.

Examples

Scrape list values and user roles:

python main.py --scrapers list-values,user-roles

Scrape workflows for specific record types:

The --records flag expects a JSON array of record-type names. Quoting rules vary by terminal:

bash (Linux/macOS):

python main.py --scrapers workflows --records '["Admin Request","Feedback"]'

cmd.exe:

python main.py --scrapers workflows --records "[\"Admin Request\",\"Feedback\"]"

For quick runs you can pre-populate workflow_scraper.HARDCODED with a list of record names. This list is used whenever --records is not passed:

# workflow_scraper.py
HARDCODED: list[str] = ["Admin Request", "Feedback"]

Headless Mode (Without Browser)

Edit config.py and set:

HEADLESS_MODE = True

Use the same command-line options; the browser runs hidden and prompts you in the terminal for the 2FA code.

NB: This only applies to user accounts with Administrative Privileges

Provide the 2FA code when prompted.

📂 Project Structure

📂 netsuite_crawler
 ┣ 📜 config.py              # Stores credentials & config
 ┣ 📂 chromedriver           # Chrome browser for running project
 ┣ 📜 main.py                # Entry point for the bot
 ┣ 📜 auth_utils.py          # Authentication helpers
 ┣ 📜 crawler.py             # Core logic for logging in & crawling
 ┣ 📜 list_values_scraper.py # Scrapes custom list values
 ┣ 📜 user_roles_scraper.py  # Scrapes role permissions
 ┣ 📜 workflow_scraper.py    # Scrapes workflow actions
 ┣ 📂 tests                  # Unit tests
 ┣ 📜 requirements.txt       # Dependencies list
 ┗ 📜 README.md              # Project documentation (You are here!)

🧪 Testing

Run the test suite with:

python -m pytest

The tests mock browser interactions and require no live NetSuite credentials.

🛠️ Troubleshooting

1️⃣ WebDriver Issues?

Ensure Google Chrome is installed & updated.
Download ChromeDriver from: chromedriver.chromium.org

2️⃣ 2FA Not Submitting in Headless Mode?

Check if the submit button selector is correct:

div[data-type='primary'][role='button']

3️⃣ Login Not Working?

Try logging in manually to ensure credentials are correct.
Check if NetSuite has CAPTCHA enabled (this bot does not bypass CAPTCHA).

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

🚀 NetSuite Web Crawler

📌 Features

⚙️ Installation & Setup

1️⃣ Clone the Repository

2️⃣ Set Up a Virtual Environment (Recommended)

3️⃣ Install Dependencies

4️⃣ Set Up config.py

🚀 Running the Crawler

Default Output Files

Examples

Headless Mode (Without Browser)

NB: This only applies to user accounts with Administrative Privileges

📂 Project Structure

🧪 Testing

🛠️ Troubleshooting

1️⃣ WebDriver Issues?

2️⃣ 2FA Not Submitting in Headless Mode?

3️⃣ Login Not Working?

About

Uh oh!

Releases

Packages

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 126 Commits
__pycache__		__pycache__
chromedriver		chromedriver
tests		tests
.gitignore		.gitignore
AGENTS.md		AGENTS.md
README.md		README.md
auth_utils.py		auth_utils.py
crawler.py		crawler.py
list_values_scraper.py		list_values_scraper.py
main.py		main.py
requirements.txt		requirements.txt
user_roles_scraper.py		user_roles_scraper.py
workflow_scraper.py		workflow_scraper.py

Joewus12345/netsuite_crawler

Folders and files

Latest commit

History

Repository files navigation

🚀 NetSuite Web Crawler

📌 Features

⚙️ Installation & Setup

1️⃣ Clone the Repository

2️⃣ Set Up a Virtual Environment (Recommended)

3️⃣ Install Dependencies

4️⃣ Set Up config.py

🚀 Running the Crawler

Default Output Files

Examples

Headless Mode (Without Browser)

NB: This only applies to user accounts with Administrative Privileges

📂 Project Structure

🧪 Testing

🛠️ Troubleshooting

1️⃣ WebDriver Issues?

2️⃣ 2FA Not Submitting in Headless Mode?

3️⃣ Login Not Working?

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages