Remote
0-2
Jr. Data Engineer
12/21/2025
The Jr. Data Engineer will write and deploy crawling scripts to collect data from the web and standardize bulk datasets using Scala Spark. They will also diagnose and fix bugs, analyze internal datasets, and collaborate with a team of engineers.
Salary
85000 - 90000 USD
Working Hours
40 hours/week
Language
English
Visa Sponsorship
No
About The Company
No description available for this Company.
About the Role
<div class="content-intro"><h2><strong>About Sayari: </strong></h2>
<div>
<p>Sayari is a risk intelligence provider that equips the public and private sectors with immediate visibility into complex commercial relationships by delivering the largest commercially available collection of corporate and trade data from over 250 jurisdictions worldwide. Sayari's solutions enable risk resilience, mission-critical investigations, and better economic decisions. </p>
<p>Headquartered in Washington, D.C., its solutions are trusted by Fortune 500 companies, financial institutions, and government agencies, and are used globally by thousands of users in over 35 countries. Funded by world-class investors, with a strategic $228 million investment by TPG Inc. (NASDAQ: TPG) in 2024, Sayari has been recognized by the Inc. 5000 and the Deloitte Technology Fast 500 as one of the fastest growing private companies in the United States and was featured as one of Inc.’s “Best Workplaces” for 2025.</p>
</div></div><p><strong>POSITION DESCRIPTION</strong></p>
<p>Sayari is looking for an Entry-Level Data Engineer to join our Data team located in Washington, DC. The Data team is an integral part of our Engineering division and works closely with our Software & Product teams, as well as other key stakeholders across the business.</p>
<p><strong>JOB RESPONSIBILITIES:</strong></p>
<ul>
<li>Write and deploy crawling scripts to collect source data from the web</li>
<li>Write and run data transformers in Scala Spark to standardize bulk data sets</li>
<li>Write and run modules in Python to parse entity references and relationships from source data</li>
<li>Diagnose and fix bugs reported by internal and external users</li>
<li>Analyze and report on internal datasets to answer questions and inform feature work</li>
<li>Work collaboratively on and across a team of engineers using basic agile principles</li>
<li>Give and receive feedback through code reviews</li>
</ul>
<p><strong>SKILLS & EXPERIENCE</strong></p>
<p><em>Required Skills & Experience</em></p>
<ul>
<li>Bachelor’s or Master’s degree in Computer Science, Data Science, Engineering, or a related technical field — or equivalent hands-on experience</li>
<li>Working knowledge of SQL and relational databases (such as Postgres)</li>
<li>Experience writing code in Python (e.g., pandas, NumPy, Scrapy) or Java/Scala</li>
<li>Familiarity with data processing frameworks like Apache Spark, or strong interest in learning them on the job</li>
<li>Understanding of object-oriented programming principles and collaborative development in shared repositories</li>
<li>Ability to work closely with data scientists, analysts, and engineers to help solve complex problems across large, diverse datasets</li>
</ul>
<p><em>Desired Skills & Experience</em></p>
<ul>
<li>Exposure to workflow orchestration tools such as Apache Airflow and CI/CD pipelines</li>
<li>Familiarity with graph, search, or NoSQL databases</li>
<li>Experience contributing to data ingestion, transformation, or ETL pipelines</li>
<li>Comfort working with containerized applications (e.g., Docker)</li>
<li>Experience using cloud-based data tools in AWS or GCP environments</li>
<li>Introductory experience or coursework involving machine learning, especially in distributed systems like Spark</li>
<li>Awareness of entity resolution concepts or interest in learning how entities are linked across data sources</li>
<li>Experience working with international or non-English datasets</li>
</ul>
<p>The target base salary for this position is $85,000-$90,000 plus company bonus and equity. Final offer amounts are determined by multiple factors including location, local market variances, candidate experience and expertise, internal peer equity, and may vary from the amounts listed above.</p>
<p> </p>
<h3><strong>Benefits: </strong></h3>
<div>
<ul>
<li>100% fully paid medical, vision, and dental for employees and their dependents</li>
<li>Generous time off; we observe all US federal holidays, close our office for a winter break (12/24-12/31), in addition to granting 18 PTO days and 10 sick days</li>
<li>Outstanding compensation package; competitive commissions for revenue roles and quarterly bonuses for non-revenue positions</li>
<li>A strong commitment to diversity, equity, and inclusion</li>
<li>Eligibility to participate in additional benefits such as 401k match up to 5%, 100% paid life insurance (up to $100,000 coverage),, and parental leave</li>
<li>A collaborative and positive culture - your team will be as smart and driven as you</li>
<li>Limitless growth and learning opportunities</li>
</ul>
</div>
<div> </div>
<div><em>Sayari is an equal opportunity employer and strongly encourages diverse candidates to apply. We believe diversity and inclusion mean our team members should reflect the diversity of the United States. No employee or applicant will face discrimination or harassment based on race, color, ethnicity, religion, age, gender, gender identity or expression, sexual orientation, disability status, veteran status, genetics, or political affiliation. We strongly encourage applicants of all backgrounds to apply.</em></div><div class="content-pay-transparency"><div class="pay-input"><div class="title">Pay Range</div><div class="pay-range"><span>$85,000</span><span class="divider">—</span><span>$100,000 USD</span></div></div></div>
Key Skills
SQLRelational DatabasesPythonJavaScalaApache SparkObject-Oriented ProgrammingData ProcessingData TransformationETL PipelinesDockerCloud ComputingMachine LearningEntity ResolutionData IngestionCollaboration
Categories
TechnologyData & AnalyticsEngineeringSoftware
Benefits
Health InsuranceVision InsuranceDental InsurancePaid Time OffSick Leave401kLife InsuranceParental Leave
Apply Now
Please let Sayari know you found this job on InterviewPal. This helps us grow!
Prepare for Your Interview
We scan and aggregate real interview questions reported by candidates across thousands of companies. This role already has a tailored question set waiting for you.
Elevate your application
Generate a resume, cover letter, or prepare with our AI mock interviewer tailored to this job's requirements.