Hi there! We are fal, and we are on a mission to build world’s first generative media platform for developers.

We built a serverless runtime for Python that is optimized to run large ML models on 1000s of GPUs efficiently. The applications built on our platform are currently serving millions of users around the world and our goal is to 1000x that over the next few years.

To see some examples of our product in action, go to our model gallery and docs

fal was set up as a fully remote company from the start. Today, we are 20 people strong, distributed across the world. We are looking for founding team members who share our excitement about the fast moving nature of AI and can independently build world-class infrastructure. Let’s chat!

Open Roles

Product Engineer

PRODUCT ENGINEER You are a versatile engineer who thrives on building and deploying seamless user experiences. You possess a strong understanding of both backend and frontend technologies, enabling you to take ownership of features from concept to launch. You are proficient in crafting robust APIs, managing databases, and developing interactive user interfaces. Your focus is on delivering high-quality, scalable, and maintainable products. KEY RESPONSIBILITIES: - You will have access to our cloud infrastructure for development and deployment. You will make our model playgrounds more interactive and help make them more discoverable. - Some core technologies we use include Python, Postgres, and Next.js. - You'll collaborate with a cross-functional team to rapidly iterate and deploy new features. WHAT WE OFFER AT FAL: - Interesting and challenging work - Competitive salary and equity - Employee-friendly equity terms (early exercise, extended exercise) - A lot of learning and growth opportunities - We offer visa sponsorship and will help you relocate to San Francisco. - Health, dental, and vision insurance (US) - Regular team events and offsite COMPENSATION: - $180,000 - $250,000 + equity + comprehensive benefits package LOCATION: - We are currently hiring in downtown San Francisco. We prefer to work in person, but we also offer remote work opportunities for exceptional candidates.

$180K – $230K • Offers Equity
Show more

Customer Success Manager

About fal: fal is building the future of AI-powered media by offering best-in-class infrastructure that empowers developers, creators, and companies to bring generative AI into the real world—fast, reliable, and at scale. Our mission is to make AI accessible and operational for every organization, enabling innovation across media, entertainment, design, and beyond. We're a team of builders, dreamers, and technologists who believe AI should be as easy to deploy as it is powerful. Our customers include groundbreaking startups, creative agencies, and global enterprises who rely on fal to scale their generative AI capabilities in production. About the Role: We’re looking for a Scaled Customer Success Manager to lead post-sales engagement across a large and dynamic portfolio of customers. You will help users onboard smoothly, realize immediate value, and scale their use of fal’s platform through personalized support programs, targeted campaigns, and strategic outreach. As a trusted advisor, you’ll play a critical role in shaping how our customers bring AI-powered experiences to life. What You'll Do: • Manage a high-volume, high-impact portfolio of customers across the Globe, ensuring adoption, satisfaction, and growth. Lead onboarding initiatives and design tailored consultations to help customers integrate fal into their workflows. • Develop and run data-driven outreach programs that drive adoption of fal’s generative AI infrastructure at scale. Collaborate closely with sales and product teams to ensure a unified, value-focused customer journey. • Act as a product advocate—up-leveling customer knowledge, uncovering new use cases, and helping users get the most out of fal. Identify customers at risk of churn or stagnation, and proactively intervene with customized success plans. • Surface feedback and market insights that inform product development, feature prioritization, and roadmap decisions. • Help shape the Customer Success foundation at fal, establishing scalable processes, playbooks, and success metrics. What You'll Achieve: • Enable companies to bring generative AI products to market faster and more efficiently using fal’s platform. • Drive product adoption and seat expansion across a wide variety of use cases in creative, enterprise, and developer segments. • Influence how fal grows by working with customers and internal teams to iterate on success strategies and product experience. • Elevate your skills by helping customers navigate emerging AI tech and scaling a success function in a fast-moving startup. About You: • 5+ years of experience in Customer Success, Account Management, or post-sales roles in a fast-paced SaaS or AI company. • Proven ability to manage a large, diverse set of customers with varying needs and technical sophistication. • Strong communication skills—you can clearly explain technical concepts to both developers and business stakeholders. • Highly organized with experience in building and running scaled programs or automated success campaigns. • Collaborative, resourceful, and energized by helping others succeed. • Comfortable with ambiguity and excited by the opportunity to build from 0 to 1. Bonus Points: • Creative Technologist – Generative Media Focus - Blends a strong creative instinct with a deep passion for generative media, passionate about AI-driven image and video generation. Constantly exploring the intersection of art and technology. • A Slack Ninja - Quick to reply, never misses a mention, and somehow remembers every thread, all while keeping tone professional and helpful. It’s part communication, part prioritization, and a dash of magic. • Previously built Customer Success playbooks or scaled CS motions at an early-stage startup. • Experience with APIs, SDKs, or technical product onboarding.

$112K – $144K
Show more

Sr. Technical Writer

ABOUT THE ROLE We're looking for a passionate and detail-oriented Technical Writer to help us create best-in-class API, SDK, and platform documentation that empowers developers to build with confidence. You’ll work at the intersection of engineering, product, and customer success—translating complex technical concepts into clear, engaging, and actionable content. WHAT YOU’LL DO - Own the end-to-end creation and maintenance of API and SDK documentation for fal’s developer platform. - Collaborate closely with engineers and product managers to understand new features, APIs, and developer workflows. - Work with solutions engineers, support, and customers to identify gaps in content and improve the onboarding and support experience. - Design and structure documentation to support real-world developer use cases and workflows. - Maintain a consistent voice and tone across documentation that aligns with fal’s brand and developer-first philosophy. - Edit and curate contributed documentation to ensure quality, accuracy, and clarity. - Contribute to internal tooling improvements and documentation processes. - Use data and customer feedback to inform documentation priorities and improvements. WHAT YOU’LL ACHIEVE - A world-class developer documentation experience that helps users integrate fal’s platform faster and more efficiently. - Improved clarity and discoverability of our APIs, SDKs, and developer tools. - A measurable decrease in support tickets related to documentation gaps. - Strong collaboration workflows across product, engineering, and DevRel teams. ABOUT YOU - 5+ years of experience writing developer-focused technical documentation or equivalent experience as a software developer, solutions architect, or product manager. - Experience documenting RESTful APIs, SDKs, and developer platforms. - Excellent writing, editing, and organizational skills with strong attention to detail. - Strong technical acumen and the ability to quickly understand complex systems. - A customer-first mindset with a passion for helping others succeed. - Ability to thrive in a fast-paced startup environment where you’ll wear multiple hats. BONUS POINTS - Experience working on developer infrastructure, cloud platforms, or GenAI tooling. - Hands-on experience with Python or JavaScript. - Visual storytelling skills—experience creating diagrams, gifs, or video tutorials. - Understanding of AI and ML concepts, including model training, inference, and pipelines.

$70K – $150K • Offers Bonus
Show more

Staff Software Engineer, ML Performance & Systems

Help fal maintain its frontier position on model performance for generative media models. Design and implement novel approaches to model serving architecture on top of our in-house inference engine, focusing on maximizing throughput while minimizing latency and resource usage. Develop performance monitoring and profiling tools to identify bottlenecks and optimization opportunities. Work closely with our Applied ML team and customers (frontier labs on the media space) and make sure their workloads benefit from our accelerator. Key Responsibilities: - Help fal maintain its frontier position on model performance for generative media models. - Design and implement novel approaches to model serving architecture on top of our in-house inference engine, focusing on maximizing throughput while minimizing latency and resource usage. - Develop performance monitoring and profiling tools to identify bottlenecks and optimization opportunities. - Work closely with our Applied ML team and customers (frontier labs on the media space) and make sure their workloads benefit from our accelerator. Requirements: - Strong foundation in systems programming with expertise in identifying and fixing bottlenecks. - Deep understanding of cutting edge ML infrastructure stack (anything from PyTorch, TensorRT, TransformerEngine to Nsight), including model compilation, quantization, and serving architectures. Ideally following closely the developments in all these systems as they happen. - Have a fundamental view of the underlying hardware (Nvidia based systems at the moment), and when necessary go deeper into the stack to fix bottlenecks (custom GEMM kernels with CUTLASS for common shapes). - Proficient in Triton or willingness to learn with comparable experience in lower-level accelerator programming. - New frontier: multi-dimensional model parallelism (combining multiple parallelism techniques like TP with context parallel / sequence parallel). - Familiar with internals of Ring Attention, FA3, FusedMLP implementations.

$180K – $250K • Offers Equity • Offers Bonus
Show more

Growth Engineering

GROWTH ENGINEERING fal is building the next generation of generative-media infrastructure. Our users turn raw creativity into finished experiences in minutes, and we’re growing faster than we can keep up. We’re looking for a Growth Engineer who blends rapid-fire engineering with product-led growth instincts—someone who can ship code in the morning and close a partnership over coffee in the afternoon. WHY THIS ROLE MATTERS You’ll sit at the intersection of engineering, product, and GTM. Your scrappy prototypes, experiments, and content will be the first touchpoint for new creators, studios, and Fortune 500 innovation teams. When you do your job well, the rest of the company feels it in tomorrow’s dashboards. WHAT YOU’LL DO Spin up lightweight client libraries, demo apps, or event microsites in a few hours. Run data-driven experiments. Segment cohorts, design A/B tests, and automate reporting. We have a clear, metric-based view of acquisition cost and activation rate for every segment. Draft compelling blog posts, tweets, and teardown threads (zero “AI slop”). Our content consistently drives qualified sign-ups and sparks industry conversations. Own customer touchpoints: Meet prospects, debug their first calls, and represent fal at meetups and hackathons. Prospects leave every interaction saying, “These folks get it—and ship fast.” Identify high-leverage problems, time-box solutions, and ship. After ramp-up, you propose your own roadmap—and we mostly just say “Yes.” YOU MIGHT BE A FIT IF YOU - Ship code at the speed of thought. Fluent in Python and JavaScript (Next.js, React) and can stitch APIs, CLIs, and scrapers together before lunch. - Live in the metrics. SQL, Amplitude/Looker, or plain-text CSVs—whatever gets you to the insight fastest. - Write to persuade. Your copy earns clicks because it’s human, helpful, and opinionated. - Love people as much as code. You’re energized by demos, DMs, and IRL events. - Crave ownership. Ambiguous problems and blank pages don’t scare you; they excite you. - Geek out on generative media. You follow the latest diffusion paper for fun and have strong opinions on video model architectures. NICE-TO-HAVES - Prior experience in PLG or developer-tool startups - Familiarity with growth analytics stacks - A portfolio of technical writing, open-source libs, or side projects LOCATION: SAN FRANCISCO, CA · ON-SITE WHAT WE OFFER AT FAL - Interesting and challenging work - Competitive salary and equity - Employee-friendly equity terms (early exercise, extended exercise) - A lot of learning and growth opportunities - We offer visa sponsorship and will help you relocate to San Francisco. - Health, dental, and vision insurance (US) - Regular team events and offsites

$170K – $220K • Offers Equity
Show more

Staff Software Engineer, Compute

You are an experienced software engineer who thrives on building large scale computation platforms. You have deep expertise in backend systems that orchestrate workloads and route requests efficiently, while taking care of capacity and resource constraints. You possess a strong understanding of foundational cloud infrastructure and Linux provisioning and management tools. You know how to achieve reliability and scale with minimum operational load. Key responsibilities - Develop and maintain our core Python platform, which handles routing of requests, orchestration of AI workloads, GPU server capacity management, observability, authentication, rate limiting, and many others - Develop and maintain our infrastructure layer where we use Terraform, Ansible, and provider APIs to manage our fleet of GPU workers - Own K8s, FluxCD, Nomad, Prometheus, Thanos, Grafana, Loki, distributed networking storage, and other technologies that underpin our platform - Create the vision and lay the foundation for where our infrastructure should go in the next 1/2/5 years Requirements - Deep experience building distributed compute platforms, preferably with Python - Strong foundation in managing both cloud and bare metal infrastructure - Solid understanding of K8s and CI/CD on it - Excellent communication - Self-starter who executes quickly, takes ownership and constantly seeks improvement Location - San Francisco, CA What we offer at fal - Interesting and challenging work - Base salary $180,000-250,000 plus equity - Employee-friendly equity terms (early exercise, extended exercise) - A lot of learning and growth opportunities - We are currently hiring in downtown San Francisco. We prefer to work in-person but we also offer remote work opportunities for exceptional candidates. - We offer visa sponsorship and will help you relocate to San Francisco. - Health, dental, and vision insurance (US) - Regular team events and offsites

$180K – $250K • Offers Equity
Show more

More Roles

Senior Account Executive

SENIOR ACCOUNT EXECUTIVE You are a seasoned Account Executive with a proven track record of driving significant revenue growth in AI, SaaS, or technology startups. You excel at navigating complex sales cycles, building long-term relationships with enterprise customers, and leading strategic negotiations. By leveraging your deep understanding of AI infrastructure, you will help innovative organizations adopt fal's platform, amplify their capabilities, and shape the future of AI-powered media on a global scale. Responsibilities: - Own and optimize the entire sales cycle—from high-level prospecting to enterprise-level contracting—while effectively articulating fal's AI infrastructure benefits. - Develop and implement advanced sales strategies to break into new markets, manage executive-level relationships, and consistently exceed revenue targets. - Lead detailed product demonstrations and complex contract negotiations, ensuring alignment with both technical and business stakeholders. - Provide strategic market insights and champion customer feedback to influence product roadmaps and priority features. - Mentor junior members of the sales team, fostering a culture of knowledge sharing and consistent performance improvement. You may be a good fit if you have: - 5+ years of B2B sales experience in AI, SaaS, or technology startups, with a strong track record of exceeding quotas. - Proficiency in engaging and selling to C-suite or other high-level decision-makers within complex organizational structures. - Exceptional negotiation skills, including the ability to navigate multi-stakeholder deals with technical, legal, and financial components. - Outstanding communication and presentation abilities, capable of addressing both technical and non-technical audiences effectively. - A growth mindset with a passion for generative AI and a drive to pioneer cutting-edge solutions in a fast-paced environment. WHAT WE OFFER AT FAL - Interesting and challenging work - Competitive salary and equity - Employee-friendly equity terms (early exercise, extended exercise) - A lot of learning and growth opportunities - We are currently hiring in downtown San Francisco. We prefer to work in-person but we also offer remote work opportunities for exceptional candidates. - We offer visa sponsorship and will help you relocate to San Francisco. - Health, dental, and vision insurance (US) - Regular team events and offsites

$100K – $130K • Offers Equity
Show more

Applied ML Engineer

APPLIED ML ENGINEER You are an ML engineer who excels at bridging cutting-edge research and real-world applications. You have hands-on experience training, deploying, and maintaining machine learning models, ensuring they perform at scale. You stay current with new developments in AI, but your focus is always on delivering impactful solutions in production environments. TECH: - You will have access to our massive GPU cluster for training and inference - Some core technologies we use include Python, torch, diffusers, and the fal Python SDK - You'll work alongside a team dedicated to quickly iterating on and deploying new AI breakthroughs WHAT WE OFFER AT FAL - Interesting and challenging work - Competitive salary and equity - Employee-friendly equity terms (early exercise, extended exercise) - A lot of learning and growth opportunities - We are currently hiring in downtown San Francisco. We prefer to work in-person but we also offer remote work opportunities for exceptional candidates. - We offer visa sponsorship and will help you relocate to San Francisco. - Health, dental, and vision insurance (US) - Regular team events and offsites

Show more

Careers

What we offer at fal

  • Interesting and challenging work
  • Competitive salary and equity
  • Employee-friendly equity terms (early exercise, extended exercise)
  • We are currently hiring in downtown San Francisco. We prefer to work in-person but we also offer remote work opportunities for exceptional candidates.
  • We offer visa sponsorship and will help you relocate to San Francisco.
  • Health, dental, and vision insurance (US)
  • Regular team events and offsites