Phoenix is practically built with this idea in mind.
My startup is built out of a single Phoenix monolith. Our only external dependency is Postgres.
Despite this, I've managed to completely avoid all the typical scaling issues of a monolith. Need a service or background worker? I just make a GenServer or Oban worker and mount it in the supervision tree. The BEAM takes care of maintaining the process, pubsub to said service, and error handling in the event of a crash.
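Concretely, "mount it in the supervision tree" just means adding one more entry to the children list. A sketch (the module names are placeholders, not our real ones):

```elixir
# in lib/my_app/application.ex; every child is supervised the same way,
# whether it's the web endpoint, a GenServer we wrote, or Oban.
children = [
  MyApp.Repo,                          # Ecto database connection pool
  {Phoenix.PubSub, name: MyApp.PubSub},
  MyApp.SomeService,                   # a GenServer we wrote
  {Oban, Application.fetch_env!(:my_app, Oban)},
  MyAppWeb.Endpoint
]

# :one_for_one means a crashed child is restarted without touching its siblings
Supervisor.start_link(children, strategy: :one_for_one, name: MyApp.Supervisor)
```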
I'm basically getting all the best advantages of a monolith:
* one codebase to navigate, our team is TINY
* all functionality is in libraries
And all the advantages of microservices without the associated costs (orchestration, management, etc.):
* bottleneck use cases can live on their own nodes
* publish subscribe to dispatch background jobs
* services that each focus on doing one thing really well
We've accomplished all this through just libraries and a little bit of configuration. At no point so far have we had to:
* set up a message broker (Phoenix PubSub has been fine for us; we can send messages between different processes on the server, and to the clients themselves over channels)
* set up external clusters for background workers (Oban was drop-in and just works. I'm able to add a new worker without having to tell devops anything; the supervision tree allocates the memory for the process and takes care of fault tolerance for me. That leaves me free to focus on solving the business problems)
* set up a second GitHub repo (it's a monorepo, and at our scale that's fine; we have one library for communicating with the database that every system just uses)
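To give a sense of what "Phoenix PubSub instead of a broker" means in practice, here's a sketch (the topic and payload are made up):

```elixir
# Any process subscribes to a topic on the app's PubSub server...
Phoenix.PubSub.subscribe(MyApp.PubSub, "orders")

# ...and any other process, on any node in the cluster, broadcasts to it.
Phoenix.PubSub.broadcast(MyApp.PubSub, "orders", {:order_created, 42})

# The subscriber receives the broadcast as an ordinary message, e.g. in a
# GenServer callback: def handle_info({:order_created, id}, state), do: ...
```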
Eventually we'll probably have to start rethinking things and building out separate services. But I'm happy that we're already able to get some of the benefits of a microservice architecture while sticking to what makes monoliths great for MVPs. It will be a while before we need to think about scaling our web service. It just works. That leaves me more time to work on tuning our database to keep up!
In my experience, this is also a major benefit of running Akka on Scala or Java. I had the realization that it's basically a single-language Kubernetes, with some really nice abstractions built on top.
By using Kubernetes, you get a scalable infrastructure.
By using OTP/Akka, you get a scalable application.
While the two domains share common problems, they are still two different domains.
For example, using only Kubernetes, you won't have the ability to react to a Pod restart within your application (unless your application is aware of Kubernetes).
Using only OTP/Akka, you still need a workflow for deployment and infrastructure management, and you still need to implement node discovery for clustering.
NB: For Elixir, you have libcluster[1] that can use different strategies for node discovery, including using a Kubernetes Service to discover the nodes (as Pods).
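A sketch of what that looks like in config, using libcluster's Kubernetes DNS strategy (the service and application names here are placeholders):

```elixir
# config/runtime.exs — the DNS strategy resolves peer Pods through a
# headless Kubernetes Service and connects the BEAM nodes automatically.
config :libcluster,
  topologies: [
    k8s: [
      strategy: Cluster.Strategy.Kubernetes.DNS,
      config: [
        service: "myapp-headless",
        application_name: "myapp"
      ]
    ]
  ]
```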
EDIT: Using Kubernetes, libcluster, and Horde[2], you get the best of both worlds IMHO.
Thanks! I was just going to point out that Elixir/Phoenix + Libcluster + K8s is like a match made in heaven. I haven't tried Horde yet but I'm quite intrigued now.
Horde uses a CRDT[1] (Conflict-free Replicated Data Type) to provide a distributed Supervisor/Registry, which allows you to run an OTP supervision tree only once in your cluster, with automatic takeover. Basically, you start the supervisor on each node, but only one node will actually run it (thanks to the CRDT).
I find it very useful because "Distributed OTP Applications" are a pain IMHO (they must be started during the boot of the BEAM).
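A minimal sketch of the pattern, with made-up module names:

```elixir
# These go in the supervision tree on every node; members: :auto lets
# Horde discover the other nodes in the cluster.
children = [
  {Horde.Registry, name: MyApp.HordeRegistry, keys: :unique, members: :auto},
  {Horde.DynamicSupervisor,
   name: MyApp.HordeSupervisor, strategy: :one_for_one, members: :auto}
]

# Starting a child this way runs it once somewhere in the cluster; if that
# node goes down, Horde restarts the process on a surviving node.
Horde.DynamicSupervisor.start_child(MyApp.HordeSupervisor, MyApp.SingletonWorker)
```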
Wow, thank you! This actually handles a use case we're trying to solve in an upcoming sprint (each tenant needs to maintain a persistent websocket connection to a third-party API, but through us). I was trying to use Registry to do it, but was wondering how I'd scale it once it got big enough.
this sounds really neat and I'm going to go read about it!
I also wanted to call out that this ~sentence has just... a bunch of things that seem like jargon/names within the community? As an outsider, I have no idea what they mean:
> make a Genserver or Oban worker and mount it in the supervision tree. The Beam takes care of maintaining the process
Elixir inherits a library called OTP from Erlang, which is a set of primitives for building massively concurrent systems.
A GenServer is sort of like a base class for an object that runs as an independent process that plugs into OTP. By implementing the GenServer behaviour, you create an independent process that can be mounted in an OTP supervision tree, which runs at the top of your application and monitors everything below it. Out of the box, that means your process can be sent messages by other processes and can send messages itself. If a crash happens, the supervisor will kill it, restart it, and redirect messages to the new process.
Creating a GenServer is as easy as adding a `use GenServer` line and implementing a few callbacks.
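A toy sketch of what those callbacks look like (the counter is purely illustrative):

```elixir
defmodule Counter do
  use GenServer   # pulls in the GenServer behaviour and default callbacks

  # Client API: these functions run in the caller's process
  def start_link(initial), do: GenServer.start_link(__MODULE__, initial, name: __MODULE__)
  def bump, do: GenServer.call(__MODULE__, :bump)

  # Server callbacks: these run inside the Counter process itself
  @impl true
  def init(initial), do: {:ok, initial}

  @impl true
  def handle_call(:bump, _from, count), do: {:reply, count + 1, count + 1}
end
```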
GenServers are the base on which a lot of other systems build. Oban, a job processing library, essentially builds on GenServer to use a Postgres table as a job queue. Since it's just a GenServer with some added behavior, adding a background worker is as simple as adding a file that implements Oban's worker behaviour and specifying how many workers should be allocated for it in a config file. The result is that adding a background worker is about as much work for me as adding a controller. No additional work for devops either.
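For comparison, an Oban worker is barely more code. A sketch (the queue name and mailer function are made up):

```elixir
defmodule MyApp.Workers.SendEmail do
  # queue and max_attempts come from this line plus the config file
  use Oban.Worker, queue: :mailers, max_attempts: 3

  @impl Oban.Worker
  def perform(%Oban.Job{args: %{"to" => to}}) do
    MyApp.Mailer.deliver_welcome(to)  # hypothetical mailer call
    :ok
  end
end

# Enqueuing persists a row in Postgres, so the job survives restarts:
%{to: "user@example.com"}
|> MyApp.Workers.SendEmail.new()
|> Oban.insert()
```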
And yes, it is very sci-fi. Honestly, I'm shocked Elixir isn't more widespread. There's very little hype behind it, but the engineering is pretty solid. Every scaling bottleneck we've had so far has been in the database (only because we make particularly heavy use of stored procedures).
In the world of OTP (Open Telecom Platform), a process is the term for what is essentially a green thread, not an OS process!
So it is:
a) much, much more lightweight (IIRC ~1 KB)
b) scheduled by the scheduler of the Erlang virtual machine (the so-called BEAM). The BEAM runs one scheduler per OS thread, all inside the BEAM's own OS process
c) independently garbage collected, no mutable memory sharing
This isn't a knock against Elixir or the Erlang ecosystem but I would definitely say that Elixir gets a decent amount of hype. Each time a new release comes out it invariably shoots to the front page of HN.
1. If I'm calling directly, I can just use a raw SQL query.
2. Views can be backed with a read-only Ecto schema.
3. Triggers can be set to run without your Ecto code even being aware of it.
4. For custom errors, you can add overrides for the error handling in Ecto to transform things like deadlocks into 400 errors.
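Points 1 and 2 look roughly like this (the table, view, and procedure names are invented for illustration):

```elixir
# 1. Raw SQL, e.g. calling a stored procedure directly through the Repo:
Ecto.Adapters.SQL.query!(MyApp.Repo, "SELECT * FROM monthly_totals($1)", [account_id])

# 2. A read-only Ecto schema backed by a database view:
defmodule MyApp.MonthlyTotal do
  use Ecto.Schema

  @primary_key false
  schema "monthly_totals" do   # actually a Postgres view, not a table
    field :account_id, :integer
    field :total, :decimal
  end
end

# Queried like any other schema, just never written to:
MyApp.Repo.all(MyApp.MonthlyTotal)
```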
Some of these ideas are written up in the late Joe Armstrong's dissertation about Erlang and OTP "Making reliable distributed systems in the presence of software errors". It's a few hundred pages but quite readable to a programming audience -- it isn't filled with acres of formal proofs or highly specialised jargon.
See chapter 4 "Programming Techniques" section 4.1 "Abstracting out concurrency" and chapter 6 "Building an Application" section 6.2 "Generic server principles"
They do sound very sci-fi. I would say it made me more inclined to dig a little deeper myself hahahaha.
GenServers are an abstraction encapsulating the typical request/response lifecycle of what we would consider a "server", but applied to a BEAM-specific process. Short for "generic server".
Oban allows for job processing, instrumented much like you would any other process in the BEAM. It's an external library, while GenServer is built in.
Really great talk! He mentions some problems with BEAM distribution, but didn't get into details. Do you have any idea what those problems are?
I have done much the same in plain old Java, at a couple of jobs, going back ten years or so.
No OTP, GenServer, etc. Just a webserver and some framework for scheduled jobs, third party or write your own. Config enables individual routes and jobs at the top level. You can deploy one instance with everything enabled, or a hundred instances with the customer-facing web routes enabled in 60, API routes in 20, admin routes in 5, report generation jobs in 10, maintenance jobs in 5, or anywhere in between.
The only discipline you have to stick to is that you must pass information between components in a way which will work when the app is distributed. A shared database is the most obvious, and may be enough. We also used message queues, and at one point one of those distributed caches that were all the rage (Hazelcast / Terracotta / Infinispan / EHCache / etc - anyone remember JavaSpaces and Jini?).
We have a similar philosophy with our Python code base. Redis and Postgres are our only real dependencies. We use Celery, but it's basically just Python. It maybe takes a little more work to set it all up, but once done it provides similar benefits.
How do you keep track of background jobs? Is the queue persisted on disk somewhere? If it's not and a background worker crashes for some reason, then is the job lost?
That's a very nice library. BEAM + Elixir + OTP supervision trees + Oban for background jobs + PostgreSQL for persistent storage seems like a killer combination. And like you said, you can scale horizontally naturally with the Erlang platform without even using k8s. Really interesting.
This was possible because the BEAM lets you do the things you describe and enjoy the advantages of both worlds (monolithic and microservices). If someone doesn't use Elixir/Erlang, however, or if the product consists of parts written in multiple languages (for whatever reason), then it's simply not possible to get the advantages of microservices with a monolithic approach.
We moved to microservices, despite my love for a monolith with libraries:
- to enable different teams to deploy their smaller microservice more easily (without QA, database migrations etc affecting the whole app);
- to solve the human temptation of crossing abstraction boundaries.
Not black and white, and both approaches can solve these problems. Open to any feedback!
Yeah I moved to microservices for the exact same reasons. People are undisciplined, and making it harder (and much more obvious) for them to do the wrong thing is more important than having all your code in one place. Plus, if you need to upgrade some dependency or if you want to try a new language or library or idiom, you can do so without the risk, effort and sunk cost of upgrading the entire application.
> And all the advantages of microservices without the associated costs. (orchestration, management etc)
Really curious about this! What's the deployment experience like? What environment do you use for production? How do you run and maintain the Erlang VM and deploy your service?
We use EKS with some customizations specific to Elixir, and use an autodiscovery service for new nodes to connect to the mesh.
One of the big changes we had to make was allocating one BEAM node per machine, as the BEAM likes to have access to all of a machine's resources. In practice this isn't a problem, because its internal scheduler is way more efficient and performant than anything managing OS-level processes.
That said, a more complex deployment story is definitely one of the downsides. But the good news is that once it's set up, it's pretty damn resilient.
Now of course, our deployment setup is more complex specifically to take advantage of BEAM features such as distributed pubsub and shared state across the cluster. If you don't need that, you could use Dokku or Heroku.
To add to this, we use Phoenix in a typical dockerized, stateless setup (load balancer, N web servers, Postgres/Redis) and it works great. Deployment is exactly the same as any other dockerized webapp. What OP is doing is the "next level" that allows you to really leverage the BEAM, but you don't have to.
I hadn't heard of this, but it looks cool. Looks like something that will do the job when we need it. So far we just store the parameters from certain heavily used mutations in a job table and run the actual insertion in a background worker. We're nowhere near needing this yet, but it'll be good to have it on hand when we (hopefully) get to that point.
A zero-cost alternative that has worked well for me so far is to use a front-end load balancer to distribute requests to multiple Phoenix instances (in k8s), and then just let those requests' background tasks run on the node that starts them.
The whole app is approximately a websocket-based chat app (with some other stuff), and the beauty of OTP + libcluster is that the websocket processes can communicate with each other, whether or not they're running on the same OTP node.
Not automatically, but it's pretty easy to configure in your supervision tree. I don't know the details, because whatever happens by default has taken care of our needs so far.
It does automatically distribute processes across all the cores of a CPU, though.
What I like about the Erlang platform is that it seems like it has the most sensible βmicroserviceβ story: deploy your language runtime to all the nodes of a cluster and then configure distribution in code. Lambdas, containers, etc. all push this stuff outside your code into deployment tooling that is, inevitably, less pleasant to manage than your codebase.