libwebsockets

mirror of https://github.com/warmcat/libwebsockets.git synced 2025-03-16 00:00:07 +01:00

History

Andy Green fd810f198a http proxy: client: unix socket support This allows the client stuff to understand that addresses beginning with '+' represent unix sockets. If the first character after the '+' is '@', it understands that the '@' should be read as '\0', in order to use Linux "abstract namespace" sockets. Further the lws_parse_uri() helper is extended to understand the convention that an address starting with + is a unix socket, and treats the socket path as delimited by ':', eg http://+/var/run/mysocket:/my/path HTTP Proxy is updated to allow mounts to these unix socket paths. Proxy connections go out on h1, but are dynamically translated to h1 or h2 on the incoming side. Proxy usage of libhubbub is separated out... LWS_WITH_HTTP_PROXY is on by default, and LWS_WITH_HUBBUB is off by default.		2018-09-12 13:58:13 +08:00
..
mainpage.md	libwebsockets.h: split out into a dir of sub-includes included by libwebsockets.h	2018-09-11 18:27:59 +08:00
README-plugin-sshd-base.md	Plugins: add ssh-base ssh server plugin	2017-10-16 16:59:57 +08:00
README.build.md	mbedtls: wrapper: client: Force mbedTLS to attemp to verify cert	2018-04-06 10:38:03 +08:00
README.coding.md	client: h2	2018-04-06 10:38:03 +08:00
README.content-security-policy.md	docs: CSP	2018-09-12 13:58:13 +08:00
README.esp32.md	esp32: map basic auth to nvs	2018-02-24 08:14:17 +08:00
README.generic-sessions.md	clean up top level of project	2017-09-27 08:24:05 +08:00
README.generic-table.md	clean up top level of project	2017-09-27 08:24:05 +08:00
README.lws-meta.md	clean up top level of project	2017-09-27 08:24:05 +08:00
README.lwsws.md	network interface: defer bindings to absent network interfaces	2018-04-06 10:38:03 +08:00
README.plugin-acme.md	ACME client plugin	2017-12-01 11:37:35 +08:00
README.problems.md	clean up top level of project	2017-09-27 08:24:05 +08:00
README.test-apps.md	autobahn fixes	2018-04-22 06:45:46 +08:00
README.unix-domain-reverse-proxy.md	http proxy: client: unix socket support	2018-09-12 13:58:13 +08:00
release-checklist	http: enlarge headers buffers since they may meet large headers from vhost config	2018-09-11 18:27:59 +08:00

README.unix-domain-reverse-proxy.md

Unix Domain Sockets Reverse Proxy

Introduction

lws is able to use a mount to place reverse proxies into the URL space.

These are particularly useful when using Unix Domain Sockets, basically files in the server filesystem, to communicate between lws and a separate server process and integrate the result into a coherent URL namespace on the lws side.

This has the advantage that the actual web server that forwards the data from the unix socket owner is in a different process than the server that serves on the unix socket. If it has problems, they do not affect the actual public-facing web server. The unix domain socket server may be in a completely different language than the web server.

Compared to CGI, there are no forks to make a connection to the unix domain socket server.

Mount origin format

Unix Domain Sockets are effectively "files" in the server filesystem, and are defined by their filepath. The "server" side that is to be proxied opens the socket and listens on it, which creates a file in the server filesystem. The socket understands either http or https protocol.

Lws can be told to act as a proxy for that at a mountpoint in the lws vhost url space.

If your mount is expressed in C code, then the mount type is LWSMPRO_HTTP or LWSMPRO_HTTPS depending on the protocol the unix socket understands, and the origin address has the form +/path/to/unix/socket:/path/inside/mount.

The + at the start indicates it is a local unix socket we are proxying, and the ':' acts as a delimiter for the socket path, since unlike other addresses the unix socket path can contain '/' itself.

Connectivity rules and translations

Onward proxy connections from lws to the Unix Domain Socket happen using http/1.1. That implies transfer-encoding: chunking in the case that the length of the output is not known beforehand.

Lws takes care of stripping any chunking (which is illegal in h2) and translating between h1 and h2 header formats if the return connection is actually in http/2.

The h1 onward proxy connection translates the following headers from the return connection, which may be h1 or h2:

Header	Function
host	Which vhost
etag	Information on any etag the client has cached for this URI
if-modified-since	Information on the freshness of any etag the client has cached for this URI
accept-language	Which languages the return path client prefers
accept-encoding	Which compression encodings the client can accept
cache-control	Information from the return path client about cache acceptability
x-forwarded-for	The IP address of the return path client

This implies that the proxied connection can

return 301 etc to say the return path client's etag is still valid
choose to compress using an acceptable content-encoding

The following headers are translated from the headers replied via the onward connection (always h1) back to the return path (which may be h1 or h2)

Header	Function
content-length	If present, an assertion of how much payload is expected
content-type	The mimetype of the payload
etag	The canonical etag for the content at this URI
accept-language	This is returned to the return path client because there is no easy way for the return path client to know what it sent originally. It allows clientside selection of i18n.
content-encoding	Any compression format on the payload (selected from what the client sent in accept-encoding, if anything)
cache-control	The onward server's response about cacheability of its payload

h1 -> h2 conversion

Chunked encoding that may have been used on the outgoing proxy client connection is removed for h2 return connections (chunked encoding is illegal for h2).

Headers are converted to all lower-case and hpack format for h2 return connections.

Header and payload proxying is staged according to when the return connection (which may be an h2 child stream) is writable.

Behaviour is unix domain socket server unavailable

If the server that listens on the unix domain socket is down or being restarted, lws understands that it couldn't connect to it and returns a clean 503 response HTTP_STATUS_SERVICE_UNAVAILABLE along with a brief human-readable explanation.