rdelaage/gmid

mirror of https://github.com/omar-polo/gmid.git synced 2024-09-30 14:31:13 +02:00

Author	SHA1	Message	Date
Omar Polo	b9b77f5344	fix comment	2022-01-27 09:28:27 +00:00
Omar Polo	901905e0cf	bail out of client_read if we've already decided what to do libevent2 can still somehow call client_read even in code paths that never enable reading from the evbuffer. Can't reproduce on the libevent in base on OpenBSD. It's a bit ugly, but it's a small workaround for something that otherwise always makes gmid crash when linked against libevent2. (client_read works under the assumption that c->host != NULL, matched_proxy crashes otherwise.)	2022-01-05 18:58:01 +00:00
Omar Polo	876a417023	tweak comment	2022-01-05 18:03:47 +00:00
Omar Polo	d474a97922	add missing prototype	2022-01-04 23:15:13 +00:00
Omar Polo	ba94a608a8	add `require client ca' for proxy blocks refactor the code that calls validate_against_ca into a helper function to reuse it in both apply_require_ca and (optionally) in apply_reverse_proxy.	2022-01-04 23:14:34 +00:00
Omar Polo	b7967bc1f6	proxy: allow multiple proxy blocks, matching options and validations as a side effect the order of the content of a server block is relaxed: options, location or proxy blocks can be put in any order.	2022-01-02 16:33:28 +00:00
Omar Polo	593e412b49	allow to disable TLS when proxying requests	2022-01-01 20:16:14 +00:00
Omar Polo	7bdcc91ec7	simplify the proxying code it doesn't make any sense to keep the proxying info per-location: there is only one proxy per vhost. It can't work differently, it doesn't make sense anyway.	2022-01-01 17:08:39 +00:00
Omar Polo	d49093c105	support optional client certificate for proxy rule	2022-01-01 16:33:44 +00:00
Omar Polo	6a6b4a2a98	typo	2021-12-29 20:36:54 +00:00
Omar Polo	72b033ef18	add ability to proxy requests Add to gmid the ability to forward a request to another gemini server and thus act like a reverse proxy. The current syntax for the config file is server "example.com" { ... proxy relay-to host:port } Further options (like the use of custom certificates) are planned. cf. github issue #7	2021-12-29 20:36:54 +00:00
Omar Polo	52c92ef680	relax the "wont proxy request" check: don't check the port number Don't refuse to serve the request if the port number doesn't match the one we're listening on, as initially suggested by Allen Sobot. Complex setups may have a gmid instance reachable from multiple ports and the meaning of the check in the first place was to avoid tricking clients into thinking that we're serving for those domains: the port number is way less important than the schema or domain name. In the long run, the best way would probably be to add a `listen on' keyword for the server blocks, just like OpenBSD's httpd, but gmid can't listen on multiple ports/interfaces yet	2021-12-09 20:59:05 +00:00
Omar Polo	4842c72d9f	fmt	2021-10-18 10:05:55 +00:00
Omar Polo	8044493865	move bufferevent initialization early in handle_handshake the error path needs an initialized bufferevent too, otherwise it'll crash when trying to write the response. This moves the initialisation early, right after the tls_handshake. Another option would be to initialise it in do_accept, but that may be too early.	2021-10-15 07:46:30 +00:00
Omar Polo	c62a411f4f	don't die on ECONNABORTED ECONNABORTED is returned if a connection gets aborted after being queued before the accept(2). I had some cases of accept: Software caused connection abort on FreeBSD, this should avoid that.	2021-10-13 20:49:58 +00:00
Omar Polo	5eb3fc905f	don't work around a missing -Wno-unused-parameter It's been there for a long time, and it's frankly annoying to pretend to use parameters. Most of the time, they're there to satisfy an interface and nothing more.	2021-10-09 18:54:41 +00:00
Omar Polo	207b3e80d8	Store clients inside a splay tree From day one we've been using a static array of client struct to hold the clients data. This has various drawbacks, among which: * reuse of the storage ("shades of heartbleed") * maximum fixed amount of clients connected at the same time * bugs are harder to debug The last point in particular is important because if we mess up the client ids, or try to execute some functions (e.g. the various fcgi_*) after a client has been disconnected, it's harder to "see" this "use after free"-tier kind of bug. Now I'm using a splay tree to hold the data about the live connections. Each client's data is managed by malloc. If we try to access a client's data after the disconnection we'll probably crash with a SIGSEGV and finding the bug is easier. Performance-wise the connection phase should be faster since we don't have to loop anymore to find an empty spot in the clients array, but some operations could be slightly slower (compare the O(1) access in an array with a SPLAY_FIND operation -- still faster than O(n) though.)	2021-10-07 11:20:34 +00:00
Omar Polo	4cd2520965	one FastCGI connection per client FastCGI is designed to multiplex requests over a single connection, so ideally the server can open only one connection per worker to the FastCGI application and that's that. Doing this kind of multiplexing makes the code harder to follow and easier to break/leak etc on the gmid side however. OpenBSD's httpd seems to open one connection per client, so why can't we too? One connection per request is still way better (lighter) than using CGI, and we can avoid all the pitfalls of the multiplexing (keeping track of "live ids", properly shut down etc...)	2021-10-07 10:47:02 +00:00
Omar Polo	e4daebe44a	plug a memory leak c->req is set in client_read but never deallocated	2021-10-06 17:38:37 +00:00
Omar Polo	807a80cb9e	fmt	2021-10-06 16:36:31 +00:00
Omar Polo	acafce5b7d	libevent2 fix: unfreeze the client evbuffer libevent2 has this concept of "freezeness" of a buffer. It's a way to avoid accidentally writing/removing data from the wrong "edge" of the buffer. The client_tls_{read,write} functions need to add/drain data from the opposite edge, hence the need for the unfreeze call. This is the minimum change in order to work on libevent2 too. Another way would be to define evbuffer_{un,}freeze as NOP on libevent 1, but it's ugly IMHO.	2021-10-02 17:20:56 +00:00
Omar Polo	efe7d18029	new I/O handling on top of bufferevents This is a big change in how gmid handles I/O. Initially we used a hand-written loop over poll(2), that then evolved into something powered by libevent's basic API. This meant that there were a lot of small "asynchronous" functions that did one step, eventually scheduling the re-execution, that called each other in a chain. The new implementation revolves completely around libevent's bufferevents. It's clearer, as everything is implemented around the client_read and client_write functions. There is still space for improvements, like adding timeouts for one, but it's solid enough to be committed as is and then further improved.	2021-10-02 17:20:56 +00:00
Omar Polo	741b69be96	fastcgi completely asynchronous This changes the fastcgi implementation from a blocking I/O to an async implementation on top of libevent's bufferevents. Should improve the responsiveness of gmid especially when using remote fastcgi applications.	2021-09-26 17:00:07 +00:00
Omar Polo	83fe545a2b	initialize mbufhead	2021-09-26 16:43:19 +00:00
Omar Polo	3571854e94	fix possible out-of-bounds access While computing the parent directory an out-of-bounds access can occur, which usually means the server process dies. In particular, it can be triggered by making a request for a non-existent file in the root of a virtual host if the path matches the `cgi` pattern. Thanks cage for helping in debugging!	2021-09-24 10:48:51 +00:00
Omar Polo	353e3c8ebe	style	2021-09-24 08:16:28 +00:00
Omar Polo	a91ad7f2ff	drop unnecessary bzero the whole struct client is already memset'd to 0 in do_accept. handle_handshake doesn't touch the request or iri buffer in the code path that leads to handle_open_conn. (It does so in the error router alone.)	2021-09-24 08:08:49 +00:00
Omar Polo	79288c8b60	making more explicit the case of missing SNI Missing SNI (i.e. servname == NULL) is already handled correctly. puny_decode refuses to work on NULL servname, c->domain is still the empty string and everything flows as expected towards the error at the end. However, it's better to bail out early and make more explicit how the case of missing SNI is handled.	2021-09-24 07:40:24 +00:00
Omar Polo	efb48052dc	relax openat rule: follow symlinks O_NOFOLLOW acts only on the last component, so on open("/foo/bar/baz") only when baz is a symlink open fails. Checking every path component is not viable. gh issue #5 related (sort of)	2021-07-27 09:21:42 +00:00
Omar Polo	a8a1f43921	style(9)-ify	2021-07-07 09:46:37 +00:00
Omar Polo	090b8a89fa	gracefully shut down fastcgi backends we need to delete the events associated with the backends, otherwise the server process won't ever quit. Here, we add a pending counter to every backend and shut down immediately if they aren't handling any client; otherwise we try to close them as soon as possible (i.e. when they close the connection to the last connected client.)	2021-07-06 10:54:27 +00:00
Omar Polo	1b78bd563a	strncpy -> strlcpy quoting strncpy(3) strncpy() only NUL terminates the destination string when the length of the source string is less than the length parameter. strlcpy is more intuitive. this is another warning gcc 8 found that clang didn't.	2021-06-16 15:06:10 +00:00
Omar Polo	24d362cd67	explicitly use c->fd instead of fd Yep, fd should be the file descriptor, but for laziness when manually calling the function sometimes we supply 0 as fd and event. Instead of fixing the usage, do as other such functions do in these circumstances: use c->fd.	2021-06-12 13:42:43 +00:00
Omar Polo	89c88caa3c	mark backend as FCGI_READY when getting a fd otherwise clients will remain stuck waiting for a pending request that doesn't exist (see apply_fastcgi switch.)	2021-06-12 13:41:33 +00:00
Omar Polo	1feaf2a618	use the correct document root pass the correct loc_off to the executor, so the various variables that depend on the matched location (like DOCUMENT_ROOT) are computed correctly.	2021-05-15 10:31:43 +00:00
Omar Polo	91b9f2a8f9	const-ify strip_path	2021-05-15 10:07:21 +00:00
Omar Polo	571d20fbb3	fmt	2021-05-15 10:04:58 +00:00
Omar Polo	8ad1c57024	fastcgi: a first implementation Not production-ready yet, but it's a start. This adds a third ``backend'' for gmid: until now it served local files or CGI scripts, now FastCGI applications too. FastCGI is meant to be an improvement over CGI: instead of exec'ing a script for every request, it allows opening a single connection to an ``application'' and sending the requests/receiving the responses over that socket using a simple binary protocol. At the moment gmid supports three different methods of opening a fastcgi connection: - local unix sockets, with: fastcgi "/path/to/sock" - network sockets, with: fastcgi tcp "host" [port] port defaults to 9000 and can be either a string or a number - subprocess, with: fastcgi spawn "/path/to/program" the fastcgi protocol is done over the executed program's stdin. Of these, the last is only for testing and may be removed in the future. P.S.: the fastcgi rule is per-location of course :)	2021-05-09 18:23:36 +00:00
Omar Polo	737a6b50c5	ensure %p (path) is always absolute with the recent changes, sometimes the path may not start with a '/'. This ensures that %p is ALWAYS an absolute path.	2021-04-30 19:07:37 +00:00
Omar Polo	fdea6aa0bc	allow ``root'' rule to be specified per-location block	2021-04-30 17:16:34 +00:00
Omar Polo	cc8c2901ad	added ``alias'' option to define hostname aliases for a server	2021-04-29 18:23:35 +00:00
Omar Polo	e76f2c74b8	don't save the directory fd in c->pfd scandir_fd already calls closedir, which in turns closes the fd	2021-04-25 12:19:06 +00:00
Omar Polo	11c986679a	sort the auto index alphabetically	2021-04-25 12:06:54 +00:00
Omar Polo	74c0c7e4ce	rename reschedule_* to yield_*	2021-04-20 09:40:09 +00:00
Omar Polo	89541eeec0	define TLS_VERSION, TLS_CIPHER and TLS_CIPHER_STRENGTH for CGI scripts	2021-04-13 06:59:54 +00:00
Omar Polo	b8e64ccd44	list instead of fixed-size array for vhosts and locations saves some bytes of memory and removes the limit on the maximum number of vhosts and location blocks.	2021-03-31 16:32:18 +00:00
Omar Polo	62e001b067	move all sandbox-related code to sandbox.c while there, add capsicum for the logger process	2021-03-20 08:42:08 +00:00
Omar Polo	bc99d868bc	refactoring: imsg everywhere use imsg to handle ALL kinds of IPC in gmid. This simplifies and shorten the code, and makes everything more uniform too.	2021-03-19 19:21:29 +00:00
Omar Polo	4604dc9671	move vhost_should_log call to server.c log.o is linked to some regress/ stuff. Calling from there a vhost_* function means that we should link the regress/stuff to server.o too (and that would pull in other stuff...). Moving the call is easier, and also probably better.	2021-02-23 13:43:33 +01:00
Omar Polo	793835cb26	add `log on/off' to enable/disable logs per-location	2021-02-23 13:43:24 +01:00
Omar Polo	6b191ed52a	tests and compat for imsg	2021-02-23 13:43:14 +01:00
Omar Polo	c39b26d308	mark reschedule_write inline & static	2021-02-12 20:25:48 +00:00
Omar Polo	eecad7a3ca	other s/fnmatch/matches	2021-02-12 19:51:54 +00:00
Omar Polo	52418c8d82	fix various compilation errors Include gmid.h as first header in every file, as it then includes config.h (that defines _GNU_SOURCE for instance). Fix also a warning about unsigned vs signed const char pointers in openssl.	2021-02-12 12:47:20 +00:00
Omar Polo	3cb3dd4d42	accept4 -> accept accept4(2) isn't part of any standard (even though it'll be part in the future) and raises warnings on some Linux distros. Moreover, we don't have threads that may fork at any time, so doing a mark_nonblock after isn't a big deal.	2021-02-12 11:59:03 +00:00
Omar Polo	5e3285d52e	typo	2021-02-12 11:34:17 +00:00
Omar Polo	98ee8406aa	fix occurrence of (killed) load_file	2021-02-12 11:32:49 +00:00
Omar Polo	27b2fa9ae5	don't mmap Before, we mmap(2)'d files for reading and used a buffer to handle CGI scripts. Turns out, for sequential access over the whole file, mmap isn't better than our loop on read. This has also the additional advantage that we can use handle_cgi (now handle_copy) for both files and CGI, which is pretty cool. This also fixes a nasty bug where we could hang a connection forever, because we scheduled the wrong type of event (read on POLLOUT and write on POLLIN, it's the other way around!)	2021-02-12 11:27:33 +00:00
Omar Polo	a6e689d745	fix config reload the old server processes would stick around waiting on the signals events. While there, also drop the `struct server_events' and define events as globals.	2021-02-12 08:50:25 +00:00
Omar Polo	49b73ba1ab	fix "first location" bug reported by devel at datenbrei dot de. The first location would overwrite the default value for a server, triggering the "`foo' rule specified more than once" error. This also needed a small tweak on how we match locations to avoid breaking other tests.	2021-02-10 16:37:08 +00:00
Omar Polo	02be96c6dd	add `require client ca' rule to require certs signed by a CA	2021-02-09 22:30:04 +00:00
Omar Polo	57ec3e776e	refactor apply_block_return move the strip and fmt logic to their own function	2021-02-08 20:50:30 +00:00
Omar Polo	df58efff26	fix seccomp for the new event loop add/remove syscalls from the BPF filter and move sandbox() after libevent initialisation	2021-02-08 12:46:46 +00:00
Omar Polo	abc007d2b3	rewrite main loop using libevent	2021-02-08 10:01:45 +00:00
Omar Polo	b63e30ff44	define TLS_CLIENT_NOT_BEFORE/NOT_AFTER in CGI scripts	2021-02-07 21:47:01 +00:00
Omar Polo	3077ce5bee	don't fprintf	2021-02-07 16:10:09 +00:00
Omar Polo	3abf91b0b4	improve logs management	2021-02-07 15:30:28 +00:00
Omar Polo	cfb8a77fd4	handle also EAGAIN together with EWOULDBLOCK	2021-02-07 12:04:11 +00:00
Omar Polo	e3ddf39095	add the ``entrypoint'' option	2021-02-06 18:28:43 +00:00
Omar Polo	cd76162494	swap check in vhost_* fns it's faster (statistically speaking) to first compute if the option is set and then fnmatch than the inverse. This way we can avoid unnecessary fnmatch.	2021-02-06 17:31:03 +00:00
Omar Polo	6abda252e9	added ``block return'' and ``strip'' options	2021-02-06 17:22:37 +00:00
Omar Polo	daac4a9452	fix auto index precedence	2021-02-06 14:36:26 +00:00
Omar Polo	ca21e10043	reload configuration on SIGHUP	2021-02-04 13:23:15 +00:00
Omar Polo	1e3ef7ab4f	use upper bound given by poll it's a waste to loop through all fds. We know the exact number of clients that needs attention, so use that information to limit the looping.	2021-02-03 21:14:48 +00:00
Omar Polo	9b8f5ed2c0	revert commit `346f28eeaa` keep mark_nonblock in utils.c, as otherwise the build for the regress suite will fail (mark_nonblock needs fatal which is in gmid.c, and we can't link gmid.o with the regress suite...)	2021-02-03 14:16:39 +00:00
Omar Polo	346f28eeaa	move mark_nonblock to utils.c	2021-02-02 23:03:33 +00:00
Omar Polo	fe40638928	mark various functions as static By marking all those functions as static, the compiler is free to do more optimizations. In addition, those functions are not used outside server.c	2021-02-02 23:01:09 +00:00
Omar Polo	87f2b68b58	cgi now follows globbing rules	2021-02-02 22:38:35 +00:00
Omar Polo	5f715ce43f	print the header in the directory listing	2021-02-02 09:48:32 +00:00
Omar Polo	35744950aa	simplify handle_cgi Now that I got rid of the enum+switch, adding more state is easier. Before, we used a hack to remember if we had read the CGI reply or not (c->code = -1). This introduces a new state, handle_cgi_reply, that reads the CGI script reply, logs it, and only then switches to handle_cgi. handle_cgi itself is cleaner, now it only reads into c->sbuf and sends what it has read. We even get, almost for free, the 42 error. If read exits with -1 or 0 in handle_cgi_reply, we return a proper error to the client. We can extend this further in the future and also try to validate the CGI reply (for now we're only looking for a \n).	2021-02-01 22:04:51 +00:00
Omar Polo	b06f80cdf4	switch to handle_open_conn right after handshake So we don't re-enter handle_handshake and re-do the loop on fnmatch etc. This way, once we're successfully past the handshake, we'll re-enter handle_open_conn.	2021-02-01 20:27:08 +00:00
Omar Polo	112802ea31	client state machine: function pointers instead of enum+switch	2021-02-01 20:00:33 +00:00
Omar Polo	2fafa2d23e	bring the CGI implementation on par with GLV-1.12556	2021-02-01 11:11:43 +00:00
Omar Polo	b59f3cdd27	typo	2021-01-30 12:12:37 +00:00
Omar Polo	6016a593a3	invert the location precedence: first match wins It's how httpd(8) does it, and it allows us to call fnmatch fewer times	2021-01-30 12:04:20 +00:00
Omar Polo	a8d4a89770	don't ignore punycode errors when decoding SNI-provided servname	2021-01-29 17:29:14 +00:00
Omar Polo	a2fd801327	puny_decode: set an error string	2021-01-29 17:11:03 +00:00
Omar Polo	90cb9eea8a	don't log the SNI & matching I'll re-enable this when I improve the logging	2021-01-28 16:28:44 +00:00
Omar Polo	22c6d6334d	log info about SNI, punycode and matched vhost	2021-01-27 15:06:15 +00:00
Omar Polo	caad03081b	some null checks	2021-01-27 15:05:50 +00:00
Omar Polo	c4f682f855	trim_req_iri: set error string	2021-01-27 15:05:16 +00:00
Omar Polo	3300cbe06a	initial punycode support	2021-01-27 10:47:49 +00:00
Omar Polo	8443bff77a	rework the configless mode: change flags and generate certs	2021-01-25 14:08:31 +00:00
Omar Polo	252908e6bb	added support for location blocks	2021-01-24 18:53:26 +00:00
Omar Polo	c8b7433918	added support for location blocks	2021-01-24 14:11:40 +00:00
Omar Polo	07b0a14218	void-ify some functions their return value is no longer used, it's only confusing at this point.	2021-01-24 09:54:44 +00:00
Omar Polo	a87f662565	refactoring state management instead of having a flag to discern between two different behaviours in S_SENDING, split that state into S_SENDING_FILE and S_SENDING_CGI (this will also make it easier in the future to add other sending states). While there, also get rid of `goodbye' and make start_reply advance the state machine by itself.	2021-01-24 09:49:09 +00:00
Omar Polo	e7a2a99b5a	added index option	2021-01-24 09:14:01 +00:00
Omar Polo	3309ef975c	accumulate the whole response line for CGI scripts	2021-01-23 15:32:38 +00:00
Omar Polo	f890c8c54d	use a helper to handle no-body replies	2021-01-22 13:58:54 +00:00
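
Several commits above (112802ea31, 35744950aa, b06f80cdf4) describe replacing an enum+switch client state machine with function pointers, where each handler does one step and then selects the next handler. The following is only a minimal sketch of that general technique, not gmid's actual code: the struct layout, the `steps` counter, `close_conn` and `run_client` are hypothetical names introduced for illustration, and the handlers borrow the names `handle_handshake` and `handle_open_conn` from the log without reproducing their real behaviour.

```c
#include <assert.h>
#include <stddef.h>

struct client;
typedef void (*statefn)(struct client *);

/* Hypothetical, stripped-down client: gmid's real struct is much larger. */
struct client {
	statefn	 state;	/* next handler to run; NULL once finished */
	int	 steps;	/* number of handlers run (illustration only) */
};

void handle_handshake(struct client *);
void handle_open_conn(struct client *);
void close_conn(struct client *);

/* Each handler does its work, then picks the next state itself. */
void
handle_handshake(struct client *c)
{
	c->steps++;
	c->state = handle_open_conn;	/* never re-enter the handshake */
}

void
handle_open_conn(struct client *c)
{
	c->steps++;
	c->state = close_conn;
}

void
close_conn(struct client *c)
{
	c->steps++;
	c->state = NULL;		/* terminal state */
}

/* Drive the client until no state is left; returns the handlers run. */
int
run_client(struct client *c)
{
	assert(c != NULL);
	while (c->state != NULL)
		c->state(c);
	return c->steps;
}
```

Adding a state in this style means writing one more function and pointing an existing handler at it, which matches the log's remark that "adding more state is easier" once the enum+switch was gone. In an event-driven server the loop in `run_client` would be replaced by the event library calling `c->state(c)` whenever the socket is ready.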

164 Commits
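
Commit 1b78bd563a replaces strncpy with strlcpy because strncpy only NUL-terminates the destination when the source is shorter than the length parameter. Since strlcpy is not available in every libc, the sketch below uses a hypothetical portable helper, `copy_terminated`, that follows the same contract (always terminate, return the source length so the caller can detect truncation). It is an illustration of the idea, not code taken from gmid.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/*
 * Hypothetical helper with strlcpy-like semantics: copy src into dst
 * (which holds dstsize bytes), always NUL-terminating when dstsize > 0.
 * Returns strlen(src); a result >= dstsize means the copy was truncated.
 */
size_t
copy_terminated(char *dst, const char *src, size_t dstsize)
{
	size_t srclen, n;

	assert(dst != NULL && src != NULL);

	srclen = strlen(src);
	if (dstsize > 0) {
		/* Leave room for the terminator. */
		n = srclen < dstsize - 1 ? srclen : dstsize - 1;
		memcpy(dst, src, n);
		dst[n] = '\0';
	}
	return srclen;
}
```

A caller would typically compare the return value against the buffer size, for example `if (copy_terminated(buf, name, sizeof(buf)) >= sizeof(buf))`, and treat that case as an error rather than silently continuing with a truncated string.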