haproxy

mirror of https://github.com/haproxy/haproxy.git synced 2026-02-11 06:43:42 -05:00

Author	SHA1	Message	Date
Christopher Faulet	3a394fa7cd	MEDIUM: filters: Add pre and post analyzer callbacks 'channel_analyze' callback has been removed. Now, there are 2 callbacks to surround calls to analyzers: * channel_pre_analyze: Called BEFORE all filterable analyzers. it can be called many times for the same analyzer, once at each loop until the analyzer finishes its processing. This callback is resumable, it returns a negative value if an error occurs, 0 if it needs to wait, any other value otherwise. * channel_post_analyze: Called AFTER all filterable analyzers. Here, AFTER means when an analyzer finishes its processing. This callback is NOT resumable, it returns a negative value if an error occurs, any other value otherwise. Pre and post analyzer callbacks are not automatically called. 'pre_analyzers' and 'post_analyzers' bit fields in the filter structure must be set to the right value using AN_* flags (see include/types/channel.h). The flag AN_RES_ALL has been added (AN_REQ_ALL already exists) to ease the life of filter developers. AN_REQ_ALL and AN_RES_ALL include all filterable analyzers.	2016-05-18 15:11:54 +02:00
Christopher Faulet	309c6418b0	MEDIUM: filters: Replace filter_http_headers callback by an analyzer This new analyzer will be called for each HTTP request/response, before the parsing of the body. It is identified by AN_FLT_HTTP_HDRS. Special care was taken about the following condition : * the frontend is a TCP proxy * filters are defined in the frontend section * the selected backend is a HTTP proxy So, this patch explicitly add AN_FLT_HTTP_HDRS analyzer on the request and the response channels when the backend is a HTTP proxy and when there are filters attatched on the stream. This patch simplifies http_request_forward_body and http_response_forward_body functions.	2016-02-09 14:53:15 +01:00
Christopher Faulet	d7c9196ae5	MAJOR: filters: Add filters support This patch adds the support of filters in HAProxy. The main idea is to have a way to "easely" extend HAProxy by adding some "modules", called filters, that will be able to change HAProxy behavior in a programmatic way. To do so, many entry points has been added in code to let filters to hook up to different steps of the processing. A filter must define a flt_ops sutrctures (see include/types/filters.h for details). This structure contains all available callbacks that a filter can define: struct flt_ops { /* * Callbacks to manage the filter lifecycle / int (init) (struct proxy p); void (deinit)(struct proxy p); int (check) (struct proxy p); / * Stream callbacks / void (stream_start) (struct stream s); void (stream_accept) (struct stream s); void (session_establish)(struct stream s); void (stream_stop) (struct stream s); / * HTTP callbacks / int (http_start) (struct stream s, struct http_msg msg); int (http_start_body) (struct stream s, struct http_msg msg); int (http_start_chunk) (struct stream s, struct http_msg msg); int (http_data) (struct stream s, struct http_msg msg); int (http_last_chunk) (struct stream s, struct http_msg msg); int (http_end_chunk) (struct stream s, struct http_msg msg); int (http_chunk_trailers)(struct stream s, struct http_msg msg); int (http_end_body) (struct stream s, struct http_msg msg); void (http_end) (struct stream s, struct http_msg msg); void (http_reset) (struct stream s, struct http_msg msg); int (http_pre_process) (struct stream s, struct http_msg msg); int (http_post_process) (struct stream s, struct http_msg msg); void (http_reply) (struct stream s, short status, const struct chunk msg); }; To declare and use a filter, in the configuration, the "filter" keyword must be used in a listener/frontend section: frontend test ... filter <FILTER-NAME> [OPTIONS...] The filter referenced by the <FILTER-NAME> must declare a configuration parser on its own name to fill flt_ops and filter_conf field in the proxy's structure. An exemple will be provided later to make it perfectly clear. For now, filters cannot be used in backend section. But this is only a matter of time. Documentation will also be added later. This is the first commit of a long list about filters. It is possible to have several filters on the same listener/frontend. These filters are stored in an array of at most MAX_FILTERS elements (define in include/types/filters.h). Again, this will be replaced later by a list of filters. The filter API has been highly refactored. Main changes are: * Now, HA supports an infinite number of filters per proxy. To do so, filters are stored in list. * Because filters are stored in list, filters state has been moved from the channel structure to the filter structure. This is cleaner because there is no more info about filters in channel structure. * It is possible to defined filters on backends only. For such filters, stream_start/stream_stop callbacks are not called. Of course, it is possible to mix frontend and backend filters. * Now, TCP streams are also filtered. All callbacks without the 'http_' prefix are called for all kind of streams. In addition, 2 new callbacks were added to filter data exchanged through a TCP stream: - tcp_data: it is called when new data are available or when old unprocessed data are still waiting. - tcp_forward_data: it is called when some data can be consumed. * New callbacks attached to channel were added: - channel_start_analyze: it is called when a filter is ready to process data exchanged through a channel. 2 new analyzers (a frontend and a backend) are attached to channels to call this callback. For a frontend filter, it is called before any other analyzer. For a backend filter, it is called when a backend is attached to a stream. So some processing cannot be filtered in that case. - channel_analyze: it is called before each analyzer attached to a channel, expects analyzers responsible for data sending. - channel_end_analyze: it is called when all other analyzers have finished their processing. A new analyzers is attached to channels to call this callback. For a TCP stream, this is always the last one called. For a HTTP one, the callback is called when a request/response ends, so it is called one time for each request/response. * 'session_established' callback has been removed. Everything that is done in this callback can be handled by 'channel_start_analyze' on the response channel. * 'http_pre_process' and 'http_post_process' callbacks have been replaced by 'channel_analyze'. * 'http_start' callback has been replaced by 'http_headers'. This new one is called just before headers sending and parsing of the body. * 'http_end' callback has been replaced by 'channel_end_analyze'. * It is possible to set a forwarder for TCP channels. It was already possible to do it for HTTP ones. * Forwarders can partially consumed forwardable data. For this reason a new HTTP message state was added before HTTP_MSG_DONE : HTTP_MSG_ENDING. Now all filters can define corresponding callbacks (http_forward_data and tcp_forward_data). Each filter owns 2 offsets relative to buf->p, next and forward, to track, respectively, input data already parsed but not forwarded yet by the filter and parsed data considered as forwarded by the filter. A any time, we have the warranty that a filter cannot parse or forward more input than previous ones. And, of course, it cannot forward more input than it has parsed. 2 macros has been added to retrieve these offets: FLT_NXT and FLT_FWD. In addition, 2 functions has been added to change the 'next size' and the 'forward size' of a filter. When a filter parses input data, it can alter these data, so the size of these data can vary. This action has an effet on all previous filters that must be handled. To do so, the function 'filter_change_next_size' must be called, passing the size variation. In the same spirit, if a filter alter forwarded data, it must call the function 'filter_change_forward_size'. 'filter_change_next_size' can be called in 'http_data' and 'tcp_data' callbacks and only these ones. And 'filter_change_forward_size' can be called in 'http_forward_data' and 'tcp_forward_data' callbacks and only these ones. The data changes are the filter responsability, but with some limitation. It must not change already parsed/forwarded data or data that previous filters have not parsed/forwarded yet. Because filters can be used on backends, when we the backend is set for a stream, we add filters defined for this backend in the filter list of the stream. But we must only do that when the backend and the frontend of the stream are not the same. Else same filters are added a second time leading to undefined behavior. The HTTP compression code had to be moved. So it simplifies http_response_forward_body function. To do so, the way the data are forwarded has changed. Now, a filter (and only one) can forward data. In a commit to come, this limitation will be removed to let all filters take part to data forwarding. There are 2 new functions that filters should use to deal with this feature: * flt_set_http_data_forwarder: This function sets the filter (using its id) that will forward data for the specified HTTP message. It is possible if it was not already set by another filter _AND_ if no data was yet forwarded (msg->msg_state <= HTTP_MSG_BODY). It returns -1 if an error occurs. * flt_http_data_forwarder: This function returns the filter id that will forward data for the specified HTTP message. If there is no forwarder set, it returns -1. When an HTTP data forwarder is set for the response, the HTTP compression is disabled. Of course, this is not definitive.	2016-02-09 14:53:15 +01:00
Willy Tarreau	748179eb5a	MEDIUM: stream: move HTTP request body analyser before process_common Since 1.5, the request body analyser has become independant from any other element and does not even disturb the message forwarder anymore. And since it's disabled by default, we can place it before most analysers so that it's can preempt any other one if an intermediary one enables it.	2015-05-02 00:10:44 +02:00
Willy Tarreau	81cd90069a	MEDIUM: channel: remove now unused ->prod and ->cons pointers Nothing uses them anymore.	2015-03-11 20:41:47 +01:00
Willy Tarreau	ef573c0f22	MEDIUM: channel: add a new flag "CF_ISRESP" for the response channel This flag designates the response channel. This will be used to know what channel we're seeing and finding our way back to the session.	2015-03-11 20:41:47 +01:00
Willy Tarreau	a27dc19eda	CLEANUP: remove now unused channel pool The channels are now part of the struct session. Their pool is not needed anymore.	2015-03-11 20:41:46 +01:00
Thierry FOURNIER	b83862dd74	MEDIUM: channel: wake up any request analyzer on response activity This behavior is already existing for the "WAIT_HTTP" analyzer, this patch just extends the system to any analyzer that would be waked up on response activity.	2015-02-28 23:12:31 +01:00
Willy Tarreau	9dc1c61c43	BUG/CRITICAL: http: don't update msg->sov once data start to leave the buffer Commit `bb2e669` ("BUG/MAJOR: http: correctly rewind the request body after start of forwarding") was incorrect/incomplete. It used to rely on CF_READ_ATTACHED to stop updating msg->sov once data start to leave the buffer, but this is unreliable because since commit `a6eebb3` ("[BUG] session: clear BF_READ_ATTACHED before next I/O") merged in 1.5-dev1, this flag is only ephemeral and is cleared once all analysers have seen it. So we can start updating msg->sov again each time we pass through this place with new data. With a sufficiently large amount of data, it is possible to make msg->sov wrap and validate the if() condition at the top, causing the buffer to advance by about 2GB and crash the process. Note that the offset cannot be controlled by the attacker because it is a sum of millions of small random sizes depending on how many bytes were read by the server and how many were left in the buffer, only because of the speed difference between reading and writing. Also, nothing is written, the invalid pointer resulting from this operation is only read. Many thanks to James Dempsey for reporting this bug and to Chris Forbes for narrowing down the faulty area enough to make its root cause analysable. This fix must be backported to haproxy 1.5.	2014-09-02 16:48:54 +02:00
Willy Tarreau	644c101e2d	BUG/MAJOR: http: connection setup may stall on balance url_param On the mailing list, seri0528@naver.com reported an issue when using balance url_param or balance uri. The request would sometimes stall forever. Cyril Bont� managed to reproduce it with the configuration below : listen test :80 mode http balance url_param q hash-type consistent server s demo.1wt.eu:80 and found it appeared with this commit : `80a92c0` ("BUG/MEDIUM: http: don't start to forward request data before the connect"). The bug is subtle but real. The problem is that the HTTP request forwarding analyzer refrains from starting to parse the request body when some LB algorithms might need the body contents, in order to preserve the data pointer and avoid moving things around during analysis in case a redispatch is later needed. And in order to detect that the connection establishes, it watches the response channel's CF_READ_ATTACHED flag. The problem is that a request analyzer is not subscribed to a response channel, so it will only see changes when woken for other (generally correlated) reasons, such as the fact that part of the request could be sent. And since the CF_READ_ATTACHED flag is cleared once leaving process_session(), it is important not to miss it. It simply happens that sometimes the server starts to respond in a sequence that validates the connection in the middle of process_session(), that it is detected after the analysers, and that the newly assigned CF_READ_ATTACHED is not used to detect that the request analysers need to be called again, then the flag is lost. The CF_WAKE_WRITE flag doesn't work either because it's cleared upon entry into process_session(), ie if we spend more than one call not connecting. Thus we need a new flag to tell the connection initiator that we are specifically interested in being notified about connection establishment. This new flag is CF_WAKE_CONNECT. It is set by the requester, and is cleared once the connection succeeds, where CF_WAKE_ONCE is set instead, causing the request analysers to be scanned again. For future versions, some better options will have to be considered : - let all analysers subscribe to both request and response events ; - let analysers subscribe to stream interface events (reduces number of useless calls) - change CF_WAKE_WRITE's semantics to persist across calls to process_session(), but that is different from validating a connection establishment (eg: no data sent, or no data to send) The bug was introduced in 1.5-dev23, no backport is needed.	2014-04-30 20:02:02 +02:00
Willy Tarreau	b145c78623	MINOR: channel: add the date of last read in the channel We store the time stamp of last read in the channel in order to be able to measure some bit rate and pause lengths. We only use 16 bits which were unused for this. We don't need more, as it allows us to measure with a millisecond precision for up to 65s.	2014-02-12 11:45:59 +01:00
Willy Tarreau	d7ad9f5b0d	MAJOR: channel: add a new flag CF_WAKE_WRITE to notify the task of writes Since commit `6b66f3e` ([MAJOR] implement autonomous inter-socket forwarding) introduced in 1.3.16-rc1, we've been relying on a stupid mechanism to wake up the task after a write, which was an exact copy-paste of the reader side. The principle was that if we empty a buffer and there's no forwarding scheduled or if the producer is not in a connected state, then we wake the task up. That does not make any sense. It happens to wake up too late sometimes (eg, when the request analyser waits for some room in the buffer to start to work), and leads to unneeded wakeups in client-side keep-alive, because the task is woken up when the response is sent, while the analysers are simply waiting for a new request. In order to fix this, we introduce a new channel flag : CF_WAKE_WRITE. It is designed so that an analyser can explicitly request being notified when some data were written. It is used only when the HTTP request or response analysers need to wait for more room in the buffers. It is automatically cleared upon wake up. The flag is also automatically set by the functions which try to write into a buffer from an applet when they fail (bi_putblk() etc...). That allows us to remove the stupid condition above and avoid some wakeups. In http-server-close and in http-keep-alive modes, this reduces from 4 to 3 the average number of wakeups per request, and increases the overall performance by about 1.5%.	2013-12-31 18:37:36 +01:00
Willy Tarreau	983eb31fd1	BUG/MINOR: channel: CHN_INFINITE_FORWARD must be unsigned This value is stored as unsigned in chn->to_forward. Having it defined as signed makes it impossible to pass channel_forward() a previously saved value because the argument will be zero-extended during the conversion to long long, while the test will be performed using sign extension. There is no impact on existing code right now.	2013-12-28 21:33:37 +01:00
Godbach	2b8fd54287	DOC: fix typo in comments Hi Willy, There is a patch to fix typo in comments, please check the attachment for you information. The commit log is as below: commit 9824d1b3740ac2746894f1aa611c795366c84210 Author: Godbach <nylzhaowei@gmail.com> Date: Mon Sep 30 11:05:42 2013 +0800 DOC: fix typo in comments 0x20000000 -> 0x40000000 vuf -> buf ethod -> Method Signed-off-by: Godbach <nylzhaowei@gmail.com> -- Best Regards, Godbach From 9824d1b3740ac2746894f1aa611c795366c84210 Mon Sep 17 00:00:00 2001 From: Godbach <nylzhaowei@gmail.com> Date: Mon, 30 Sep 2013 11:05:42 +0800 Subject: [PATCH] DOC: fix typo in comments 0x20000000 -> 0x40000000 vuf -> buf ethod -> Method Signed-off-by: Godbach <nylzhaowei@gmail.com>	2013-10-01 09:49:21 +02:00
Willy Tarreau	b31c971bef	CLEANUP: channel: remove any reference of the hijackers Hijackers were functions designed to inject data into channels in the distant past. They became unused around 1.3.16, and since there has not been any user of this mechanism to date, it's uncertain whether the mechanism still works (and it's not really useful anymore). So better remove it as well as the pointer it uses in the channel struct.	2012-11-11 23:05:39 +01:00
Willy Tarreau	0a8535fec8	OPTIM: channel: reorganize struct members to improve cache efficiency Now that the buffer is moved out of the channel, it is possible to move the pointer earlier in the struct and reorder some fields. This new ordering improves overall performance by 2%, mainly saved in the HTTP parsers and data transfers.	2012-10-13 10:55:22 +02:00
Willy Tarreau	9b28e03b66	MAJOR: channel: replace the struct buffer with a pointer to a buffer With this commit, we now separate the channel from the buffer. This will allow us to replace buffers on the fly without touching the channel. Since nobody is supposed to keep a reference to a buffer anymore, doing so is not a problem and will also permit some copy-less data manipulation. Interestingly, these changes have shown a 2% performance increase on some workloads, probably due to a better cache placement of data.	2012-10-13 09:07:52 +02:00
Willy Tarreau	74172ff9c3	CLEANUP: frontend: remove the old proxy protocol decoder This one used to rely on a stream analyser which was inappropriate. It's not used anymore.	2012-09-03 20:47:35 +02:00
Willy Tarreau	8263d2b259	CLEANUP: channel: use "channel" instead of "buffer" in function names This is a massive rename of most functions which should make use of the word "channel" instead of the word "buffer" in their names. In concerns the following ones (new names) : unsigned long long channel_forward(struct channel buf, unsigned long long bytes); static inline void channel_init(struct channel buf) static inline int channel_input_closed(struct channel buf) static inline int channel_output_closed(struct channel buf) static inline void channel_check_timeouts(struct channel b) static inline void channel_erase(struct channel buf) static inline void channel_shutr_now(struct channel buf) static inline void channel_shutw_now(struct channel buf) static inline void channel_abort(struct channel buf) static inline void channel_stop_hijacker(struct channel buf) static inline void channel_auto_connect(struct channel buf) static inline void channel_dont_connect(struct channel buf) static inline void channel_auto_close(struct channel buf) static inline void channel_dont_close(struct channel buf) static inline void channel_auto_read(struct channel buf) static inline void channel_dont_read(struct channel buf) unsigned long long channel_forward(struct channel *buf, unsigned long long bytes) Some functions provided by channel.[ch] have kept their "buffer" name because they are really designed to act on the buffer according to some information gathered from the channel. They have been moved together to the same place in the file for better readability but they were not changed at all. The "buffer" memory pool was also renamed "channel".	2012-09-03 20:47:33 +02:00
Willy Tarreau	03cdb7c678	CLEANUP: channel: usr CF_/CHN_ prefixes instead of BF_/BUF_ Get rid of these confusing BF_* flags. Now channel naming should clearly be used everywhere appropriate. No code was changed, only a renaming was performed. The comments about channel operations was updated.	2012-09-03 20:47:33 +02:00
Willy Tarreau	f941cf2ef2	MAJOR: channel: remove the BF_FULL flag This is similar to the recent removal of BF_OUT_EMPTY. This flag was very problematic because it relies on permanently changing information such as the to_forward value, so it had to be updated upon every change to the buffers. Previous patch already got rid of its users. One part of the change is sensible : the flag was also part of BF_MASK_STATIC, which is used by process_session() to rescan all analysers in case the flag's status changes. At first glance, none of the analysers seems to change its mind base on this flag when it is subject to change, so it seems fine not to add variation checks here. Otherwise it's possible that checking the buffer's input and output is more reliable than checking the flag's replacement.	2012-09-03 20:47:33 +02:00
Willy Tarreau	8e21bb9e52	MAJOR: channel: remove the BF_OUT_EMPTY flag This flag was very problematic because it was composite in that both changes to the pipe or to the buffer had to cause this flag to be updated, which is not always simple (eg: there may not even be a channel attached to a buffer at all). There were not that many users of this flags, mostly setters. So the flag got replaced with a macro which reports whether the channel is empty or not, by checking both the pipe and the buffer. One part of the change is sensible : the flag was also part of BF_MASK_STATIC, which is used by process_session() to rescan all analysers in case the flag's status changes. At first glance, none of the analysers seems to change its mind base on this flag when it is subject to change, so it seems fine not to add variation checks here. Otherwise it's possible that checking the buffer's output size is more useful than checking the flag's replacement.	2012-09-03 20:47:32 +02:00
Willy Tarreau	c7e4238df0	REORG: buffers: split buffers into chunk,buffer,channel Many parts of the channel definition still make use of the "buffer" word.	2012-09-03 20:47:32 +02:00

23 commits