The @web
module contains functions for interfacing with the
World Wide Web.
Make a web request.
The headers User-Agent, Accept, and Content-Length are automatically included in the request.
Any additional request headers can be specified via headers, which should be a case-sensitive map of header name to header value.
If method is empty and data is null then method becomes 'GET'. If data is not null then method becomes 'POST'.
The server's SSL certificate is not validated. You can check the certificate yourself against a known certificate or authority chain if validation is desired.
The function :get_header() can be used to parse out a single header value from the raw response headers.
res = web.req('http://example.com/') print res.head ## HTTP/1.1 200 OK Age: 509983 Cache-Control: max-age=604800 Content-Type: text/html; charset=UTF-8 ... ## print res.body ## <!doctype html> <html> <head> <title>Example Domain</title> ... ##
Download a file to the Axiom cache.
This uses the same cache as :import(), which is the folder .axiom.
This will show a progress bar while the file is downloading.
The the URL is already cached and the cache value is younger than max age, then no download occurs. (The cached file path is just returned.)
On error, the path is returned but the file will be empty.
Get a URL with caching. Like :dl(), but returns response body instead of path of cached file.
This is equivalent to: io.load(web.dl(url))
Like :dl(), a progress bar is shown when a request actually occurs (cache miss).
Parse out the value of a single header from a string of raw headers.
Get cookies from raw response headers.
Do a form submission
This is equivalent to doing a form submit from an HTML form like:
<form action="url" method="POST"> <input name="key0" value="value0" /> <input name="key1" value="value1" /> ... </form>
Where keyN and valueN are key/value pairs in form.
To submit mutiple form fields with the same name, the value of the form field should be an array.
To submit files, the value of the form field should be an object with the following fields:
Split a URL into parts.
If the "host" contains Unicode, it will be converted into an Internationalized Domain Name with Punycode.
If no protocol is specified, "http" is assumed. If the URL is protocol-relative (begins with "//"), "https" is assumed.
print web.url('https://example.com/whatever?q=something') ##{ "protocol" : "https", "host" : "example.com", "port" : 443, "resource" : "/whatever?q=something" }## print web.url('//example.com/whatever?q=something') ##{ "protocol" : "https", "host" : "example.com", "port" : 443, "resource" : "/whatever?q=something" }## print web.url('δΎ‹.com/whatever?q=something') ##{ "protocol" : "http", "host" : "xn--fsq.com", "port" : 80, "resource" : "/whatever?q=something" }##
Get or set user agent name.
The user agent is sent in all requests as part of the request header.
The default user agent is "Mozilla/5.0" for maximum compatibility.
print web.user_agent() #show current user agent #mimic Python web.user_agent('Python-urllib/2.6') #mimic IE9, the most popular browser in 2011 web.user_agent('Mozilla/5.0 (compatible; MSIE 9.0; Windows NT 6.1; WOW64; Trident/5.0)') #mimic Android Chrome, the most popular browser in 2023 web.user_agent('Mozilla/5.0 (Linux; Android 13; Pixel 6a) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/112.0.0.0 Mobile Safari/537.36')
Get the HTTP content type for a given file. The content type is inferred from the extension, or the first 128 bytes if none. See notes for details.
The following extensions are supported:
If the filepath has no extension or the extension is not supported above, the first 128 bytes are inspected to see if they contain a null byte. If so, "application/x-octet-stream" is returned, otherwise "text/plain" is returned.
@server
FunctionsSet the global web server event handlers.
All of the handlers take no parameters, and should use the other web.server
functions to
get context information.
By default (when the handlers are null) the on_request/on_message handlers echo back the request/message (for debugging), and on_upgrade closes the connection immediately (to deny unwanted websocket connections).
#this is code for a simple chat server :on_request() if web.server.path()=='/' web.server.write_file('chat.html') else web.server.write_404() :on_upgrade() if web.server.path()=='/chat' web.server.subscribe('chat') this.name = randstr() web.server.publish('chat','['+this.name+' has entered the chat room]',1) else web.server.close() :on_message() if web.server.path()=='/chat' web.server.publish('chat','['+this.name+']: '+web.server.message(),1) else web.server.close() :on_destroy() if web.server.path()=='/chat' web.server.publish('chat','['+this.name+' has left the chat room]',1) web.server.use(on_request,on_upgrade,on_message,on_destroy) web.server.run({ port: 8999 })
See chat.html for the corresponding HTML/CSS/JS code.
Run the global web server using specified configuration.
The default config file (web.ini) looks like this:
port = <random integer in range [1000,9999]> local = true secret = <random UUID> ;crt = server.crt ;key = server.key
Instead of a config file, an object can be passed containing all the necessary variables. At a bare minimum, the "port" variable is required, although it is recommended to specify "local" and "secret" as well.
The global web server handles two routes automatically:
Any route of the form "/static/*" will serve the corresponding file in the subdirectory named "static". Simply do not create a subdirectory named "static" if you do not want this behavior.
Any route of the form "/web/*" will serve the corresponding file in the directory specified by the environment variable "WEB", or the subdirectory named "web" if the environment variable does not exist. This feature is to facilitate sharing of client-side code between projects during development (via environment variable), while also allowing for deployments (via subdirectory). If the environment variable "WEB" is specified, it should contain an absolute path with no trailing slash. Simply do not create an environment variable named "WEB" and do not create a subfolder named "web" if you do not want this behavior.
This function blocks until the global web server is shut down (by ctrl-c or process signal/kill). This function should only be called from the main thread. Consider using background threads or processes if background processing is needed.
#create a file called web.ini if not exist, and use those values #do nothing except serve files from /static/ and /web/ web.server.run()
Get the current request method.
Get the current request resource.
The resource is the path plus any querystring. Use :path() to get just the path. Use :querystring() to get just the querystring. Unlike :path(), the :resource() is not modified by :pre() and :pop().
Get the current request path.
Excludes the querystring. Use :querystring() to get the querystring.
Get the raw headers for the current request.
Get the current request content (body).
Get the current request querystring.
Excludes leading question mark. For example if the full path was "/index.html?abc=123", the querystring would be "abc=123".
Use :load_get() to parse the querystring.
Parse getdata from querystring.
Parse postdata from request body.
Multiple form fields with the same name are combined into a single array value.
Files become objects with the following fields:
Load and unscramble basic authorization header if specified
Use :write_401() to force a login prompt in browsers.
Get the client IPv4 as a string.
If the client IP is 127.0.0.1 (localhost), this function will look for the IP in the "X-Forwarded-For" header, in order to get the true client IP when behind a reverse proxy.
Checks if path matches prefix. If so, returns true and removes prefix from path.
This is used for routing. When returning true, path is modified, so subsequent calls to :path will be return different results.
#map routes like "/abc/*" to files in subdirectory "xyz" :on_request() if web.server.pre('/abc/') web.server.write_file('xyz/'+web.server.path()) else web.server.write_404() web.server.use(on_request) web.server.run()
Pop leftmost token off path until next slash. This modifies the path.
This is used for routing. Tokens are defined as the strings in the path delimited by forward slashes. If there are no more tokens to pop, return empty string and also set path to empty string. (For example, if path is "/" and pop is called, path becomes "" and "" is returned.)
#handle route like /thing/[id] :on_request() if web.server.pre('/thing/') id = web.server.pop().number() #... web.server.write_json({id:id,data:'whatever'}) web.server.use(on_request) web.server.run()
Get header from current request by name.
Get cookie from current request by name.
Set a response cookie.
The domain flag is used to share cookies across subdomains. For example, if login.example.com wants to share cookies with api.example.com, set domain to true.
Erase a response cookie from client.
This tells the client to erase the cookie on its end by setting Max-Age to 0.
Get cookie from current request by name.
This uses the "secret" specified as part of the configuration for :run().
Set a signed response cookie.
The domain flag is used to share cookies across subdomains. For example, if login.example.com wants to share cookies with api.example.com, set domain to true.
This uses the "secret" specified as part of the configuration for :run().
Write a redirect response.
If permanent is true, the response status is "301 Moved Permanently". If permanent is false, the response status is "302 Found".
Write a response.
This function provides low-level access for writing generic responses.
It is recommended to use the other web.server.write_
functions for convenience.
The response_headers field should contain the status line followed by the headers. Each line in response_headers should be followed by "\r\n". However, the entire string should NOT contain a trailing "\r\n". (So response_headers end with "\r\n" not "\r\n\r\n".)
The "Content-Length" header is automatically added if not already present in response_headers.
The body is automatically ignored if responding to a HEAD request.
Write a 200 OK response.
Write a 401 Unauthorized response. This causes the browser to prompt for Basic Authorization.
Although the specs say "401 Unauthorized", most other servers return "401 Not Authorized", so this function does the same.
Write a 403 Forbidden response.
Write a 404 Not Found response.
Write a 405 Method Not Allowed response.
Write a 500 Internal Server Error response.
Write a 200 OK response using a file.
If content_type is blank, it is automatically chosen based on the file extension in path.
Write a 200 OK response with type "application/json" using any variable.
The data parameter can be a variable of any type, and is automatically converted to JSON for the response.
Write a 200 OK response after running a function in a separate thread.
Use this function for long-running responses so that they do not block the main thread (and other connections) from processing.
This operates by creating a new thread, then calling fn with self and args in the new thread. After the function completes, the return value of the function is used as the response body, and content_type is used as the Content-Type header.
This function sets the timeout of the current connection to -1 (unlimited). To reduce this, use :timeout().
Get or set the timeout for the current connection.
The default timeout for all connections is 10 seconds. The timeout becomes -1 (unlimited) if :write_thread() is called, or the connection is upgraded to a websocket. In both cases, this function can be used to reduce the timeout.
Minify Javascript file and return minified contents.
This function just removes comments and whitespace. It does not rename function or variable names.
Minify CSS file and return minified contents.
This function just removes comments and whitespace. It does not rename any selectors.
Render an HTML template and return the rendered contents.
This function uses an extremely simple template syntax, where $variable is replaced with context[variable]. Use $$ to escape $. The variable $this becomes the filename from path without the extension, to aid with scoping.
The template syntax also supports server-side includes in the following format:
<!--#include path.html title="Title" name="value"-->
Where "path.html" is the path of the file to include (also treated like a template), and any attributes are used to form the context for the sub-template.
HTML-escape a string.
This simply replaces '<', '>', '&', and '"' with their corresponding HTML entities.
Close the websocket.
Get the current websocket message from the client to the server.
Send a message on the current websocket from server to client.
If opcode is 1, the msg becomes a String within the browser. If opcode is 2, the msg becomes a Blob within the browser.
Subscribe the current websocket to channel. This causes messages :publish() to the channel to be relayed to websocket client.
Publish a message to the specified websocket channel.
If opcode is 1, the msg becomes a String within the browser. If opcode is 2, the msg becomes a Blob within the browser.