GitHub - tmalsburg/selfhost_ling_expts: A guide and templates for self-hosted experiments designed with jsPsych and served using Python

What is this?

In this document, I explain how to set up and host browser-based experiments. This approach uses jsPsych for designing the experiment and a simple but effective Python script to serve it on the web. This repository also includes demo experiments showing how standard (psycho)linguistic paradigms can be implemented. Feel free to use these demos as templates for your own experiments.

Terms of use

Use this guide and software at your own risk. This guide, the script for serving the experiment online (server.py), and the Makefile are shared under the CC BY 4.0 license. If you base your own research on these materials, please acknowledge it.

Short instructions (for Linux, macOS)

Currently, the following demo experiments are available:

- demo_stroop_task
- demo_judgment_task
- demo_selfpacedreading
- demo_recaptcha

To run the demo experiments (detailed instructions and explanations below):

Copy this repository to the server on which you’d like to run the experiment.
Install required software: sudo apt install make python3-bottle python3-gevent
In a command shell, enter the directory of the experiment that you’d like to test.
Execute make test. The experiment will now be served at the address displayed in the shell.
Point your web browser to that address to test.
To stop the web server, press Ctrl-C in the shell.
Collected data can be found in the subdirectory data.

The instructions above are for testing. For production use, see detailed instructions below.

Overview

The sample experiments are implemented using jsPsych, which is one of the standard packages for implementing web-based experiments. An alternative package that also looks promising is lab.js.

Bottle and gevent are Python packages that we use to serve the experiment to the web and to store the results on the server.

Bottle is the Python web framework used for serving the experiment and storing the collected data. Bottle was chosen because it is easy to use and well-documented.
gevent is our web server and handles network connections. gevent, too, is easy to use, but at the same time it scales really well if needed. It supports asynchronous processing and can simultaneously serve hundreds or even thousands of users.

The script server.py serves the experiment and stores the results in the subdirectory data.
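
As a rough illustration of the storing step, here is a minimal sketch using only the Python standard library. It is not the code in server.py, which is built on Bottle and may work differently. The function name store_results and its parameters are hypothetical, and the sketch assumes that results arrive as CSV text. Each file is named after a freshly generated UUID, matching the file names described further below.

```python
import uuid
from pathlib import Path


def store_results(csv_text, data_dir="data"):
    """Write one participant's results to data_dir under a UUID file name.

    Hypothetical helper for illustration; server.py may work differently.
    """
    directory = Path(data_dir)
    directory.mkdir(parents=True, exist_ok=True)  # create data/ if it is missing
    # uuid4() is random, so name collisions between participants are
    # not a practical concern.
    path = directory / f"{uuid.uuid4()}.csv"
    path.write_text(csv_text, encoding="utf-8")
    return path
```

Calling store_results("rt,response\n512,left\n") would create a file such as data/1244af49-9db5-410f-92bb-e4ecef23fc61.csv, with a different name on every call.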

Below are detailed instructions showing how to install and run the experiments. You’ll need a virtual server running Ubuntu Linux (or similar). Follow the instructions below for DigitalOcean, a commercial cloud service provider. People working at a university in Baden-Württemberg may try bwCloud, a cloud service offered by the state.

Create a virtual server (a “Droplet”) on DigitalOcean

Visit https://www.digitalocean.com and create an account.
Log in and visit the management interface at: https://cloud.digitalocean.com/projects
In the menu pane on the left select “Droplets”.
Click the blue button “Create Droplet”.
Configure Droplet:
- Choose geographic region where the Droplet should be hosted (“Frankfurt”).
- As operating system choose Ubuntu (latest version).
- Select “Size” of Droplet. “Shared CPU / Basic” plan is usually enough.
- Under CPU options, choose “Regular”. Scroll the horizontal list of available plans all the way to the left and choose the cheapest plan (USD 4, at the time of writing).
- In the section “Choose Authentication Method”, you can choose “SSH Key” or “Password”. The former is more secure and more convenient once it’s set up. The latter is potentially insecure (depending on the password) but slightly easier to set up. The “SSH Key” method is strongly recommended. On Linux, the file containing your public key can usually be found at ~/.ssh/id_rsa.pub. On macOS it’s probably in the same location. No idea where it would be on Windows, but DigitalOcean shows instructions for all operating systems when you click on the button “New SSH Key”.
- The options that they offer in the next section are typically not needed.
- In the section “Finalize Details” you can choose how many Droplets you want to create (usually 1) and give each a name.
- Finally click the button “Create Droplet” at the bottom right.

Install required software on the virtual server

Log into the virtual server using SSH in a terminal:
- Copy the instance’s IP address from the list of instances (e.g., 193.196.54.221).
- Open a terminal and enter the command ssh root@193.196.54.221, but with the actual IP address of your instance. In this command, root is the default username on DigitalOcean servers. When using a bwCloud server, replace root with ubuntu.
- SSH will warn you that the “authenticity of host XYZ can’t be established”. That’s normal when you connect the first time. Answer “yes” when asked whether you’d like to continue.
- If all goes well, SSH will connect to the virtual server and show its command prompt, for instance, root@test_instance:~$ on DigitalOcean or ubuntu@test_instance:~$ on bwCloud.
Install required software packages: sudo apt update && sudo apt install make python3-bottle python3-gevent r-cran-dplyr

Done. You can now terminate the connection to the server by entering exit. This will bring you back to the command prompt of your computer.

Install the demo experiments on the virtual server

Connect to the virtual server: ssh root@193.196.54.221 (using the IP address of your instance)
To copy the demo experiments to the virtual server, simply clone the git repository: git clone git@github.com:tmalsburg/selfhost_ling_expts.git
Enter ls selfhost_ling_expts to see all files. You should see:
- Makefile: a file for starting and stopping the HTTP server that serves the experiment over the web
- README.md: the file you’re currently reading
- server.py: the script for serving the experiment and storing results on disk
- demo_stroop_task: directory containing the simple stroop task
- demo_judgment_task: directory containing the simple judgment task

Run and test the stroop task

Enter the directory containing the experiment: cd selfhost_ling_expts/demo_stroop_task
To start the web server enter: make start
The server will use encrypted connections (https://…) if the directory contains a certificate and key (cert.pem and key.pem). This avoids the warnings that some browsers show, saying that the connection cannot be trusted.
You can now access the experiment in the browser at a URL like http://193.196.54.221/ (unencrypted) or https://193.196.54.221/ (encrypted) but using the IP address of your virtual server instance.
After you have worked through the experiment, you will find a new file in the subdirectory data named something like 1244af49-9db5-410f-92bb-e4ecef23fc61.csv. This file contains the results of your test run. The name of the file is a so-called UUID, which is (for all practical purposes) globally unique.
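
To work out which of the two URLs applies to a given experiment directory, the rule above (encrypted only if both cert.pem and key.pem are present) can be sketched as follows. The function names connection_scheme and experiment_url are hypothetical and are not part of server.py. The sketch only checks that the two files exist, not that they form a valid certificate and key pair.

```python
from pathlib import Path


def connection_scheme(experiment_dir="."):
    """Return "https" if both cert.pem and key.pem exist, else "http"."""
    directory = Path(experiment_dir)
    has_cert = (directory / "cert.pem").is_file()
    has_key = (directory / "key.pem").is_file()
    return "https" if has_cert and has_key else "http"


def experiment_url(ip_address, experiment_dir="."):
    """Build the URL at which the experiment is expected to be reachable."""
    return f"{connection_scheme(experiment_dir)}://{ip_address}/"
```

For example, experiment_url("193.196.54.221", "selfhost_ling_expts/demo_stroop_task") gives the unencrypted address unless both files are present in that directory.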

Stop the experiment

Enter the directory containing the experiment: cd selfhost_ling_expts/demo_stroop_task
To stop the server enter: make stop

Compiling all individual result files into one file

Enter the directory data.
Then execute: Rscript combine_results.R

This will create a new file combined.tsv with two additional columns:

participant_id: This column contains the file name of each participant’s individual results file. Since the file name is a UUID, this ID is practically guaranteed to be unique. However, if someone participates multiple times in the same experiment, they’ll get multiple IDs, so there is not necessarily a 1-to-1 mapping between these IDs and actual people.
ctime: contains the individual results file’s creation time.

The individual results will appear in chronological order in combined.tsv.

Note: For ctime, the script uses the time when the file was created on disk. For this time to be accurate, the script must be run on the machine where the experiment was conducted. If you transfer the files to another computer (e.g., using scp), the times will no longer reflect the original creation time, but the time at which the files were copied. So the suggested workflow is: First combine all results into one file. Then transfer that file to wherever you’d like to process the data further.
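
For readers who prefer Python over R, the same combining step can be approximated as follows. This is a sketch, not a port of combine_results.R. The function name combine_results is hypothetical. It assumes that every results file is a well-formed CSV with a header row, it stores ctime as a raw timestamp in seconds, and it uses the file name without the .csv extension as participant_id. On Linux, st_ctime is the time of the last metadata change, which equals the creation time only for files that were not modified afterwards.

```python
import csv
from pathlib import Path


def combine_results(data_dir="data", out_name="combined.tsv"):
    """Merge all *.csv files in data_dir into one tab-separated file."""
    directory = Path(data_dir)
    # Oldest file first, so rows end up in chronological order.
    files = sorted(directory.glob("*.csv"), key=lambda p: p.stat().st_ctime)
    rows, columns = [], []
    for path in files:
        ctime = path.stat().st_ctime
        with path.open(newline="", encoding="utf-8") as handle:
            for row in csv.DictReader(handle):
                row["participant_id"] = path.stem  # UUID part of the file name
                row["ctime"] = ctime
                rows.append(row)
                # Collect column names in order of first appearance.
                for name in row:
                    if name is not None and name not in columns:
                        columns.append(name)
    out_path = directory / out_name
    with out_path.open("w", newline="", encoding="utf-8") as handle:
        writer = csv.DictWriter(handle, fieldnames=columns, delimiter="\t",
                                restval="", extrasaction="ignore")
        writer.writeheader()
        writer.writerows(rows)
    return out_path
```

As with the R script, this should be run on the server before the files are copied elsewhere, so that the recorded times remain meaningful.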

Testing an experiment

For testing, use the following command, which blocks the shell until the server is stopped:

make test

To stop the server, press Ctrl-C in the shell.

Technical details

When starting the server with make start, the process id (PID) of the server will be stored in nohup.pid and log messages, including errors, in nohup.out.
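
If you want to check from a script whether the server is still running, the stored PID can be used. The sketch below assumes that nohup.pid contains nothing but the PID as plain text, which is an assumption about the Makefile rather than something documented here. The function name server_is_running is hypothetical, and the check relies on Unix signal semantics (Linux, macOS).

```python
import os
from pathlib import Path


def server_is_running(pid_file="nohup.pid"):
    """Return True if the PID stored in pid_file belongs to a live process."""
    path = Path(pid_file)
    if not path.is_file():
        return False
    try:
        pid = int(path.read_text().strip())
    except ValueError:
        return False  # file does not contain a plain integer
    if pid <= 0:
        return False  # non-positive values would address process groups
    try:
        os.kill(pid, 0)  # signal 0 only checks existence, nothing is sent
    except ProcessLookupError:
        return False  # no process with this PID
    except PermissionError:
        return True  # process exists but belongs to another user
    return True
```

Note that a PID can be reused by an unrelated process after the server has exited, so a True result is only a hint. The log in nohup.out remains the place to look for errors.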

Acknowledgements

Thanks go to Judith Tonhauser, who provided useful comments and suggestions and helped test some of the software in this repository.
