Commit 6bd0a9fc authored by Maik Fröbe


parent 1e5a7d55
@@ -211,6 +211,11 @@ Be sure to check the generic [how to do work](#how-to-do-work) first.
#### How to do work on web-archive data?
- Log into the [webis jupyterlab]( with your gitlab credentials
- Launch a new terminal and check out the [aitools4-aq-cluster-computing repository](
- Ensure that you have a user directory in the HDFS (ask your supervisor to run the following in
HADOOP_USER_NAME=hdfs hdfs dfs -mkdir /user/<username>
HADOOP_USER_NAME=hdfs hdfs dfs -chown -R <username>:<username> /user/<username>
- Ask your supervisor to put the S3 credentials of the `internet-archive-ro` user into `~/.aws/config`:
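
For the HDFS user-directory step above, a minimal sketch of a helper that only prints the two commands for a given username, so they can be reviewed before a supervisor runs them. The function name `print_hdfs_user_dir_cmds` is hypothetical and not part of the repository; the printed commands are the ones listed above with `<username>` substituted.

```shell
#!/bin/sh
# Hypothetical helper (not part of the repository): prints the two HDFS
# commands for a given user instead of running them.
print_hdfs_user_dir_cmds() {
    username="$1"
    # Refuse to print commands for an empty username.
    if [ -z "$username" ]; then
        echo "usage: print_hdfs_user_dir_cmds <username>" >&2
        return 1
    fi
    # Create the user directory as the hdfs superuser.
    echo "HADOOP_USER_NAME=hdfs hdfs dfs -mkdir /user/${username}"
    # Hand ownership of the directory to the user.
    echo "HADOOP_USER_NAME=hdfs hdfs dfs -chown -R ${username}:${username} /user/${username}"
}
```

Calling `print_hdfs_user_dir_cmds alice` prints the `-mkdir` and `-chown` lines for `/user/alice`. Nothing is executed against the cluster.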