Commit 6bd0a9fc authored by Maik Fröbe

Update README.md

parent 1e5a7d55
@@ -211,6 +211,11 @@ Be sure to check the generic [how to do work](#how-to-do-work) first.
#### How to do work on web-archive data?
- Log in to the [Webis JupyterLab](https://jupyter2.webis.de/) with your GitLab credentials
- Launch a new terminal and check out the [aitools4-aq-cluster-computing repository](https://git.webis.de/code-lib/aitools/aitools4-aq-cluster-computing)
- Ensure that you have a user directory in HDFS (ask your supervisor to run the following on ssh.webis.de):
```
HADOOP_USER_NAME=hdfs hdfs dfs -mkdir /user/<username>
HADOOP_USER_NAME=hdfs hdfs dfs -chown -R <username>:<username> /user/<username>
```
- Ask your supervisor to put the S3 credentials of the `internet-archive-ro` user into `~/.aws/config`:
```
[DEFAULT]
......