@14:50 - say you have a lot of documents to upload (doc, docx, excel, ppt, pptx, pdf, txt, png, jpg, pub, etc...) (about 5TB). - what's the fastest way to import them?
Very informative and among other sources helped me to get started. However, ... it's one thing to get this thing up and running, it's a completely different animal on best practices how to actually use it. As you mentioned ISO, are there readily applicable ISO standards with regards to document organisation, how a library operates for instance, or a government about naming, tagging files? That's what I struggle with most. Coming up with a structure that's specific enough to easily narrow it down to a few documents, without going overboard with a tag cloud of +1000 words.
Hi am the "newbie" so at 16:57 i am stuck , naturally..i went through the video and want to setup on my Mac..to test --the demo site failed to load unless i need to load a Office document to test -- but back to the 16:57 -- do i need to have a Server Setup---- if installed Docker on my mac can i setup a server ?
Looks like an interesting container, will check it out. I've got a fairly decent Dropbox folder structure but this looks pretty cool with the built in OCR.
A very good presentation. I was able to install the web app on local host andit is working fine. However, I am not able to install it on my hosted web server hosted by bluehost. What are the configuration changs to be made in yml file?
@@ms7165 hey make sure you have docker compose installed, check out the docs - docs.docker.com/engine/install/ubuntu/#install-using-the-repository that go through the docker setup. You might have the older version which is docker-compose rather than docker compose
Thank you so much. Is there any way to put a lot of documents directly in one or the other directory. I've heard that putting documents in the "consume" directory would make them be treated by paperless, but in my configuration it doesn't work... any idea?
I tested a lot of solutions but reverted to a single archive folder where files are stored with tags in the filename...easy and fast to search...has been working well for almost 20 years
I am on QNAP using container station and have been trying to figure this out now for days. Could get the basic linuxserver image working with access to the front end but all sort of errors after that. Do you have any advice for getting this going in container station?
Great tutorial, worked directly out of the box following your instructions on OMV6 headless server, Thx a lot! (not too fast, English is not my 1st language and I used the pause option...)
Thank you for the video. I wonder if there is something similar but for nextcloud. Because for me, nextcloud is already managing my library of documents, but it is still missing some functionality, and the presented tools seem useful. However, how do you import the existing bulk of existing files? Of how to keep it in sync with nextcloud? Thank you. I am sure some people might have the same question.
In terms of keeping it in sync with Nextcloud, you could create like an Rsync between Nextcloud and Paperless and copy the files between the Nextcloud volume and the Paperless volume. I might explore this and see how you could do it. I’m not sure if Nextcloud had any extensions that can do this, worth a look though.
The files are stored in a docker volume called media, the full path will be something like. /var/lib/docker/volumes/paperless_media/_data/documents You need root permission to access these volumes though
Hey!! I have installed the paperless-ngx on ubuntu server using docker, and now I wanted to mount the points between the docker container(paperless-ngx) & my TrueNAS server, as all the documents are organized in my TrueNAS Server. Is it possible to do ? Pls reply if possible 🙏🙏
I’m assuming you have a network share on your NAS? You can setup docker network volumes, if your network share is already mounted on your host then you can point docker to that directory as if it was any other directory
Hi, it's a great guide which worked flawlessly until trying to create the super user. When I enter the command docker-compose run --rm webserver createsuperuser, I get: -bash: docker-compose: command not found. I can't work out what's causing it and how to resolve it. Can you help please?
Are you on Ubuntu, try install docker-compose. 'sudo apt install docker-compose' also make sure you are still in the same directory as the compose file :)
I'm struggling to get Paperless-NGX to run, in the logs it says: "FATAL: password authentication failed for user "paperless"" - If you can, please give me some advice.
@@Techdox I pulled the images with Compose and then tried to create the superuser with the command in the Docker-Compose file, but before it runs it gives me this error. So , the error comes on before I need to create the superuser... Thanks for replying...
was it just me or did you miss defining some persistent volumes in the YAML to store the imported docs? If that's the case that would mean all the imports would live in the container and when you update the images all the imports would be deleted!
Hey very use full any Idea of how to use the Rest API to have it as a document repository for an app or multiple apps? A link or video will be helpfull.
Well you could but that defeats the entire point of the video haha. This is as an alternative to something like Google Drive where you are in control and selfhosting
Quick warning (LMK if this was because something that I did wrong..) if this is your first time installing it, the CMD prints will be having seizures. Not to worry tho, if you look closely, it is just trying to download all the dependent files.
Can you import a current document archive or is it single document at a time? Might be painful for some who have a folder structure already in place. Or can you keep a folder structure and still use this? I mean I tend to have other files like images in the folders which is relevant to the document and separating them means you're making your actions longer to find relevant documents.
That would be awesome, upload it and it just manages it all for you. Surely it could be implemented without too much hassle if you leveraged an exisiting AI like GPT
It says on the front page "Uses machine-learning to automatically add tags, correspondents and document types to your documents.", so it looks as if it's already there
Not everything needs to be coined AI, in this case it's just a matter of auto filling titles and dates, whether that's dates in files or document created dates, or file names. Artificial intelligence is not that.
They are pretty different right? Evernote is a hosted service you use and has paid features and is a note taking app. Paperless is a document management service with OCR that you selfhost
@@Techdox I understand but even the free Evernote version has similarities. All documents uploaded are searchable, even text in images. That is why I like it so much. This solution seems more complicated 🤔
@@James7104 the big difference here is looking at self hosted alternatives, rather than trusting other companies with data. For instance I wouldn’t be comfortable hosting medical data here to store long term compared to a self hosted solution that I control. It’s like a lot of what I cover in this channel, Nextcloud could be Google or Dropbox etc it’s all about alternatives :)
Yeah, like I have mentioned to others. This is focusing on a self hosted service that can be used regardless of device you use. More of a central repository for your documentation then on an individuals device
It is a g❤reat video for those without knowledge about installing paperless in docker on a Nas. But the video has a big but. Someone without knowledge can absolutely not follow it. Within one minute someone is lost. Because many times you talk so fast and not really clear. Please delete this video and create a new one. We really need a version for Dummies. I have watched this video 6 times now and to no use at all. I know this is also me because i have no knowledge at all. But many times i replayed certain parts trying to understand what you are saying. But with the best intentions i can not. And i so wish that i could learn to do all of this. If someone knows another good educational video for dummies you would make me very happy. Or something on the Web ?
Yeah, I’m definitely not deleting my best performing video with over a 100k views haha. I can understand if you are getting stuck because the concept of docker and docker compose maybe new to you. Check out this video I made around getting started with Selfhosting ua-cam.com/video/OfGmfZmpVsk/v-deo.htmlsi=Vnte0b_Qof4No4Yj
The environment file is really only being used for a small amount of variables unless we were making it public, you could remove it if you wanted and bring timezone etc over
In general, the amount of syntax in those files is disgusting....there should be some abstraction made, there is no way that in future by adding more complexity into the these types of files would be able to manage, it will be just added on top and new frameworks and it will be big mess@@Techdox
@@Filip-r2o If you think this is bad, you should see Kubernetes manifest files haha. It may not seem like it be there is some structure and logic to this :D
No Tor or NAS haha, just docker. The thing is if docker itself is new to you that’s why I link the get started video. Explaining the basics in every video would make them an hour each
Dude, slow down, annunciate and speak clearly. You have good info to share, that’s why I’m watching it, but the staccato delivered makes my head go dizzy. 🤪
Could follow you at 1.25x with no problem. And English is my second language. It's just kiwi :-) Please don't get discouraged, you're helping regional dialect awareness :-)
i´ve just tried to install with portainer. i put the code from "docker-compose.postgres-tika.yml" in a stack and adjust only my port and a few volumes. I get an error. after that i put the code 1:1 in a stack and deploy. also there i got an error . what i am doing wrong ?
@@Techdox this error message showed up " Deployment Error => failed to deploy stack: open /data/compose/16/docker-compose.env: no such file or directory"
No I don't because I thougt there is everything commented out and the gid and uid are not neccesarry. You mean put the Code from both direct in the table of portainer stack and then deploy? I will test this and Feedback here. Thank you both for your Support!
Now i`ve got PL running, i`ve just took the "docker-compose.portainer.yml" from github, that was a lot easier for me ;). but now i have the next issue i have to dela with ... the loggin steup with the CLI. I ve running docker on a omv media server in portainer, i have nothing to do yet in CLI`s. I can acces via putty on my WIN pc and login in my user on my server but thats it. I have noch clue how to get to the path in the video "~docker/paperless" to set the needed command line to create user
Are you in the Discord? If not make sure you are for the Raspberry Pi 5 Giveaway - discord.com/invite/8mX2KRxDw8 drawn end of December
Second video I’ve watched. Your presentations are much clearer than most. Much appreciated
Thank you, glad it was helpful
Awesome video, love the fast nature of the video, I usually pause videos should I need more time.
That’s what I figured as well, if I cut to the points then viewers can pause and rewind as needed :)
@14:50 - say you have a lot of documents to upload (doc, docx, excel, ppt, pptx, pdf, txt, png, jpg, pub, etc...) (about 5TB).
- what's the fastest way to import them?
@11:07 - I wanted to see the directory after the creation process!!!
If this video helped you, feel free to share a "Thanks" to support the channel!
Thanks!
Very informative and among other sources helped me to get started. However, ... it's one thing to get this thing up and running, it's a completely different animal on best practices how to actually use it. As you mentioned ISO, are there readily applicable ISO standards with regards to document organisation, how a library operates for instance, or a government about naming, tagging files? That's what I struggle with most. Coming up with a structure that's specific enough to easily narrow it down to a few documents, without going overboard with a tag cloud of +1000 words.
I have a suggestion for showing ip addr outputs.. if you add "-br" is cleans it up a lot. so "ip -br a" or "ip -br addr" would make it easier to demo.
That is a lot cleaner, thank you :)
@@Techdox it is what finally tipped the scales for me to switch off of ifconfig
very helpful!
thx
Hi am the "newbie" so at 16:57 i am stuck , naturally..i went through the video and want to setup on my Mac..to test --the demo site failed to load unless i need to load a Office document to test -- but back to the 16:57 -- do i need to have a Server Setup---- if installed Docker on my mac can i setup a server ?
Yeah, you should be able to run this just fine locally if you have docker on your Mac
I am curious on what terminal software you are using, it seem to populate with useful commands and options after a - etc.. ? Thanks in advance!
Thanks for the demo and info, have a great day
Hi, I want to see the database. I tried port 5432, 3306. it does not show up. Any idea how to access it as i want to see the table structure
Looks like an interesting container, will check it out. I've got a fairly decent Dropbox folder structure but this looks pretty cool with the built in OCR.
Let me know how you find it if you do check it out
Nextcloud now has OCR too but paperless seem pretty interesting as it seems pretty smooth and fluid compared to the horrible nextcloud ux
Up and running! Thanks.
Great to hear! Enjoy it
@7:02 - it says you need 3 files (d*.yml, d*.env, ".env"), why?
@11:01 -
A very good presentation. I was able to install the web app on local host andit is working fine. However, I am not able to install it on my hosted web server hosted by bluehost. What are the configuration changs to be made in yml file?
What error do you get? Feel free to join the Discord and we can help
at 16:57 when executing the docker compose up -d, why do I get "command not found"
@@ms7165 hey make sure you have docker compose installed, check out the docs - docs.docker.com/engine/install/ubuntu/#install-using-the-repository that go through the docker setup. You might have the older version which is docker-compose rather than docker compose
Thank you so much. Is there any way to put a lot of documents directly in one or the other directory. I've heard that putting documents in the "consume" directory would make them be treated by paperless, but in my configuration it doesn't work... any idea?
Really good question I’ll have to look into it
Thank you
I tested a lot of solutions but reverted to a single archive folder where files are stored with tags in the filename...easy and fast to search...has been working well for almost 20 years
Hey, if it works it works! :)
I am on QNAP using container station and have been trying to figure this out now for days.
Could get the basic linuxserver image working with access to the front end but all sort of errors after that.
Do you have any advice for getting this going in container station?
Great tutorial, worked directly out of the box following your instructions on OMV6 headless server, Thx a lot! (not too fast, English is not my 1st language and I used the pause option...)
Glad to hear it!
Thank you for the video. I wonder if there is something similar but for nextcloud.
Because for me, nextcloud is already managing my library of documents, but it is still missing some functionality, and the presented tools seem useful.
However, how do you import the existing bulk of existing files? Of how to keep it in sync with nextcloud?
Thank you. I am sure some people might have the same question.
In terms of keeping it in sync with Nextcloud, you could create like an Rsync between Nextcloud and Paperless and copy the files between the Nextcloud volume and the Paperless volume. I might explore this and see how you could do it.
I’m not sure if Nextcloud had any extensions that can do this, worth a look though.
there is OCR on nextcloud
@@ArawnFR well that answers that :D
Tks for the video; one question: can u explain how to install another menu language like spanish instead of default english?
I hope you enjoyed this video 😀
Nice video 😊
I would alter the data/media volumes to docker folder defaults to /usr/src/paperless
@@MartinHiggs84 Yeah, I was debating if it would be worth changing them from volumes to binds in that folder
@Techdox that makes sense - I was thinking just need one backup set for/docker then everything covered as expand
11:00 docker compose up -d
no configuration file provided: not found
Make sure to run the command where your compose file is
what to do if you need to upgrade all the components, db/broker/tika, etc...?
Interesting - Do you have a yaml manifest for paperless to deploy on kubernetes ?
I don't but you could chuck the compose into ChatGPT to convert it haha
Nice tutorial. Does it grab text from images/scanned documents? If not is there another program that can do this?
Yeah. Should do that just fine :)
where are all the documents being stored / saved at? docker/paperless?
The files are stored in a docker volume called media, the full path will be something like.
/var/lib/docker/volumes/paperless_media/_data/documents
You need root permission to access these volumes though
Hey!! I have installed the paperless-ngx on ubuntu server using docker, and now I wanted to mount the points between the docker container(paperless-ngx) & my TrueNAS server, as all the documents are organized in my TrueNAS Server. Is it possible to do ? Pls reply if possible 🙏🙏
I’m assuming you have a network share on your NAS? You can setup docker network volumes, if your network share is already mounted on your host then you can point docker to that directory as if it was any other directory
Now that this works, can you access it outside your local network trough a Cloudflare tunnel?
You sure can
perfect! thx a lot
works right away..
Can you edit or create new ones on it?
did you ever see SMEs using this?
Hi, it's a great guide which worked flawlessly until trying to create the super user. When I enter the command docker-compose run --rm webserver createsuperuser, I get:
-bash: docker-compose: command not found. I can't work out what's causing it and how to resolve it. Can you help please?
Are you on Ubuntu, try install docker-compose. 'sudo apt install docker-compose' also make sure you are still in the same directory as the compose file :)
@@Techdox you're a star! Thank you for such a quick response...it must be breakfast time there! ;-)
It is indeed! Coffee time :)
I'm struggling to get Paperless-NGX to run, in the logs it says: "FATAL: password authentication failed for user "paperless"" - If you can, please give me some advice.
Did you create the user after deploying Paperless? :)
@@Techdox I pulled the images with Compose and then tried to create the superuser with the command in the Docker-Compose file, but before it runs it gives me this error. So , the error comes on before I need to create the superuser... Thanks for replying...
@@Techdox It seems to be working now after I changed the permissions for the db volume. But thanks for your help!
@@DariusOosthuizenSuper great to hear!
docker compose up -d or docker-compose up -d?
was it just me or did you miss defining some persistent volumes in the YAML to store the imported docs? If that's the case that would mean all the imports would live in the container and when you update the images all the imports would be deleted!
It's using volumes, so the data is stored amoung 4 volumes which you can see at the bottom of the compose. data, media, pgdata and redisdata
Hey very use full any Idea of how to use the Rest API to have it as a document repository for an app or multiple apps? A link or video will be helpfull.
That’s a great idea, thanks for the suggestion. Their APIs look very straight forward
This seems more convoluted than using the ocr scanning functions of google drive. Why not just use that?
Well you could but that defeats the entire point of the video haha. This is as an alternative to something like Google Drive where you are in control and selfhosting
Quick warning (LMK if this was because something that I did wrong..) if this is your first time installing it, the CMD prints will be having seizures. Not to worry tho, if you look closely, it is just trying to download all the dependent files.
Any ideas on auto-tagging, categorization and document clustering powered by AI?
I myself am curious on how that works. They mentioned it was a thing, so I’m keen to understand how it works
What about persistance? If the container dies, your database dies too, doesn't it?
Two containers, but even if they do die all the data is stored on volumes that live on the host machine
Can you import a current document archive or is it single document at a time? Might be painful for some who have a folder structure already in place.
Or can you keep a folder structure and still use this? I mean I tend to have other files like images in the folders which is relevant to the document and separating them means you're making your actions longer to find relevant documents.
I would say you could point paperless to your current location of files and it should import them all
Which terminal do you use?
iterm2 using a Dracula theme with the fig plugin :)
AI treatment of documents would be great to autopick the categories and fill out the date and so on
That would be awesome, upload it and it just manages it all for you. Surely it could be implemented without too much hassle if you leveraged an exisiting AI like GPT
It says on the front page "Uses machine-learning to automatically add tags, correspondents and document types to your documents.", so it looks as if it's already there
Not everything needs to be coined AI, in this case it's just a matter of auto filling titles and dates, whether that's dates in files or document created dates, or file names. Artificial intelligence is not that.
Thats very interesting akternative of zipping and syncing with onedrive or cloud or anything the paperless is way more easy and practical
This might b a nice addition to my home server running on an HP DL60 9!
What is the benefit of this software compared to Evernote?
They are pretty different right? Evernote is a hosted service you use and has paid features and is a note taking app. Paperless is a document management service with OCR that you selfhost
@@Techdox I understand but even the free Evernote version has similarities. All documents uploaded are searchable, even text in images. That is why I like it so much. This solution seems more complicated 🤔
@@James7104 the big difference here is looking at self hosted alternatives, rather than trusting other companies with data. For instance I wouldn’t be comfortable hosting medical data here to store long term compared to a self hosted solution that I control. It’s like a lot of what I cover in this channel, Nextcloud could be Google or Dropbox etc it’s all about alternatives :)
Thanks for pointing that out. @@Techdox
thx bro
Mac OS does this already. Just scan give it time to index and you’re done.
Yeah, like I have mentioned to others. This is focusing on a self hosted service that can be used regardless of device you use. More of a central repository for your documentation then on an individuals device
My bad, I have a file system that suits me and I didn’t listen to what you were saying.
Docker being a paid service, the software is not free. Is there any other method?
Docker is most definitely still free, the business editions are not though but you don’t need that
Or I just use macOS spotlight which also indexes content.
For sure, it’s nice to have dedicated services for this though so you can access it and get the benefits from any device :)
It is a g❤reat video for those without knowledge about installing paperless in docker on a Nas.
But the video has a big but. Someone without knowledge can absolutely not follow it. Within one minute someone is lost. Because many times you talk so fast and not really clear.
Please delete this video and create a new one. We really need a version for Dummies. I have watched this video 6 times now and to no use at all. I know this is also me because i have no knowledge at all. But many times i replayed certain parts trying to understand what you are saying. But with the best intentions i can not. And i so wish that i could learn to do all of this.
If someone knows another good educational video for dummies you would make me very happy. Or something on the Web ?
Yeah, I’m definitely not deleting my best performing video with over a 100k views haha. I can understand if you are getting stuck because the concept of docker and docker compose maybe new to you.
Check out this video I made around getting started with Selfhosting ua-cam.com/video/OfGmfZmpVsk/v-deo.htmlsi=Vnte0b_Qof4No4Yj
@@Techdox So many times i have watched it. 😇
I will watch the one from your links. Thanks.
Where are you getting stuck?
@8:20 🤮
The environment file is really only being used for a small amount of variables unless we were making it public, you could remove it if you wanted and bring timezone etc over
In general, the amount of syntax in those files is disgusting....there should be some abstraction made, there is no way that in future by adding more complexity into the these types of files would be able to manage, it will be just added on top and new frameworks and it will be big mess@@Techdox
@@Filip-r2o If you think this is bad, you should see Kubernetes manifest files haha. It may not seem like it be there is some structure and logic to this :D
Your like my brother and other techies. You know your stuff but don’t speak in human terms. So this is running in tor Nas / Docker dang
No Tor or NAS haha, just docker. The thing is if docker itself is new to you that’s why I link the get started video.
Explaining the basics in every video would make them an hour each
Docker or K8’s?
Dude, slow down, annunciate and speak clearly. You have good info to share, that’s why I’m watching it, but the staccato delivered makes my head go dizzy. 🤪
Haha! I hear you, I’ll make sure to keep this in mind for future video, the kiwi accent won’t help either 😂
no issues here, sounds normal to me
Normal here as well, great content man!
Yeah no problem. Man's got an accent not poor communication.
Could follow you at 1.25x with no problem. And English is my second language. It's just kiwi :-) Please don't get discouraged, you're helping regional dialect awareness :-)
i´ve just tried to install with portainer. i put the code from "docker-compose.postgres-tika.yml" in a stack and adjust only my port and a few volumes. I get an error. after that i put the code 1:1 in a stack and deploy. also there i got an error . what i am doing wrong ?
Hey, what’s the error?
@@Techdox this error message showed up " Deployment Error => failed to deploy stack: open /data/compose/16/docker-compose.env: no such file or directory"
did you import the env file inside the portainer UI at the bottom of the page?
No I don't because I thougt there is everything commented out and the gid and uid are not neccesarry.
You mean put the Code from both direct in the table of portainer stack and then deploy? I will test this and Feedback here. Thank you both for your Support!
Now i`ve got PL running, i`ve just took the "docker-compose.portainer.yml" from github, that was a lot easier for me ;). but now i have the next issue i have to dela with ... the loggin steup with the CLI. I ve running docker on a omv media server in portainer, i have nothing to do yet in CLI`s. I can acces via putty on my WIN pc and login in my user on my server but thats it. I have noch clue how to get to the path in the video "~docker/paperless" to set the needed command line to create user
docker compose up -d or docker-compose up -d?
They are the same, but docker compose is the new one and I would suggest using that one
@@Techdox It did not work for me so I had to use docker-compose.