Transform Your Document Chaos with Paperless: A Self-Hosting Guide with Docker

COMMENTS • 141

  • @Techdox
    @Techdox 1 year ago +1

    Are you in the Discord? If not, make sure you join for the Raspberry Pi 5 giveaway, drawn at the end of December: discord.com/invite/8mX2KRxDw8

  • @AntManBee19
    @AntManBee19 1 month ago

    Second video I’ve watched. Your presentations are much clearer than most. Much appreciated

    • @Techdox
      @Techdox 1 month ago

      Thank you, glad it was helpful

  • @bkahlerventer
    @bkahlerventer 1 year ago +6

    Awesome video, love the fast nature of the video, I usually pause videos should I need more time.

    • @Techdox
      @Techdox 1 year ago

      That’s what I figured as well, if I cut to the points then viewers can pause and rewind as needed :)

  • @fbifido2
    @fbifido2 1 year ago +1

    @14:50 - say you have a lot of documents to upload (doc, docx, excel, ppt, pptx, pdf, txt, png, jpg, pub, etc...) (about 5TB).
    - what's the fastest way to import them?

  • @fbifido2
    @fbifido2 1 year ago +1

    @11:07 - I wanted to see the directory after the creation process!!!

  • @Techdox
    @Techdox 1 year ago +2

    If this video helped you, feel free to share a "Thanks" to support the channel!

  • @marcelohx
    @marcelohx 11 months ago

    Thanks!

  • @TomClaessens
    @TomClaessens 2 months ago

    Very informative, and among other sources it helped me to get started. However, it's one thing to get this thing up and running; it's a completely different animal to work out best practices for how to actually use it. As you mentioned ISO, are there readily applicable ISO standards for document organisation, for instance how a library operates, or how a government names and tags files? That's what I struggle with most: coming up with a structure that's specific enough to easily narrow things down to a few documents, without going overboard with a tag cloud of 1,000+ words.

  • @Breeegz
    @Breeegz 1 year ago +11

    I have a suggestion for showing ip addr outputs: if you add "-br" it cleans it up a lot. So "ip -br a" or "ip -br addr" would make it easier to demo.

    • @Techdox
      @Techdox 1 year ago

      That is a lot cleaner, thank you :)

    • @Breeegz
      @Breeegz 1 year ago

      @Techdox it is what finally tipped the scales for me to switch off of ifconfig

    • @fvgoya
      @fvgoya 1 year ago

      very helpful!

    • @Lunolux
      @Lunolux 1 year ago

      thx

  • @Amazzinguy
    @Amazzinguy 1 year ago +1

    Hi, I am the "newbie", so at 16:57 I am stuck, naturally. I went through the video and want to set this up on my Mac to test. The demo site failed to load (unless I need to load an Office document to test). But back to 16:57: do I need to have a server set up? If I installed Docker on my Mac, can I set up a server?

    • @Techdox
      @Techdox 1 year ago

      Yeah, you should be able to run this just fine locally if you have docker on your Mac

  • @coup728
    @coup728 4 months ago

    I am curious what terminal software you are using. It seems to populate with useful commands and options after a "-", etc. Thanks in advance!

  • @chrisumali9841
    @chrisumali9841 1 year ago

    Thanks for the demo and info, have a great day

  • @BestRestaurantToEat
    @BestRestaurantToEat 11 months ago

    Hi, I want to see the database. I tried ports 5432 and 3306, but it does not show up. Any idea how to access it? I want to see the table structure.

  • @alanjrobertson
    @alanjrobertson 1 year ago +7

    Looks like an interesting container, will check it out. I've got a fairly decent Dropbox folder structure but this looks pretty cool with the built in OCR.

    • @Techdox
      @Techdox 1 year ago +1

      Let me know how you find it if you do check it out

    • @ArawnFR
      @ArawnFR 1 year ago +2

      Nextcloud now has OCR too, but Paperless seems pretty interesting as it looks smooth and fluid compared to the horrible Nextcloud UX

  • @ms7165
    @ms7165 1 year ago +1

    Up and running! Thanks.

    • @Techdox
      @Techdox 1 year ago

      Great to hear! Enjoy it

  • @fbifido2
    @fbifido2 1 year ago

    @7:02 - it says you need 3 files (d*.yml, d*.env, ".env"), why?
    @11:01 -

  • @simhayoutubification
    @simhayoutubification 1 month ago

    A very good presentation. I was able to install the web app on localhost and it is working fine. However, I am not able to install it on my web server hosted by Bluehost. What configuration changes need to be made in the yml file?

    • @Techdox
      @Techdox 1 month ago

      What error do you get? Feel free to join the Discord and we can help

  • @ms7165
    @ms7165 5 months ago

    At 16:57, when executing docker compose up -d, why do I get "command not found"?

    • @Techdox
      @Techdox 5 months ago

      @ms7165 hey, make sure you have docker compose installed. Check out the docs that go through the Docker setup: docs.docker.com/engine/install/ubuntu/#install-using-the-repository. You might have the older version, which is docker-compose rather than docker compose
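
The difference between the two commands mentioned in this reply can be checked before running anything else. The following is only a minimal sketch: `compose_cmd` is a hypothetical helper name, not something from the video, and it assumes a POSIX shell.

```shell
# Print whichever Compose command is available on this machine.
# compose_cmd is a hypothetical helper name used for illustration.
compose_cmd() {
  if docker compose version >/dev/null 2>&1; then
    echo "docker compose"        # Compose v2 plugin ("the new one")
  elif command -v docker-compose >/dev/null 2>&1; then
    echo "docker-compose"        # older standalone binary
  else
    echo "no Compose installation found" >&2
    return 1
  fi
}

# Usage, from the directory that holds the compose file:
# $(compose_cmd) up -d
```

If the function prints nothing and returns an error, neither variant is installed, which would match the "command not found" message above.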

  • @ewfzappadis
    @ewfzappadis 1 year ago +1

    Thank you so much. Is there any way to put a lot of documents directly in one directory or another? I've heard that putting documents in the "consume" directory makes Paperless process them, but in my configuration it doesn't work... any idea?

    • @Techdox
      @Techdox 1 year ago +1

      Really good question I’ll have to look into it

    • @ewfzappadis
      @ewfzappadis 1 year ago

      Thank you

  • @mystereit73
    @mystereit73 1 year ago +3

    I tested a lot of solutions but reverted to a single archive folder where files are stored with tags in the filename...easy and fast to search...has been working well for almost 20 years

    • @Techdox
      @Techdox 1 year ago +1

      Hey, if it works it works! :)

  • @Michaelmannphotography
    @Michaelmannphotography 11 months ago

    I am on QNAP using Container Station and have been trying to figure this out for days now.
    I could get the basic linuxserver image working with access to the front end, but got all sorts of errors after that.
    Do you have any advice for getting this going in container station?

  • @stephanejaglin221
    @stephanejaglin221 1 year ago

    Great tutorial, worked directly out of the box following your instructions on OMV6 headless server, Thx a lot! (not too fast, English is not my 1st language and I used the pause option...)

  • @EduardoSantanaSeverino
    @EduardoSantanaSeverino 1 year ago +2

    Thank you for the video. I wonder if there is something similar but for Nextcloud.
    Because for me, Nextcloud is already managing my library of documents, but it is still missing some functionality, and the presented tools seem useful.
    However, how do you import the existing bulk of files? Or how do you keep it in sync with Nextcloud?
    Thank you. I am sure some people might have the same question.

    • @Techdox
      @Techdox 1 year ago +1

      In terms of keeping it in sync with Nextcloud, you could set up something like an rsync between Nextcloud and Paperless and copy the files between the Nextcloud volume and the Paperless volume. I might explore this and see how you could do it.
      I'm not sure if Nextcloud has any extensions that can do this; worth a look though.

    • @ArawnFR
      @ArawnFR 1 year ago +2

      there is OCR on nextcloud

    • @Techdox
      @Techdox 1 year ago +1

      @ArawnFR well that answers that :D

  • @enricoschiappa3643
    @enricoschiappa3643 5 months ago

    Thanks for the video. One question: can you explain how to install another menu language, like Spanish, instead of the default English?

  • @Techdox
    @Techdox 1 year ago +2

    I hope you enjoyed this video 😀

    • @MartinHiggs84
      @MartinHiggs84 1 year ago

      Nice video 😊
      I would alter the data/media volumes to docker folder defaults to /usr/src/paperless

    • @Techdox
      @Techdox 1 year ago

      @MartinHiggs84 Yeah, I was debating if it would be worth changing them from volumes to binds in that folder

    • @MartinHiggs84
      @MartinHiggs84 1 year ago

      @Techdox that makes sense - I was thinking you just need one backup set for /docker, then everything is covered as you expand

  • @Schniebel89
    @Schniebel89 1 year ago

    11:00 docker compose up -d
    no configuration file provided: not found

    • @Techdox
      @Techdox 1 year ago

      Make sure to run the command where your compose file is
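
The "no configuration file provided" error above usually means Compose found none of its default file names in the current directory. As a rough sketch, a hypothetical helper (`find_compose_file` is an illustrative name, and the example path is assumed) can confirm where the file is before starting the stack:

```shell
# Print the first Compose file found in a directory (default: current one).
# The four names are the default file names Docker Compose looks for.
# find_compose_file is a hypothetical helper name used for illustration.
find_compose_file() {
  dir="${1:-.}"
  for name in compose.yaml compose.yml docker-compose.yaml docker-compose.yml; do
    if [ -f "$dir/$name" ]; then
      echo "$dir/$name"
      return 0
    fi
  done
  echo "no Compose file in $dir" >&2
  return 1
}

# Usage (the directory is an assumed example path):
# cd ~/docker/paperless && docker compose up -d
# or, from anywhere, point Compose at the file explicitly:
# docker compose -f "$(find_compose_file ~/docker/paperless)" up -d
```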

  • @fbifido2
    @fbifido2 1 year ago

    what to do if you need to upgrade all the components, db/broker/tika, etc...?

  • @Fayaz-Rehman
    @Fayaz-Rehman 1 year ago

    Interesting - Do you have a yaml manifest for paperless to deploy on kubernetes ?

    • @Techdox
      @Techdox 1 year ago

      I don't but you could chuck the compose into ChatGPT to convert it haha

  • @Tedpikel
    @Tedpikel 1 year ago

    Nice tutorial. Does it grab text from images/scanned documents? If not is there another program that can do this?

    • @Techdox
      @Techdox 1 year ago

      Yeah. Should do that just fine :)

  • @nunes_a813
    @nunes_a813 1 year ago +1

    where are all the documents being stored / saved at? docker/paperless?

    • @Techdox
      @Techdox 1 year ago +1

      The files are stored in a Docker volume called media; the full path will be something like:
      /var/lib/docker/volumes/paperless_media/_data/documents
      You need root permission to access these volumes though
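
The path in this reply follows Docker's default layout for named volumes. A minimal sketch of that layout is below; it assumes the local volume driver, the default data root, and a volume named `paperless_media` as in the reply (the prefix depends on the project directory name). `default_volume_path` is a hypothetical helper, not a Docker command.

```shell
# Build the default host path of a named volume's data directory.
# Assumes the local volume driver and Docker's default data root.
# default_volume_path is a hypothetical helper name used for illustration.
default_volume_path() {
  echo "/var/lib/docker/volumes/$1/_data"
}

default_volume_path paperless_media

# To ask Docker directly instead of assuming the default (needs a running daemon):
# docker volume inspect --format '{{ .Mountpoint }}' paperless_media
# Listing the stored documents needs root, as noted above:
# sudo ls "$(default_volume_path paperless_media)/documents"
```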

  • @GOURAB_DAS_12
    @GOURAB_DAS_12 11 months ago

    Hey!! I have installed paperless-ngx on an Ubuntu server using Docker, and now I want to set up mount points between the Docker container (paperless-ngx) and my TrueNAS server, as all the documents are organized on my TrueNAS server. Is it possible to do? Please reply if possible 🙏🙏

    • @Techdox
      @Techdox 11 months ago +1

      I'm assuming you have a network share on your NAS? You can set up Docker network volumes. If your network share is already mounted on your host, then you can point Docker to that directory as if it were any other directory

  • @ndchartier
    @ndchartier 1 year ago

    Now that this works, can you access it outside your local network through a Cloudflare tunnel?

    • @Techdox
      @Techdox 11 months ago

      You sure can

  • @gismoscherer
    @gismoscherer 8 months ago

    perfect! thx a lot
    works right away..

  • @KadargoGaming
    @KadargoGaming 1 year ago

    Can you edit or create new ones on it?

  • @staticalmo
    @staticalmo 10 months ago

    did you ever see SMEs using this?

  • @icojones
    @icojones 1 year ago

    Hi, it's a great guide which worked flawlessly until trying to create the super user. When I enter the command docker-compose run --rm webserver createsuperuser, I get:
    -bash: docker-compose: command not found. I can't work out what's causing it and how to resolve it. Can you help please?

    • @Techdox
      @Techdox 1 year ago +1

      Are you on Ubuntu? Try installing docker-compose: 'sudo apt install docker-compose'. Also make sure you are still in the same directory as the compose file :)

    • @icojones
      @icojones 1 year ago

      @Techdox you're a star! Thank you for such a quick response...it must be breakfast time there! ;-)

    • @Techdox
      @Techdox 1 year ago

      It is indeed! Coffee time :)

  • @DariusOosthuizenSuper
    @DariusOosthuizenSuper 1 year ago

    I'm struggling to get Paperless-NGX to run, in the logs it says: "FATAL: password authentication failed for user "paperless"" - If you can, please give me some advice.

    • @Techdox
      @Techdox 1 year ago

      Did you create the user after deploying Paperless? :)

    • @DariusOosthuizenSuper
      @DariusOosthuizenSuper 1 year ago

      @Techdox I pulled the images with Compose and then tried to create the superuser with the command in the Docker Compose file, but it gives me this error before it runs. So the error comes up before I get to create the superuser... Thanks for replying...

    • @DariusOosthuizenSuper
      @DariusOosthuizenSuper 1 year ago

      @Techdox It seems to be working now after I changed the permissions for the db volume. But thanks for your help!

    • @Techdox
      @Techdox 1 year ago

      @DariusOosthuizenSuper great to hear!

  • @stan8926
    @stan8926 1 year ago

    docker compose up -d or docker-compose up -d?

  • @quadcom
    @quadcom 1 year ago

    Was it just me, or did you miss defining some persistent volumes in the YAML to store the imported docs? If that's the case, all the imports would live in the container, and when you update the images all the imports would be deleted!

    • @Techdox
      @Techdox 1 year ago

      It's using volumes, so the data is stored among 4 volumes which you can see at the bottom of the compose file: data, media, pgdata and redisdata

  • @orlandofury
    @orlandofury 1 year ago +1

    Hey, very useful. Any idea how to use the REST API to have it as a document repository for an app or multiple apps? A link or video would be helpful.

    • @Techdox
      @Techdox 1 year ago

      That's a great idea, thanks for the suggestion. Their APIs look very straightforward

  • @justinbailey6515
    @justinbailey6515 1 year ago

    This seems more convoluted than using the OCR scanning functions of Google Drive. Why not just use that?

    • @Techdox
      @Techdox 1 year ago +1

      Well, you could, but that defeats the entire point of the video haha. This is an alternative to something like Google Drive, where you are in control and self-hosting

  • @AnnoyingErrors41
    @AnnoyingErrors41 1 year ago

    Quick warning (let me know if this was because of something I did wrong): if this is your first time installing it, the terminal output will scroll and flicker wildly. Not to worry though; if you look closely, it is just downloading all the dependencies.

  • @fa8ster
    @fa8ster 1 year ago

    Any ideas on auto-tagging, categorization and document clustering powered by AI?

    • @Techdox
      @Techdox 1 year ago

      I myself am curious on how that works. They mentioned it was a thing, so I’m keen to understand how it works

  • @sebastiankastner7514
    @sebastiankastner7514 1 year ago

    What about persistence? If the container dies, your database dies too, doesn't it?

    • @Techdox
      @Techdox 1 year ago

      Two containers, but even if they do die all the data is stored on volumes that live on the host machine

  • @YeahNoTellTheTruth
    @YeahNoTellTheTruth 1 year ago

    Can you import an existing document archive, or is it a single document at a time? That might be painful for some who have a folder structure already in place.
    Or can you keep a folder structure and still use this? I tend to have other files, like images, in the folders that are relevant to the document, and separating them means it takes longer to find the relevant documents.

    • @Techdox
      @Techdox 1 year ago +1

      I would say you could point Paperless to the current location of your files and it should import them all

  • @wilko1251
    @wilko1251 1 year ago

    Which terminal do you use?

    • @Techdox
      @Techdox 1 year ago +1

      iterm2 using a Dracula theme with the fig plugin :)

  • @matuopm
    @matuopm 1 year ago +2

    AI treatment of documents would be great to autopick the categories and fill out the date and so on

    • @Techdox
      @Techdox 1 year ago

      That would be awesome: upload it and it just manages it all for you. Surely it could be implemented without too much hassle if you leveraged an existing AI like GPT

    • @Belgabor
      @Belgabor 1 year ago +1

      It says on the front page "Uses machine-learning to automatically add tags, correspondents and document types to your documents.", so it looks as if it's already there

    • @YeahNoTellTheTruth
      @YeahNoTellTheTruth 1 year ago

      Not everything needs to be coined AI, in this case it's just a matter of auto filling titles and dates, whether that's dates in files or document created dates, or file names. Artificial intelligence is not that.

  • @OverdriveCz
    @OverdriveCz 1 year ago

    That's a very interesting alternative to zipping and syncing with OneDrive or some other cloud. Paperless is way easier and more practical

  • @PWingert1966
    @PWingert1966 1 year ago

    This might be a nice addition to my home server running on an HP DL60 9!

  • @James7104
    @James7104 1 year ago

    What is the benefit of this software compared to Evernote?

    • @Techdox
      @Techdox 1 year ago +1

      They are pretty different, right? Evernote is a hosted note-taking service with paid features. Paperless is a document management service with OCR that you self-host

    • @James7104
      @James7104 1 year ago

      @Techdox I understand, but even the free Evernote version has similarities. All documents uploaded are searchable, even text in images. That is why I like it so much. This solution seems more complicated 🤔

    • @Techdox
      @Techdox 1 year ago +1

      @James7104 the big difference here is looking at self-hosted alternatives, rather than trusting other companies with data. For instance, I wouldn't be comfortable storing medical data there long term, compared to a self-hosted solution that I control. It's like a lot of what I cover on this channel: Nextcloud could be Google or Dropbox etc. It's all about alternatives :)

    • @James7104
      @James7104 1 year ago

      @Techdox Thanks for pointing that out.

  • @Gery-zg7ky
    @Gery-zg7ky 10 months ago

    thx bro

  • @VisionaryAudioandVideo
    @VisionaryAudioandVideo 1 year ago

    macOS does this already. Just scan, give it time to index, and you're done.

    • @Techdox
      @Techdox 1 year ago

      Yeah, like I have mentioned to others, this is focusing on a self-hosted service that can be used regardless of the device you use. It's more of a central repository for your documentation than something on an individual's device

    • @VisionaryAudioandVideo
      @VisionaryAudioandVideo 1 year ago

      My bad, I have a file system that suits me and I didn’t listen to what you were saying.

  • @cmohanc
    @cmohanc 1 year ago

    Docker being a paid service, the software is not free. Is there any other method?

    • @Techdox
      @Techdox 1 year ago +1

      Docker is most definitely still free. The business editions are not, but you don't need those

  • @Humuku
    @Humuku 1 year ago

    Or I just use macOS spotlight which also indexes content.

    • @Techdox
      @Techdox 1 year ago

      For sure, it’s nice to have dedicated services for this though so you can access it and get the benefits from any device :)

  • @keesverhagen9227
    @keesverhagen9227 10 months ago

    It is a great video for those without knowledge about installing Paperless in Docker on a NAS.
    But the video has a big "but": someone without knowledge absolutely cannot follow it. Within one minute they are lost, because you often talk very fast and not really clearly.
    Please delete this video and create a new one. We really need a version for dummies. I have watched this video 6 times now, to no use at all. I know this is also me, because I have no knowledge at all. But many times I replayed certain parts trying to understand what you are saying, and with the best intentions I cannot. And I so wish that I could learn to do all of this.
    If someone knows another good educational video for dummies, you would make me very happy. Or something on the web?

    • @Techdox
      @Techdox 10 months ago

      Yeah, I'm definitely not deleting my best-performing video with over 100k views haha. I can understand if you are getting stuck because the concepts of Docker and Docker Compose may be new to you.
      Check out this video I made about getting started with self-hosting: ua-cam.com/video/OfGmfZmpVsk/v-deo.htmlsi=Vnte0b_Qof4No4Yj

    • @keesverhagen9227
      @keesverhagen9227 10 months ago

      @Techdox I have watched it so many times. 😇
      I will watch the one from your link. Thanks.

    • @Techdox
      @Techdox 10 months ago

      Where are you getting stuck?

  • @Filip-r2o
    @Filip-r2o 1 year ago

    @8:20 🤮

    • @Techdox
      @Techdox 1 year ago

      The environment file is really only being used for a small number of variables. Unless we were making it public, you could remove it if you wanted and bring the timezone etc. over into the compose file

    • @Filip-r2o
      @Filip-r2o 1 year ago

      @Techdox In general, the amount of syntax in those files is disgusting... There should be some abstraction. There is no way these types of files will stay manageable as more complexity is added; new frameworks will just get piled on top and it will be a big mess

    • @Techdox
      @Techdox 1 year ago

      @Filip-r2o If you think this is bad, you should see Kubernetes manifest files haha. It may not seem like it, but there is some structure and logic to this :D

  • @tmdx6345
    @tmdx6345 1 year ago +4

    You're like my brother and other techies. You know your stuff but don't speak in human terms. So this is running in tor Nas / Docker dang

    • @Techdox
      @Techdox 1 year ago

      No Tor or NAS haha, just Docker. If Docker itself is new to you, that's why I link the getting-started video.
      Explaining the basics in every video would make them an hour each

    • @AndrewTa530
      @AndrewTa530 9 months ago

      Docker or K8’s?

  • @paultech9385
    @paultech9385 1 year ago +19

    Dude, slow down, enunciate and speak clearly. You have good info to share, that's why I'm watching it, but the staccato delivery makes my head go dizzy. 🤪

    • @Techdox
      @Techdox 1 year ago +5

      Haha! I hear you, I'll make sure to keep this in mind for future videos. The kiwi accent won't help either 😂

    • @smplchmp
      @smplchmp 1 year ago +18

      no issues here, sounds normal to me

    • @hallowatcher
      @hallowatcher 1 year ago +7

      Normal here as well, great content man!

    • @kalebburris6425
      @kalebburris6425 1 year ago +7

      Yeah no problem. Man's got an accent not poor communication.

    • @followthemoonrabbit
      @followthemoonrabbit 1 year ago +5

      Could follow you at 1.25x with no problem. And English is my second language. It's just kiwi :-) Please don't get discouraged, you're helping regional dialect awareness :-)

  • @andreaswiehl8482
    @andreaswiehl8482 1 year ago

    I've just tried to install with Portainer. I put the code from "docker-compose.postgres-tika.yml" in a stack and adjusted only my port and a few volumes. I got an error. After that I put the code 1:1 in a stack and deployed; I got an error there too. What am I doing wrong?

    • @Techdox
      @Techdox 1 year ago

      Hey, what’s the error?

    • @andreaswiehl8482
      @andreaswiehl8482 1 year ago

      @Techdox this error message showed up: "Deployment Error => failed to deploy stack: open /data/compose/16/docker-compose.env: no such file or directory"

    • @quadcom
      @quadcom 1 year ago

      Did you import the env file inside the Portainer UI at the bottom of the page?

    • @andreaswiehl8482
      @andreaswiehl8482 1 year ago

      No, I didn't, because I thought everything in there is commented out and the GID and UID are not necessary.
      Do you mean put the code from both directly into the Portainer stack and then deploy? I will test this and give feedback here. Thank you both for your support!

    • @andreaswiehl8482
      @andreaswiehl8482 1 year ago

      Now I've got PL running. I just took the "docker-compose.portainer.yml" from GitHub; that was a lot easier for me ;). But now I have the next issue to deal with: the login setup with the CLI. I'm running Docker on an OMV media server in Portainer, and I haven't had to do anything in a CLI yet. I can access my server via PuTTY on my Windows PC and log in as my user, but that's it. I have no clue how to get to the path in the video, "~docker/paperless", to run the command line needed to create the user

  • @stan8926
    @stan8926 1 year ago

    docker compose up -d or docker-compose up -d?

    • @Techdox
      @Techdox 1 year ago

      They do the same thing, but docker compose is the new one and I would suggest using that one

    • @stan8926
      @stan8926 1 year ago

      @Techdox It did not work for me so I had to use docker-compose.