Hi Everyone,
As I understand it, you want to export 1.2 million files from one folder out of Alfresco one by one, then delete the 1.2 million source documents in that folder, and then import them back.
If you are fetching nodes one by one from a large folder anyway, do not export them; just move them into a proper new folder structure. I did this for folders with more than 2 million nodes using the JavaScript API. It was not efficient; maybe the REST API would do better. With a move, you perform only one operation on each node in that large folder.
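For anyone who wants to try the REST route, here is a minimal sketch of what a single move call could look like. It only builds the request and does not send it. The host, port and `-default-` tenant in `BASE` are assumptions about your install, `build_move_request` is a hypothetical helper name, and authentication is left out, so treat it as a starting point rather than a tested tool.

```python
import json
from urllib import request

# Assumed base URL of the public REST API v1; adjust host, port and tenant.
BASE = "http://localhost:8080/alfresco/api/-default-/public/alfresco/versions/1"


def build_move_request(node_id, target_parent_id, base=BASE):
    """Build (but do not send) the POST request that moves one node.

    The move endpoint takes the destination folder id in the JSON body
    as "targetParentId". Authentication headers are not added here.
    """
    body = json.dumps({"targetParentId": target_parent_id}).encode("utf-8")
    return request.Request(
        url=f"{base}/nodes/{node_id}/move",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )
```

You would send each request with `urllib.request.urlopen` (or any HTTP client) after adding credentials. Note that if you page through the children of the large folder while moving them out, the moved nodes disappear from the listing, so always re-reading the first page is probably safer than advancing the skip count.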
To answer your question: deletion in Alfresco has about 5 steps. The last step is never even completed: Alfresco never deletes the bin files from the file system, it just moves them into contentstore.deleted. Read more about the Alfresco deletion process in the article Understand the Lifecycle of Alfresco Nodes. But again, I do not think you should delete documents from the repository at all.
Another good reference:
https://blyx.com/2014/08/18/understanding-alfresco-content-deletion/
There is also this utility project, which I have used on occasion. You have to adjust the code slightly for version 23 of Alfresco, but it is still a good guide to the principle.
https://github.com/keensoft/alfresco-deleted-content-store-cleaner
Or better yet, from personal experience: you can write Java code to split the contents of the folder into N folders, each containing at most 1000 nodes. That way you can retrieve data with Lucene queries and browse the documents without any particular problems.
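The splitting logic itself is simple. Here is a rough sketch of the planning step only, written in Python for brevity rather than Java. The function name `plan_buckets` and the `batch-` folder naming are my own assumptions, not anything Alfresco prescribes; creating the folders and moving the nodes would still have to be done through whichever API you use.

```python
def plan_buckets(node_ids, max_per_folder=1000, prefix="batch-"):
    """Map each target folder name to the node ids it should receive.

    Folder names are zero-padded so they sort in creation order.
    No folder is assigned more than max_per_folder nodes.
    """
    if max_per_folder < 1:
        raise ValueError("max_per_folder must be at least 1")
    plan = {}
    for start in range(0, len(node_ids), max_per_folder):
        name = f"{prefix}{start // max_per_folder:05d}"
        plan[name] = node_ids[start:start + max_per_folder]
    return plan
```

For 1.2 million nodes this gives 1200 folders of 1000 nodes each. Those 1200 subfolders then sit under one parent, so a second level may be worth adding if that parent becomes slow to browse.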