Are New Year’s resolutions still a thing after 2020? Given the way most of ours were blown out of the water in March of this past year, we’re not sure. At the least though, we learned that no matter our good intentions, the unexpected can still have its way with us. Thankfully we also learned new ways to plan and prepare (and we don’t mean buying 20 packs of toilet paper) to ensure that the unexpected isn’t quite as unpleasant as it might have been.
With this post, we want to help ensure that data loss is one challenge you can take OFF your list of potential unpleasantness in 2021. By preparing for accidental deletions and computer crashes with a computer backup or cloud storage plan, you can shelve at least one uncertainty for the rest of 2021 and beyond.
Best Practices for Starting Your Backup Plan
With the holiday season (and the sales that come with it) coming to an end, you may have updated to a new computer or need to set up a computer for one of your family members. If so, you may have heard about the importance of backup and want to know how to set it up yourself. First thing to know: It’s super easy!
To back up pictures and other files on your computer using a cloud backup system, you simply need to choose a service and install the software on your computer or laptop. Depending on what you choose, you may need to go through all of your files and folders and select what you’d like to protect. We’re partial to our backup service, however, which backs up everything on your machine for you. You don’t need to worry about anything getting missed. You won’t notice the Backblaze backup client is there, but it will store a backup of everything on your computer, and whenever you modify a file or add something, it will back that up, too. Other than ensuring your credit card is up to date and that you connect to the internet long enough for it to upload data, you don’t need to do anything else to keep the service rolling.
For many of us, accomplishing this first step is good enough to keep us feeling safe and sound for a long time. But if you’ve been reading about ransomware attacks, had a friend lose data, or you’ve ever lost data yourself, there are six more easy steps you can take to ensure MAXIMUM peace of mind going forward.
Top Six Things to Keep in Mind When Monitoring Your Backup and Cloud Storage Strategy in 2021
1. Lay Out Your Strategy.
When you’re just starting out, or even later on in your computer backup journey, it’s a good idea to have a basic backup strategy. Here are three questions to help you establish one:
What data needs to be backed up?
“Everything” might be your answer, but it’s a little more complex than that. Do you want to preserve every version of every file? Do you have external hard drives with data on them? Do you want to back up your social profiles or other data that doesn’t live on your machine? Make sure you’re truly considering everything.
How often should it be backed up?
Important files should be backed up at minimum once a week, preferably once every 24 hours. If your data changes less frequently, then scheduling a periodic backup might be better for you. If you have older hard drives you don’t use often, you might want to simply archive your backup for them, rather than needing to plug them in whenever you get close to the edge of your version history.
How should I continue to monitor my backup?
It can be devastating to find out that your data backup has been failing at the time when you may have lost your data. If your backup job has been running quietly for months, it is a good idea to check and make sure it’s doing its job. Testing the restore feature on your backup gives you the ability to check that all the data you deem important is going to still be there when you need it most.
2. Keep Data Security in Mind.
At the end of 2019, we shared six New Year’s resolutions to help protect your data, but we realize that some of your New Year’s resolutions may have been deferred. So here’s a little reminder that data security is always important! We’ll keep it simple: If you take one security step in 2021, make it to set two-factor authentication on all of your accounts.
Two-factor authentication notifies you whenever someone tries to log in to your account and will not give them access until you enter the second identification code. You can choose from many different delivery options to receive the code, like an SMS text, voicemail, or using an application like Google Authenticator (we recommend the latter as it’s the most secure).
Either way, two-factor authentication means that not only will hackers have to steal your credentials and password, they’ll also have to get access to one of your personal devices. Needless to say, this will greatly decrease the chances that your data will be compromised.
3. Know Where Your Data Lives.
Over the years, our data often becomes “scattered.” Bits and pieces of our data are strewn from place to place as we create new data on different platforms and services. Between new and old computers, multiple hard drives, sync services like Google Drive, all of your social profiles, and all the others, it’s easy to lose track of where your most important data is when you need it. Especially because many of these locations will not be covered by standard backup services.
Mapping out where your data lives will help you to track what’s being stored off of your computer (like on a hard drive or USB), what’s being synced to the cloud, and what data is being backed up.
Once you have an idea of where your data is, your backup strategy comes into play. If there are important files that are being synced or that live on a hard drive, you may want to think about moving those files to a device that is being backed up or to an archive. Once you do, you’ll never have to worry about them again!
4. Consider Which Retention Span Fits Best for You.
Backup retention—also known as data retention—is how long you would like your data to be archived. At Backblaze, you have three options for your data retention: 30 days (the default), 1 Year, or Forever Version History. Picking between the three can feel tricky but it really just depends on your needs. If you have a college student away at school for a year and want to make sure their data is retrievable in case of emergency (like a coffee spill on their computer in the library), then yearly may be the best option for you. If you are a writer who constantly needs to look back on past versions of material you have written, then forever version history may be the best option for you.
Any retention plan should work just fine as long as you are monitoring your backup and understand what data is still being retained.
5. Testing Restores
There’s an old saying that “Data is only as good as your last backup, and your backup is only as good as your ability to restore it.” When data loss occurs, the first question that comes to mind is, “Who is responsible for restoring those backups?” and the answer is simple: you are!
Think of testing your restore as a fire drill. When you go through the steps to restore your data you want to make sure that you know what the steps are, what files are backed up when you want to recover them, and what options you have for restoring your data. When testing out your restore, this may clue you in on potential holes in your backup that you can fix before it’s too late.
6. Archive Your Data
Backups are great for things you are actively using on your computer, but when you’re done with a project or your computer starts underperforming due to the amount of data on it, you may want to think about archiving that data. In cloud storage and backup, an “archive” is a place to keep data for long term storage. This ensures your computer can run its best with some freed up storage space.
Archives can be used for space management on your computer and long term retention. The original data may (or may not be) deleted after the archive copy is made and stored—it’s up to you! You can always store another copy on a hard drive if you want to be extra careful.
With our Backblaze B2 Cloud Storage product, you can create an archive of your data in various different ways. You can experiment with setting up your own archive by creating a B2 Cloud Storage Bucket within your Backblaze Computer Backup account. It’s easy (we even outlined a step by step process on how to do it), and more importantly, free: Your first 10GB of data stored are on us!
These are some of the recommendations we have for utilizing your computer backup and cloud storage account. If you could just try one, three, or more, then you are starting 2021 out right!
Writing a “year in review” for 2020 feels more than a little challenging. After all, it’s the first year in memory that became its own descriptor: The phrase “because 2020” has become the lead in or blanket explanation for just about any news story we never could have predicted at the beginning of this year.
And yet, looking forward to 2021, I can’t help but feel hopeful when I think about what we did with these hard times. Families rediscovered ways to stay connected and celebrate, neighbors and communities strengthened their bonds and their empathy for one another, and all sorts of businesses and organizations reached well beyond any idea of normal operations to provide services and support despite wild headwinds. Healthcare professionals, grocery stores, poll workers, restaurants, teachers—the creativity and resilience shown in all they’ve accomplished in a matter of months is humbling. If we can do all of this and more in a year of unprecedented challenges, imagine what we can do when we’re no longer held back by a global pandemic?
Looking closer to home, at the Backblaze community—some 190 employees, as well as their families and pets, and our hundreds of thousands of customers and partners around the world—I’m similarly hopeful. In the grand scheme of the pandemic, we were lucky. Most of our work, our services, and our customers’ work, can be accomplished remotely. And yet, I can’t help but be inspired by the stories from this year.
There were Andrew Davis and Alex Acosta, two-thirds of the IT operations team at Gladstone Institutes—a leader in biomedical research that rapidly shifted many of its labs’ focus this year to studying the virus that causes COVID-19. After realizing their data was vulnerable, these two worked with our team to move petabytes of data off of tape and into the cloud, protecting all of it from ransomware and data loss.
And then there were Cédric Pierre-Louis, Director of Programming for the African Fiction Channels at THEMA, and Gareth Howells, Director of Out Point Media, who worked with our friends at iconik to make collaboration and storytelling easier across the African Fiction Channels at THEMA—a Canal+ Group company that has more than 180 television channels in its portfolio. The creative collaboration that goes into TV might not rival the life-saving potential of Gladstone’s work, but I think everyone needed to escape through the power of media at some point this year.
And if you had told me on March 7th—the day after we made the decision to shift Backblaze to mostly 100% work from home status until the COVID-19 situation resolved—that the majority of our team would work for 10 more months (and counting) from our kitchens and attics and garages…and that we’d still launch the Backblaze S3 Compatible APIs, clear an exabyte of data under management, enable Cloud to Cloud Migration, and announce so many other solutions and partnerships, I’m not sure which part would have been harder to believe. But during a year when cloud storage and computer backup became increasingly important for businesses and individuals, I’m truly proud of the way our team stepped up to support and serve our customers.
These are just a sampling of the hopeful stories from our year. There’s no question that there are still challenges in our future, but tallying what we’ve been able to achieve while our Wi-Fi cut in and out, our pets and children rampaged through the house, while we swapped hard drives while masked and six feet distant from our coworkers, there’s little question in my mind that we can meet them. Until then, thanks for your good work, your business, and sticking with us, together, while apart.
Top 10 lists! You know them. You read them! You love them? As 2020 comes to an end and we look longingly at the new year ahead of us, I wanted to take a moment and look back at what you, our blog readers, have found amusing, entertaining, and informative over this past year.
To do that, we looked at our analytics and picked out the top 10 most-viewed stories that we published in 2020. The results may not shock you, but they may entertain you, especially if you missed any of these the first time around. Without further ado, let’s jump into the results!
Top 10 Backblaze Blog Posts From 2020
1. 2019 Hard Drive Stats. It’s not surprising to see a year-end hard drive stats post in the first position. Readers show up for these posts in a big way and this one took a look at the entirety of 2019 as a year-end wrap up.2. The Complete Guide to Ransomware. With huge organizations like Foxconn, Kmart, many K-12 school districts, and hospitals being targeted by ransomware in recent years—and those attacks increasing—it’s no wonder that people are seeking to understand how to protect themselves.
3. & 4. Q1 2020 Hard Drive Stats and Q2 2020 Hard Drive Stats. The quarterly drive stats set the stage for our popular yearly reviews and provide a “heartbeat” of how our spinning disks are doing throughout the year.5. A Beginner’s Guide to External Hard Drives. We took a look at some best practices for folks looking to increase their on-site storage capacity and how to make sure all that data is safe, as well. It would appear a lot of readers were onboarding new hard drives in 2020.6. Synology Backup Guide. Other readers already have a series of external hard drives connected to their PC, meaning the natural progression is getting a NAS system like Synology in place and making sure that it, too, is backed up.7. Q3 2020 Hard Drive Stats. Looking at how the stats are progressing, we find that even when some drive models have over 4,029 failures, their annualized failure rate can be below 3%—that’s scale!
8. Backing Up Google Drive. Far be it from us to claim that we saw the future, but when we published this post in June it was a touch ahead of its time. A few months later, Google announced the end of their unlimited storage plan and as people looked for alternatives, this resource on downloading and backing up Google Drive information became invaluable.9. Backblaze S3 Compatible APIs. One of our biggest Backblaze B2 Cloud Storage releases for 2020 was the Backblaze S3 Compatible APIs suite. This launch cleared the way for a ton of new partner integrations, use cases, and happy cloud storage customers.10. Cloud Sync Is Not Backing Up. A common misconception is that someone is backed up if they only use iCloud, Google Drive, or Dropbox. Nothing could be further from the truth. In this post, we dig into the differences between cloud backup and cloud sync, why they’re both useful, and how to leverage both for maximum efficiency.
The “Up-and-coming” Top Ten
Looking at the top 10 list for 2020, we see a lot of series and subjects that are popular every year. This got us thinking, what about the stories that broke new ground? Posts that aren’t hard drive stats and yet still drew an admirable number of readers? When we removed the big hitters we found an alternative top ten that will appeal to anyone looking for some more in-depth solutions, some nice news, and answers to a few evergreen questions!
1. What Is an Exabyte? What the heck is an exabyte anyway? We take a look at how much data that really is, and how it compares, on a cosmic level, to a gigabyte.2. Object vs. File vs. Block—A Cloud Storage Guide. The word “cloud” can sometimes feel amorphous. For readers just starting to look cloudwards, this post aims to help put a finer point on clouds! We take a look at the different types of cloud storage and how to most effectively use each.3. Duplicati + Backblaze. We love when B2 Cloud Storage gets integrated into popular apps, and Duplicati makes backing up data securely and easily from pretty much any system a piece of cake. No wonder it pairs so well with Backblaze B2!4. Metadata: Your File’s Hidden DNA. Metadata surrounds pretty much every digital thing we do on a day to day basis, but a lot of people don’t fully understand what it is or how it works. This post defines metadata and looks at how it helps programs keep track of the information about files for both humans and computers.
5. Free Cloud Storage? What’s the Catch? There are a lot of “free” offers in the cloud storage marketplace positioned to help entrepreneurs get their application or website off the ground. In this post we go into some of the pitfalls that might come about when you take cloud storage providers up on an offer that might seem too good to be true.6. Computer Backup Version 7.0.1. We took some time at the beginning of the year to make some adjustments to our cloud backup software, improving performance and enhancing our Inherit Backup State feature to help folks avoid reuploading data if they switch computers!7. Exabyte Unlocked. In March, Backblaze crossed a data storage threshold that few other companies have achieved, storing over an exabyte of data for our customers, and we couldn’t be prouder.
8. How to Wipe a Mac Hard Drive. As people get new computers and sell off their old hardware, sometimes they want to make sure that all of their data has been deleted from their computer (just make sure you have a backup first).9. Upgrading to an SSD. Once readers finish wiping their old drives, they often want something a bit more speedy. SSDs are dropping in price and getting more common, so this post gives you a few things to consider when upgrading.10. RAM vs. Storage. This post takes a look at one of the most commonly asked questions when people talk about gigabytes—“Do they mean RAM, or do they mean storage size?”—and what’s the difference between the two anyway?
We love writing about the ins and outs of our industry, infrastructure, and the business in general, so it’s always fun to look back at what resonated with you over the past year. Was your favorite blog post not listed? Let us know in the comments below what resonated with you this year!
Ever wonder if your feedback is heard when you tell a company why you are cancelling your subscription? Well, at Backblaze, customer feedback isn’t just heard—it’s read, considered, and used to improve the product over time.
Most companies seek to understand the reasons customers leave by setting up a formulated poll with a multiple choice style list of common reasons for why you may be leaving. We decided to manage this process a little differently by giving customers who decide they no longer want to use Backblaze Computer Backup an open forum.
This format allows people to be specific about their reasoning, and in some cases to vent about their frustrations. By sifting through these responses and grouping them under common causes, we gain insights into the customer experience that allow us to create a better product.
When customers choose to cancel our service, we send this email:
Over time, the responses to these messages have helped us enhance our Computer Backup product and add new features to it that we knew customers would like thanks to this process. Because our approach is somewhat unique, we wanted to illuminate it for you, both to be transparent and also for anyone that might find our model useful.
What Is Churn Analysis, and Why Is It Important?
When a customer leaves a service or cancels an account, it’s called “churn.” Churn can be calculated as the percentage of customers that stopped using your company’s product or service during a certain time frame. The churn rate calculation for subscription or service-based products is an excellent metric to gauge their performance.
As much as you wish it wouldn’t happen when running a business, customer churn is a real thing and important to keep an eye on. You may already know about some issues your service has that need to be addressed, but by tracking churn over time you can also identify new issues or discover that issues outside of your scope are more important than you thought. When these issues turn out to be easily fixable, they provide a direct path to decreasing churn and often also attract new business. This is churn analysis: identifying the reasons people are leaving and prioritizing their resolution.
The Nuts and Bolts of Churn Analysis at Backblaze
Every month, 10% of the customers that churn actually offer substantive responses for their departure. On the 10th day of each month, one hearty staffer sifts through all of the messages that we receive and adds them to a large spreadsheet. Unsurprisingly, every month, the reasons people cite for leaving are relatively similar, so she’s able to group the messages into 10–15 different categories. These categories range across different feature requests that we are tracking, like issues with our safety freeze feature, as well as trends with different accounts, like their desire for two-factor verification set up, and various other reasons.
When different reasons begin to gain or lose ground, it’s a sign that we need to do something. Depending on the reason, it might mean that we need to write a more informative FAQ, or that we need to work with Marketing to highlight a feature better, or that we need to notify engineers that there is something that needs to be fixed or built.
So Why Do People Churn From Backblaze?
To illustrate how we go from churn analysis to product development, we gathered the five top reasons customers churned from Backblaze, and what we’ve decided to do about it (or not).
Reason #1: “I No Longer Need My Data Backed Up”
Customers use Backblaze Computer Backup for various reasons. Some of them have long term needs, like wanting to protect the files on their home computer. Others may be thought of as temporary, like backing up freelance businesses or college projects. The former tend to stick around, while there’s not much we can do to convince the latter that they might want to rethink their approach.
As a result, “I don’t need it anymore” is one reason that’s always on our list. But that’s not to say we’re not doing anything about it. If you read this blog, you know that we’ll take any opportunity to remind people that there are more reasons for long term backups than most folks assume.
Financial documents, legal correspondence, essential application settings, system information, and all of the important data you’ve forgotten you have on your machine until it crashes are great reasons to second guess a spotty back up strategy. If you have a computer, you should have a backup in place to protect yourself from accidental or incidental data loss. In fact, we recommend a 3-2-1 backup strategy to ensure that you’re always covered.
Resolution #1: No specific response in product development, but a rigorous marketing campaign to argue against the premise of their departure.
Reason #2: “30 Day Deletion”
All Backblaze Computer Backup accounts have 30 Day Version History included with their backup license. That means you can go back in time for 30 days and retrieve older versions of your files or even files that you’ve deleted. For years, we had customers respond that they would continue to use Backblaze if we retained their files a little bit longer than 30 days.
We took that feedback and created the ability to keep updated, changed, and even deleted files in their backups for a longer period of time by extending Version History for the computers backing up in their accounts. We chose to build this feature because the engineering investment was easily offset by the number of customers we could retain and/or gain by offering some customized approaches to data retention.
Since 2013, customers who told us that they were cancelling due to our Version History being set to only 30 days hovered around 5.91% out of the total responses to reasons for leaving. Since we made a change in 2019, and started educating people that the feature exists, we’ve now seen a large number of people enabling Extended Version History. Reports of customers leaving for Version History reasons is now down to 3.37% for 2020 and is dropping quickly.
You can now increase your peace of mind by enabling Yearly or Forever Version History on your account—all thanks to the customers who wrote in and told us why it was important to them.
Resolution #2: Build a new feature set to answer a reasonable request with a reasonable offering.
Reason #3: “Leaving For a Sync Service”
There’s unfortunately still some confusion between backup (which Backblaze provides) and sync and share services, like Dropbox and iCloud.
So what’s the difference? We wrote a blog post to explain it, but to summarize: Sync services will synchronize folders on your computer or mobile device to folders on other machines, allowing users to access the same file, folder, or directory across different devices. This is great for collaboration and reducing the amount of data you’re holding on any number of devices. But it’s completely different from a backup. In a sync service, only the files, folders, or directories you add to the service are synced, leaving the rest of the data on your computer completely unprotected.
Backblaze’s cloud backup automatically backs up all user data with little or no setup, and no need for the dragging and dropping of files. If your friends tell you they are using a sync service to back up their personal data, let them know they may need a backup service as well—before they learn that lesson the hard way.
Resolution #3: Similar to resolution #1, the response to confusion about what different services do is: Education. Tens of thousands of folks have already read our post about the difference between sync and backup, so hopefully we see this reason decrease over time.
Reason #4: Too Expensive
We’ve all been there. We look in our bank account and realize we accidentally signed up for a few too many monthly services and we need to cut back to pay the essential bills. At Backblaze, we realize that times can get tough and occasionally you will need to cut back on expenses.
Keeping this in mind, we strive to be the most affordable unlimited online backup service for our customers. Over the course of 10+ years since Backblaze started backing up customer computers, we have only raised our prices once, by $1 (and wrote about how hard it was to do even that).
When deciding which monthly service to keep, we hope you consider the value of keeping all your files safe and protected and the cost of losing precious memories or important documents.
Resolution #4: Sometimes your product may be too expensive for people’s budget and they will leave. All you can do is work to be as affordable as possible and stress the value of your service.
Reason #5: Switched to Backblaze B2 Cloud Storage
“Hey Backblaze, we love your product but we are leaving to use B2 Cloud Storage!” Some Computer Backup customers occasionally write in with this response and we get a good chuckle from it… because B2 Cloud Storage is also a product of Backblaze. Backblaze B2 Cloud Storage was created to be a simple and flexible cloud storage platform and, with the help of integration partners, it can be a very nifty backup solution for more tech-savvy users!
We actually love when this reason pops up! It lets us know that people are moving on to the product that’s right for them. Backblaze B2 was created as a result of customers writing in and saying “I love your backup service, but I need a place to just store the data on my server or NAS device. Can you give me direct access to your cloud storage? Is that possible?” So we created a product that could do just that.
If you have been backing up your computer for a while, you may be curious about cloud storage or have heard about cloud storage and thought it might be too technical for you—don’t worry, we have all been there. We put together a quick starter guide that highlights how simple Backblaze B2 can be.
Resolution #5: When the customer starts to outgrow your starter product, guide them to the product that fits them best.
What Churn Responses Look Like Over the Years for Computer Backup
About 10% of our customers that leave respond to our “how can we do better” email after cancelling their accounts. This number tends to be pretty constant, but when it rises above that range it usually indicates that something unique happened that month.
An uptick in churn isn’t always a bad thing. We saw a rise in responses when we announced our first European data center because customers were switching their accounts to the EU region. It was a good sign that people were excited about the availability of different regions for storing their data.
Giving the option for customers to share personal responses also notifies us when a new issue arrives. This can help us identify and fix bugs in our system that might only be caught in very specific situations that may not be seen by our engineers in our initial testing.
They can also clue us in on world events. We started to see high trends of customers reporting COVID-19 related reasons for cancelling their accounts back in January 2020. This helped us assess in a timely manner how we could support our customers during a worldwide pandemic.
The following graph shows you how a few different reasons for leaving have changed over the past few years:
All Feedback Is Good Feedback
You may find it a bit crazy but there really is a person at the other end of your responses—reading your feedback and sharing it with the rest of the gang at Backblaze. That feedback has provided us useful updates, new features, and peace of mind knowing that our customers feel heard.
So, we want to say thank you to all the previous customers that took the time to write out why they were breaking up with Backblaze. Without that feedback, we wouldn’t be the company we are today.
To this day we are still updating our products to meet our customers’ needs and we love to hear what our customers hope to see as our next feature. Do you have a feature request? Share it in the comments below!
The new operating system from Apple, MacOS 11.0 Big Sur (a reference to a lovely stretch of California’s central coast, if you’ve never been) releases on November 12, 2020. We’ve been preparing for this release for quite some time and are glad to report that the Backblaze Computer Backup client is “Big Sur-ready.” As always, a couple of notes to keep in mind before updating to a new OS: make sure you have a good and up-to-date backup in place, and that you are running the latest Backblaze version.
If you’re already running the Big Sur beta, or are planning to upgrade to the latest MacOS version on day one, please make sure you are running the latest Backblaze client version. You can download and install the latest build by doing the following:
Perform a Check for Updates (right-click on the Backblaze icon in the Mac menu bar)
Or download the latest from www.backblaze.com/update.htm
Note, if You Are on MacOS 10.9 Mavericks or older: And you want to update your client before upgrading your macOS, you will need to download the latest version of Backblaze here.
Recently, there’s been speculation about the U.S. banning TikTok due to privacy concerns. As fellow TikTok enthusiasts, my friends and I quickly took to the app to download all our favorite content. Today, the addictive social media app is still available in the U.S., but concerns remain that it may not be for long.
For many people, TikTok has been a great source of entertainment during lockdown. If you’re like my friends and I, you probably want to save some of the dances, recipes, skits, and some of the other totally indescribable things you’ve encountered there.
We don’t know what will happen in the future, but at Backblaze, we are all about saving precious memories. And social media is one place where things you want to hold on to can suddenly disappear or become inaccessible for reasons beyond your control. In fact, we’ve gathered a handful of guides to help you protect content across a number of different platforms.
But today, our focus is the 15 to 60 second clips you know and love on TikTok. So here are some ways to be prepared in case you can no longer access the app in the future.
How to Download Your Personal TikTok Data
You can request a copy of your TikTok data and download information like your profile (username, profile photo, profile description, and contact info), your activity (videos, comment history, chat history, purchase history, likes, and favorites), and your app settings (privacy settings, notification settings, and language settings). The steps to download your TikTok data are the same for both iPhones and Androids.
1. Open TikTok on your phone and go to your profile.2. Click on the three dots that appear at the top right corner.3. Under “Account,” select “Privacy and safety.”4. Click on “Personalize and data” → “Download your data.”
5. In “Download your data” you will see more information about what you can download. Scroll to the bottom and click “Request data file.”6. In the second tab titled “Download data,” you will see that your request is pending.
7. Once your data is ready for download, you will receive a message in your TikTok inbox that says “System Notifications: The file you’ve requested containing your Tiktok information is now ready for you to download.” Tap that message and select “Download.”8. When the file is downloaded, you can find all your comments, direct messages, activity, and more. To save your TikToks, click on the “Videos” folder → “Videos.txt” file.9. Warning: You are not done yet! The file you’ve received has information about your TikToks like the date you published them, the video link, and the number of likes you got. But it doesn’t include the actual video itself. To archive the video, you need to copy and paste the video link into your web browser, then download the TikTok to your device. Yes, it will take some time to download all your videos, but if they’re worth it, they’re worth the time!
Keep in mind that these are the steps to download the TikToks that you have personally created and uploaded to your account. If you’d like to save TikToks made by other people, keep reading.
How to Download TikToks by Other Creators
The process of downloading other peoples’ TikToks is a little more manual, but unlike requesting your TikTok data like above, there’s no waiting time. Here’s what you’ve got to do:
1. Open TikTok on your phone and go to the video you want to save.2. On the right side of the video, click on the arrow which indicates the “Send to” button.3. Under “Share to,” click “Save video.”
4. That’s it—the video is now saved to your phone!
Note: Some people may have set their videos to be non-downloadable. They probably have a good reason for that! It should go without saying, if you’re downloading other people’s content, don’t use it for any purposes they might not offer consent for.
How to Back Up Your TikToks
Once you’ve got all your TikTok data on your phone, it’s time to back it up. Those of you with iCloud may think you’re in the clear. Unfortunately, iCloud is not a backup service; it simply syncs your data with your other Apple devices. This means that if your Mac and iPhone are synced and you lose the saved TikToks on your iPhone, you will lose them on your Mac too. You can read more about using iCloud here.
Since iCloud shouldn’t be used as a backup service, we recommend you use a computer backup or cloud storage service instead. To do this, you first need to transfer your TikToks from your phone to your computer. And then, it’s time to back it up!
Lucky for you, we already have a detailed blog post about backing up your social media content. The post covers the difference between computer backup vs. cloud storage and how you can use Backblaze B2 Cloud Storage to archive your social media data. With Backblaze, you can store as much data as you’d like with no limitations. So whether you’re an avid TikToker with thousands of videos or just getting started on the social media platform, we’ve got you covered.
In the beginning there was the World Wide Web and, for us common folk, it was used to send electronic mail and instant messages. Then the internet became a place where the average user could share their voice, videos, and pretty much everything else. But how permanent are these things we share? When it comes to the memories we want to hold on to, will they always be there?
We’ve all lived through our own different phases of the internet age. There was the AIM phase, Napster phase, Wikipedia phase, Skype phase, and of course the boom of social media with Twitter, Facebook, Instagram, and more. Some of these websites and apps are still here, some look a little different, and some are not around anymore. (Like Vines, boy do we miss Vines!)
In 2019, it was reported that internet users spend an average of two hours and 22 minutes per day on social networking. If we are spending even a fraction of that time each day creating content to be shared with family and loved ones, don’t we want to make sure we have those creations forever?
We think so! And so we’ve developed a series of posts to help you retrieve your data from social media profiles, ranging from Facebook to Tiktok, and other services where the long term reliability of or your data might be in question. In this post we will go more in depth about best practices of how to back up this data once you’ve downloaded it.
Review: Retrieving Your Data
If you’re like most people, you probably have your data spread out across multiple platforms. Depending on where you like, share, and post, there are various ways to download your data to keep a copy of it on your computer. But how do you figure out how to do this for each platform? We’re glad you asked! Here’s our list of guides you can consult right now. We’ll work to grow this list over time, but don’t hesitate to reach out if you’d like to see different platforms covered.
Facebook: When your uncle saves the family’s treasured reunion photos only on Facebook, it’s time to consult this guide.
Google Drive: You know that college paper is going to be Pulitzer-worthy someday—make sure you have it backed up!
Due to the vast variety of options available on the internet, we may have missed a few you want to know about. While there’s not one solution for every platform, there are some typical steps that could help you with a service we haven’t covered yet:
Some websites and apps have an area in your account settings or privacy settings where you can request your data, like Twitter, which has built this feature into their user account section. If functionality like that isn’t immediately apparent, your next best option is to search the support FAQs to find the process for user data requests. Some platforms do not have this feature available at all yet, so you should be careful to understand the guidelines for retrieving data at any company before you start storing your photos, audio files, and more there.
Once you’ve downloaded your data successfully, the next challenge is safeguarding it for the future.
Now That It’s on My Computer, What Should I Do Next?
Downloading the internet memories you’d like to keep is step one. If you’re reading this, you probably already use Backblaze Computer Backup to safeguard the data on your PC or Mac. (If not, make sure your computer is backed up, preferably with a 3-2-1 backup strategy.) But just because you back up your data, that doesn’t mean you want to keep archival memories on the computer you use every day.
Depending on the size of the data you downloaded, you may now have a far larger quantity of files on your computer than you’d prefer. Those YouTube videos you made with your friends back in 2008 might be old, but they ain’t small. Your computer may be thinking the same thing. Even if you choose to store the memories on an external hard drive, remembering to plug in and back up multiple drives can be hard over the long term.
Backups are great for things you are actively using on your computer, but when you’re done with a project or want to store a memory safely, you may want to think about archiving that data. In cloud storage and backup, an “archive” is a place to keep data for long term storage. Most importantly for this post, an archive helps to protect data you want to retain, but don’t need regularly, while ensuring your computer can run its best with some freed up storage space.
Archives can be used for space management on your computer and long term retention. The original data may (or may not be) deleted after the archive copy is made and stored—it’s up to you! You can always store another copy on a hard drive if you want to be extra careful. This is the difference between computer backup and cloud storage. In both cases, data is stored in the cloud, but in backup, the data in the cloud is a copy of the data on your computer. In cloud storage, it’s just saved data—there’s no mirroring or versioning.
Our Backblaze B2 Cloud Storage product allows you to create an archive of your data in various different ways. You can experiment with setting up your own archive by creating a B2 Cloud Storage Bucket within your Backblaze Computer Backup account. It’s easy, and more importantly, free: your first 10GB of data stored are on us!
Creating a B2 Archive
For this example, I downloaded data from my personal blog, hosted on WordPress. My blog has various types of files (photos, videos, text, audio) so it’s a good example of the diverse set of files that are good candidates for storing in the cloud.
After downloading my data from WordPress and creating a new folder on my desktop filled with the files I want to archive, the next step is to sign into my Backblaze account. After signing in, I navigate to the left sidebar and select “Buckets” under the section “B2 Cloud Storage.”
On the B2 Cloud Storage Buckets page I select “Create a Bucket.” You can think of buckets as a folders feature when storing data in B2 Cloud Storage. There is no limit to the number of files you can keep in a bucket, but there is a limit of 100 buckets per account.
When I select “Create a Bucket” a pop-up appears, guiding me to create a unique bucket name and decide whether the bucket will be “private” or “public.” Setting the bucket to “private” means that every download requires an authorization token. Setting it to “public” means that everybody in my group (if your account is a group) is allowed to download the files in the bucket.
When I create a bucket, I get to pick the name. The name must be unique—never been used before by you or by anybody else. In other words, a bucket’s name is globally unique.
For my example, I named my bucket “WordpressNicolePerry” and set the bucket to private. Once the bucket is created you can start uploading files and folders.
When I click the button “Upload,” a pop-up appears, prompting me to drag and drop files or folders I want to upload to that bucket. And then, bazinga! Your files are now uploaded to the cloud!
Wow! Cloud Storage Is Easier Than I Expected
If you have been backing up your computer for a while, you may be curious about cloud storage or have heard about cloud storage and thought it was too technical for you—don’t worry, we have all been there. But, the internet and social media seemed hard at first and now look at where we are at! Play around with buckets in B2 Cloud Storage. If you feel like they’re the right spot to keep your memories, you can learn more about pricing and other functionality here.
At the end of the day, when it comes to making sure my long lost Vines, Facebook photos, and Google data are somewhere safe without gunking up my computer’s memory, I’ve found that the few bucks a month I put toward B2 Cloud Storage seem like a small price compared to juggling hard drives and other archiving practices.
Creating content for social media, whether for a business or personally, is an ever changing process as new platforms appear. So, keeping that data in an easily accessible place where I can download it and upload it to a new platform is worth the cost for me. But that’s one solution coming from this social media guru. How have you kept up with the times? We would love to hear your solutions in the comments below.
We recently released an update for Backblaze Computer Backup: version 7.0.2! This release comes with improvements to our Safety Freeze feature, and some enhancements to the Mac and Windows applications. Enjoy!
What’s New for Windows & Macintosh:
Improvement: Safety Freeze
Safety Freezes exist to protect your data from corruption, but lately, they’ve been a touch over-cautious. This improvement updates Safety Freeze to reduce the amount of false-positives users experience.
Bug fix: Multiple hard drives listed
Some users experienced duplicate volume listings in the application, which led to confusion. This release addresses that issue.
Minor improvements to logging.
What’s New for Macintosh:
Bug fix: Location Services
This feature has been reworked to reduce the amount of pop-ups received when Locate My Computer is enabled on more recent macOS versions. This also fixed an issue where disabling Locate My Computer on the web would still result in a pop-up asking for Location Services permission.
Release Version Number: Mac — 18.104.22.1684 PC — 22.214.171.1243
Cost: Free for Backblaze Computer Backup consumer and business customers and active trial users.
Immediately when performing a “Check for Updates” (right-click on the Backblaze icon and then select “Check for Updates”).
With so many services out there that offer businesses a way to store and protect files online, they might all seem like the same service. When considering backup and sync strategies, owners often ask, “Can’t we just store all our files on Google Drive or Dropbox and call it a day?” The short answer is no, not if you want to properly protect your business from data loss.
While cloud-based sync services may seem to operate with backup-like functionality, they will not protect you from total data loss. For Pierre Chamberland—founder of NetGovern, an informational governance solution—making this distinction between sync and backup was a vital realization for his company’s information security.
Before rolling out a cloud backup solution for his business, Chamberland designated Microsoft OneDrive as the central source for storing his team’s files and projects. This served as an excellent tool for collaboration and quick and easy access to files. But when Chamberland suspected that not everyone was keeping copies of their data in OneDrive, he decided to conduct an audit. He found that only 20% of his staff had properly backed up their work.
“In the event of a catastrophe, we could lose hours to potentially weeks of work,” Chamberland explained. He needed a way to safely protect all of the company data, which he was able to do by rolling out a proper cloud backup.
Chamberland’s story ends well, but plenty of business owners only learn the difference between backup and sync services in the most painful circumstances: after data loss. This post aims to provide information to help you understand how to best use sync, backup, and cloud storage services together to ensure that your business’s data is stored both securely and in the most optimal way for productivity.
What’s the Difference Between Cloud Sync, Cloud Backup, and Cloud Storage?
It’s helpful to understand how cloud sync, cloud backup, and cloud storage services differ from each other, and how they complement one another. Each performs a unique, helpful service, but learning the differences will help you more effectively put them to work for your specific use case.
You’re probably familiar with services like OneDrive, Dropbox Business, or Google Drive. These services sync (short for “synchronize”) files or folders on your computer to your other devices running the same application, ensuring that the same and most up-to-date information is merged across each device.
Sync services allow multiple users across multiple devices to access the same file, making it incredibly useful for collaboration and for sharing information with others. But because these services are designed for syncing, if your coworker deletes a shared file, that change will be reflected across all devices, and you may lose access to that file forever. Though most sync services offer a limited way to restore changed or deleted file versions, they aren’t true backups and remain susceptible to major data loss.
A cloud backup tool takes all of the data on your computer and stores it safely somewhere remote from your work environment. It works similarly to a traditional backup which would catalog and save all of the files on your computer to an external hard drive or a storage server on your local network. Except, in this case, your data is stored in an off-site server—also known as “the cloud.”
Cloud backups are optimized to allow businesses to easily recover their data in case a computer is lost, stolen, or compromised. Backups offer various options for data recovery allowing users to quickly access files via web and mobile applications or have their data directly shipped to them via a USB hard drive. The point is, cloud backups ensure complete protection from data loss and are meant to help your business recover swiftly.
Cloud storage is what makes cloud sync and cloud backup possible. Cloud storage providers like Backblaze B2 Cloud Storage offer the backend infrastructure for the storage of data, which services like Dropbox or Backblaze Business Backup are built on top of. It is the physical location where backups are stored and syncing occurs.
And yet, while a simple definition of cloud storage is that it is the raw storage that these other services are built on top of, it is also true that you can utilize cloud storage to build a unique service or application.
Most cloud storage providers offer an application programming interface (API) that lets you directly connect to the cloud storage of your choice, giving you the ability to create a service that does exactly what your business needs it to do. Alternatively, you can choose an integration partner that pairs with the cloud storage provider giving you the same direct connection to the cloud without having to do any technical development.
Cloud Sync Is Not the Same as Cloud Backup
Sync services were not built with backup in mind. They often rely on the user having a folder on their computer that is designated for OneDrive, Google Drive, or Dropbox. Users place files into that folder when they want their data to appear on other devices via the sync service.
This is an excellent way to avoid having to email yourself or your team files that need to be shared or worked on together. However, it’s important to remember that files outside of your team’s designated folders, i.e. in Documents, Downloads, Photos, etc., will remain locally stored on your device, and not synced to the cloud.
Just as sync services aren’t the same as cloud backup services, the reverse is also true. Though backup services may allow you various options to remotely access your data and share individual files when you need to, they are not suitable for use as collaboration tools. Instead, cloud backups ensure that all data on one device is backed up safely elsewhere. Instead of having to manually drag and drop files into designated locations, a backup will typically work automatically and in the background of your computer, backing up any new or changed data on the device. In the event of a computer crash, data loss, or ransomware hijack these backed up files will be available for recovery.
When recovering files from a business cloud backup service, it’s important to understand the versioning options they provide. Say you accidentally delete a file, but don’t realize it until a few months later. You may be unable to access the file if versioning limits apply, or you may only have access to the most recent version of the lost file. However, many services now offer features like extended version history, which allows you to recover files from past points in time, so you can easily restore older work.
Here is a table that provides a quick overview and comparison of cloud sync, cloud backup, and cloud storage:
Ensures that the same and most up-to-date information is merged across each device.
All of the data on your computer is stored off-site and in the cloud.
The infrastructure on top of which cloud sync and backup services are built.
Allows multiple users to access the same file, or files, across multiple devices.
Protects and recovers all of the files on your workstation in the event of data loss.
Backs up servers or NAS devices, or allows you to build unique services and applications.
Share and collaborate on work files seamlessly amongst your team.
Reliably protect all of the data on your computer automatically.
Gain more control and functionality beyond what pre-built services offer.
In the event of a major data loss, files that aren’t synced (or are outside of your sync folders) will not be recoverable.
Not great for file sharing and collaborating, and some services may have data and bandwidth caps.
May require additional resources if you plan to build out custom applications and services for your business.
Automatic or Manual?
Manual. Sync services rely on users dropping the files they wish to keep into designated folders on their devices. Files outside of these folders will not be synced to the cloud.
Automatic. With little to no configuration, a backup solution regularly and automatically backs up everything, even your designated sync folders.
Depends how you choose to set it up. Cloud storage providers will often have integration partners that offer the functionality you’re looking for.
Sync solutions may retain older or deleted versions of your files but these options vary from service to service.
May come with features like extended version history which help to recover older files.
Great for long-term data archiving and typically priced based on the amount of data stored.
Should My Business Use Cloud Storage?
It’s easy to understand how sync and backup services can help to foster collaboration and data protection in an enterprise because they deal with something we all do: manipulate, share, and save files and data. The question of whether cloud storage might serve a role in your tech stack is slightly more complex.
While cloud backups like Backblaze Business Backup are great for backing up the data on your Mac and PC laptops and computers—these often are not the only devices storing precious information. Some businesses require additional functionality to back up their on-premises server and NAS devices or create applications with unique functionality that serves their purpose. That’s when utilizing a cloud storage service is particularly useful.
Cloud storage providers supply data storage just as utility companies supply power, gas, and water. Cloud storage can be used for data backups, data archives, application data, media libraries, records, or any other type of data. They typically charge by a combination of data ingress, egress (in other words, the data coming and going), and the amount of data stored.
Backblaze B2 Cloud Storage supports integrations with NAS devices, as well as Windows, Mac, and Linux servers. We provide a complete solution for storing all types of data, in partnership with vendors who integrate various solutions into the Backblaze B2 ecosystem. These integration partners offer both hardware and software solutions that pair with B2 Cloud Storage, giving businesses several options when it comes to data storage and management.
“Our business has a backup strategy in place, so I think we’re done here.”
If only it were that easy. Once your business has a backup plan and has an idea of how to properly utilize sync, backup, and storage, the next step is to routinely check-in and test your backups.
You should test your most important, mission-critical data first, such as tax returns, legal documents, and irreplaceable media. Ensure that the files that are important to you are recoverable and intact by actually trying to recover them.
Don’t wait until disaster strikes to test your restore process and recovery. Seriously. Data loss emergencies are incredibly stressful, and doubly so when you have no idea how to properly find and recover your data. Set a schedule to test your backups and restore processes regularly. If you have more questions about keeping your business data protected, drop a line in the comments below and our team will be happy to help!
Any modern organization should have a backup plan at all times. But as your team grows, finding and implementing a suitable backup strategy can be challenging. As more teams and companies go remote and everyone is dispersed across a range of networks, working on unsanctioned devices, and in various time zones—rolling out a backup solution for your remote team isn’t the only thing on your to-do list. Even small to medium-sized IT teams are put to the test as resources are stretched thin and the challenge of keeping everyone backed up becomes greater.
Understanding the struggles of a strained IT team, Pierre Chamberland, founder & CEO of NetGovern—an information governance software company he founded—made it a top priority to relieve his team from the overhead and burden of managing employee devices. In a recent case study, we took a look at how Chamberland landed on Backblaze as a viable backup solution for his business, how he rolled it out company-wide, and how he and his team continue to practice data backup best practices. Read on for some of the key takeaways from NetGovern’s solution.
How Do You Effectively Back Up a Remote Team?
As a longtime Backblaze Personal Backup customer, Chamberland knew all too well the importance of keeping a proper backup of his data. It all started when he left his laptop on a plane. Sadly, the device was never located, but luckily for him, his data was with Backblaze. He was able to recover all of his files via a USB restore, and his new device was up and running by the time he returned from his trip. “I’ve been convinced of the utility ever since,” Chamberland professed. So when he decided to roll out an innovative device policy at NetGovern, he looked to our Backblaze Business Backup service to harden the plan’s resilience.
Back Up Everything by Default
Chamberland knew that effective protection for his team meant backing up everything by default. This minimizes the risk of losing important data that may otherwise be lost if employees are given the option of selecting what they feel are “critical” backup directories. This approach saves IT teams the hassle and time in making sure employees are properly backing up their data, and a “set it and forget it” client ensures that the least technical person on the team can stay successfully backed up.
In 2018, NetGovern introduced a “Bring Your Own Device,” or BYOD program, where employees choose the device they want within a given budget, and after six months, they own the device. Naturally, employees use this device for both work and personal use, but regardless, Chamberland keeps all of the data backed up. “We made no distinction between personal and business data. Fundamentally, we’re backing up the whole device,” he explained. If employees save locally for whatever reason—ease, habit, slow internet connections—everything on their computer will be recoverable.
Don’t Rely on Sync to Back Up Data
Sync services like Dropbox, iCloud, and Microsoft OneDrive are not true backup solutions. These services sync folders and files across your devices or in the cloud and allow you to access them across each device. These files can be easily shared with others via a unique URL, but changes made to the file will be reflected across all devices. That means if you delete a file from your synced folder, that file will no longer be accessible on your other devices. Sync services also rely on users placing files in designated locations or folders to achieve proper functionality.
Backups, on the other hand, ensure that all of the data on one device has a copy saved elsewhere. By “elsewhere,” we mean the cloud. Backup services typically work automatically and in the background of your computer, backing up new or changed data that is on your computer to another location. In the event of a computer crash or data loss, you’ll be able to recover all of your backed up files. For NetGovern, making the distinction between backups and sync was hugely important.
Before rolling out a full backup solution at NetGovern, Chamberland and his team were using OneDrive, which served as a great tool for collaboration and quick and easy file sharing. Their goal was to use OneDrive as central storage. However, skeptical about how much data was being backed up, Chamberland decided to audit the team. He figured there was work-related content on local devices that was not saved on OneDrive, and he was right. Only 20% of their employees backed up their devices. “In the event of a catastrophe, they could lose hours to potentially weeks of work,” Chamberland contended. They needed a way to safeguard company data without implementing high-touch security protocols.
Test Your Data Recovery and Restores
A backup is great but is only the first step in a complete data backup strategy. Successful data recovery or restoration is the final piece of the puzzle. However, the data recovery process is often overlooked since it isn’t usually an immediate need for most. But as Chamberland can confirm, a data loss emergency is incredibly stressful, and doubly so in a remote scenario. It’s important to set a schedule to test your backups and the restore process regularly. You never know when a hard drive will fail, so it’s best to know the drill before a real-life disaster scenario is underway.
For remote organizations, utilizing a tool like our Backblaze Groups functionality is an easy way to manage your team’s backups, restores, and billing in one place. Groups offers an admin console that allows organizations to employ a low-touch IT approach while still ensuring data security.
To protect their team’s privacy, NetGovern assigned their BYOD devices to an Unmanaged Group where the company only handled billing and payment. Then, they instituted a policy that required employee approval to restore the device. For server devices and shared workstations used by their development operations staff, they continued using a Managed Group to ensure that those key devices could be restored by the business at any time.
Backing Up a Remote Team’s Data Is Simple
Small to medium-sized IT teams don’t have time to troubleshoot or maintain complex solutions. Especially now, as more teams are working remotely, you want a solution that works “out of the box,” requires little to no interfacing with the end user (your employees), and is easy to deploy across your entire organization.
With Backblaze, the click of a button lets you invite the entire team to sign up, install the client, and begin backing up all of your team’s files—all within the same day. IT does not have to manually configure each device, nor be physically present to facilitate the rollout.
For Chamberland, not only is it important to have a backup solution that “just works,” but one that is affordable and scalable as his organization grows. At just $60 per year per device, Chamberland never questioned the decision. “It’s a small price to pay for the peace of mind of our employees. In terms of our HR benefits, it’s a rounding error. It’s way under our coffee budget. I can tell you that,” he remarked. Not only does Backblaze Business Backup allow NetGovern to employ a flexible, forward-thinking device policy that improves IT efficiency, but it also allows them to be certain of how it will affect their budget going forward.
Instilling a Culture of Resilience with Backblaze Business Backup
Read more about how NetGovern implemented Backblaze Business Backup to ensure that essential business data is being backed up and empower their employees with a security mindset.
We all have that one overflowing file cabinet or possibly a closet we’ve been jamming full of files we think may be important to keep, whether because we might need them one day or they include too much personal information.
This year, with the income tax deadline extended to July 15th, I decided to try to sort through all the files I’ve put aside that I felt were important. I keep the current information I need for filing my taxes near me but the older documents I just throw in a box in my basement. With more time at home this year, I’ve realized that a lot has been “saved” over the years. Nonetheless, keeping the old records might come in handy if I need to produce them to file a claim for a tax refund, if someone steals any of my information, or if a creditor or an insurance company asks for specific records from longer than a few years ago.
After going through the process of sorting my old files and documents, I found that other people around me—family members or friends—also have a lot of important documents they want to digitize and back up, and might not know how to start. I want to help make that process a bit easier for other people and provide some peace of mind that all of your important documents stay safe and easy to access for years to come.
It’s important to note that not all of these files may be tax-related. You may be reading this post because you want to jump start documenting your family history or have old schoolwork that you want to save, and you came to this post to find a quick solution on how to save these paper documents on your computer. The information here can relate to many situations, so read on to learn more!
Since 1997, the IRS will accept electronic records as long as they are legible and readable. Having your tax documents in a digital format allows you to get more organized with the way you keep them. When scanning your documents you’ll want to pay attention to what you are naming your files and the state that they are in. Make sure the new digital files are set up in a way that when you search later, you can easily find the information you’re looking for.
Getting the Paper Documents to Your Device
When picking a way to digitize the documents it’s all about what kind of device you feel most comfortable with using. If you don’t feel comfortable doing it at all, you can hire a professional to do it for you. Read on to learn more about both of these options.
This is one of the most common methods of scanning. Whether you have a printer with a scanning function or a device only used for document scanning, this will get your documents on to your computer one scan at a time. There are many different kinds of scanners for different use cases so we recommend comparing reviews of scanners to think about the features that best fit your needs.
Using a desktop scanner will take you a while depending on the size of documents you need to scan but it is a good option for a long term project if you prefer to organize your files on your own.
Third-Party Apps for Your Phone
This option will speed up the scanning process a little more compared to using a scanner. These apps like Evernote Scannable or CamScanner will use your phone’s camera to scan printed documents, receipts, family reunion pictures, birth certificates, and more. Some may even have a function that will analyze the type of document and sort it into a folder for you. That means that all of your photo scans are saved in one folder, while scanned documents go in another. Depending on the third-party app that you chose, it could also have connections to sync services, like Dropbox or Google Drive.
Also, depending on the phone that you have, there may be first party apps available as well, like PhotoScan by Google. If you’re using an Apple device, iOS 11 includes a scanning feature built-in to the Notes app, while iOS 13 supports a scan and sync feature in the Files app.
Document Scanning Services
If you have a very large (closet size) amount of documents to save, then you may not feel comfortable doing it all by yourself. This is when a professional can help you with your project. You can send all your files to a company near you that offers document scanning services. They will work with you to digitize all your important documents and even sort them into folders (and possibly subfolders) to keep your paper documents organized and easy to find on your device. They also give you the option to shred documents you no longer need. This option will off-load the stress that may come with going through your big box of document doom.
One thing to note: These services are great for things like photos, but be aware that you will send them your personal, private, or confidential information, and that they will have access to that data.
Now Your Files Are Digital. What’s Next?
Now that you’ve had your documents digitized on to your computer or a hard drive, it’s important to make sure you protect that data from computer damage (spilled coffee can wreak havoc), viruses, and ransomware by backing up your device.
If you’re using a third-party app to scan and sync your tax documents, you’ll want to be sure you’re also backing them up. Using a sync service, like Google Drive or Dropbox, doesn’t guarantee that your data stays protected. (We go into the details of the differences between sync and backup in this post.) These things may sound very similar but the important difference is that a sync service lets you access the same files across devices, whereas a backup service saves a copy of the most recent version of your data on your computer to another location. More simply: Sync doesn’t protect your data from accidents or disasters.
If you are new to backing up your data, it’s good to make sure you have three copies of your data, the original and at least two backups: one local, on your desktop or on a hard drive, and one in the cloud. Having backups of your newly digitized data ensures that you will always have your important tax information whenever you may need it. We call this the 3-2-1 backup strategy, and you can read more about what it means, here.
It’s important to actively back up your old tax records (or any records) in case you may need to produce them one day. Digitizing and organizing your documents now will help if that situation ever occurs.
Do you have any tips on backing up paper documents that we didn’t mention above? Share them in the comments below!
For the past twelve years we’ve commissioned an annual poll conducted by The Harris Poll asking people the simple question, “How often do you backup all the data on your computer?” and published the results here on the blog. In 2009 we decided to make this an annual event and declared June to be Backup Awareness Month.
Entering this June, we’re curious to see how the changes we’ve seen in the world since the beginning of this year have affected our behavior when it comes to backing up. This year we also asked if people understood the difference between cloud backup and cloud storage—spoiler alert: many don’t. Let’s dig into the numbers!
Are We Backing Up?
There’s good news in this year’s report! Among those who own a computer the percentage who state that they “never” back up all the data on their computer continues to decrease. Even better, the number of people backing up once a year or more frequently is increasing. Even with all that good news though, there’s still work to be done. Roughly one fifth of those who own a computer (19%) say they have “never” backed up all their data. If you add that to those who back up all the data on their computer less than once a year, that number balloons to one in three (33%).
The fact that almost one in five of those who own a computer have never backed up all the data on the computer is still alarming, as they are vulnerable to losing important documents, photos, and other files. We still have work to do to reach all those people to convince them how easy and economical it is to protect their data through regular backups.
But let’s look more closely at the data:
We love seeing that “daily” and “weekly” number increasing. Those are positive trends and more proof that simple backup solutions are causing more people to take action and protect their data.
You can see that the number of people who are backing up frequently has increased substantially over the years. As the “daily,” “weekly,” “monthly,” and “yearly” categories increase, we’d expect to see the “never” category decrease, and that’s a great sign of awareness.
Here’s a detailed look at the numbers from our surveys in 2008 through 2019.
Key Takeaways and Fun with Numbers
Every year after the poll is conducted, we sift through the poll data to see what conclusions we can draw from the results. Our pollster gives us demographics about the subjects surveyed such as the region of the U.S. where they live, level of education, household income, and whether they own a computer or not (kind of important, we think, for this poll). Here’s what stood out:
Almost one in five (19%) of those who own a computer have never backed up all the data on their computer. We’re making some progress, but with almost 50% of people losing data each year, we want to get that number down much further!
10% of those who own a computer say they back up all the data on their computer once a day or more. That’s the highest daily backup percentage we’ve ever recorded.
There’s still a lot of cloud confusion out there with 41% of Americans saying they do not understand the difference between cloud backup and cloud storage. (And for even more nuance: cloud backup vs. cloud sync.) The age group with the highest rate of daily or more backup was the 35-44 year old group at 15%—a mix between Gen X and Millennials. (Who’d of thunk it?)
The Northeast region of the United States has a high rate of daily backup or more with 15% vs. 9% in the Midwest and only 8% in the West.
A few years back, seniors (65+) were the best at backing up, but now as a group they’ve slid back. 30% have never backed up their computer and only 8% back up once a day or more.
It seems the folks in the Midwest who own a computer are the most at risk to lose data, with 26% having never backed up all the data on their computer versus 18% each in the Northeast and West, and 17% in the South.
Want to back up more often? Think outside the box and have children. Those who are not parents of children under 18 are more likely than those who are to have never backed up all the data on their computer (23% vs. 12%). It would seem that backing up is necessary with children running around…
The best way to succeed at a task that’s sometimes neglected is to make it so easy that it gets done. Fortunately, computers are good at automation and backing up can be configured to happen quietly and automatically in the background.
We believe that the reason more people are successful at backing up is that they have discovered automated backup solutions such as Backblaze Personal Backup.
Backblaze Personal Backup can be installed on a Mac or PC and in less than a couple of minutes will be on the job continuously backing up your data. In many situations, the default settings are fine so there’s nothing else to do.
If more people use solutions like Backblaze Personal Backup and automate their backups, the poll results will continue to improve, but more importantly, people will be less likely to lose their valuable photos, messages, financial records, and other important files and documents.
It will be interesting to see whether the poll results next year show even more people backing up. We hope so.
How You Can Help!
One of the things we’re trying to do is educate people on the different types of cloud services and storage options available. The links above are a great way to learn the differences so that you can choose the right solution for you. Those solutions are important considering that almost 20% of people still don’t back up their computers. We need to get that number down as far as we can!
You can also help improve the results for next year’s survey. If you’re already a Backblaze customer, you can let your friends and family know that backing up is important. You can even refer them to Backblaze using our Refer a Friend feature which allows you to invite your friends to an extended free trial of Backblaze Personal Backup. It’s perfect because they get peace of mind knowing that Backblaze is backing up their computers, and you’ll get a free month of service if they sign up with us! If you’re not a Backblaze customer, consider signing up for a free trial, and help us ensure that no one ever loses data again.
• • •
Survey Method: These surveys were conducted online by The Harris Poll on behalf of Backblaze among U.S. adults ages 18+ who own a computer in June 1-3, 2020 (n=1,913), June 6-10, 2019 (n=1,858), June 5-7, 2018 (n=1,871), May 19-23, 2017 (n=1,954), May 13-17, 2016 (n=1,920), May 15-19, 2015 (n=2,009), June 2-4, 2014 (n=1,991), June 13–17, 2013 (n=1,952), May 31–June 4, 2012 (n=2,176), June 28–30, 2011 (n=2,209), June 3–7, 2010 (n=2,051), May 13–14, 2009 (n=2,154), and May 27–29, 2008 (n=2,723). These online surveys were not based on a probability sample and therefore no estimate of theoretical sampling error can be calculated. For complete survey methodology, including weighting variables and subgroup sample sizes, please contact Backblaze.
When I first started using Google Drive I saved everything there. Class projects, presentations for work, notes from meetings, resumes, recipes, and family mailing lists. You name it—all of my files lived in my Google Drive because of how easy it was to access and share them there.
However, the longer I used Google Drive, the more I used it while juggling different accounts (school, personal, and work). So, inevitably, I lost track of where some of my favorite files were located. But then I faced a real challenge: My university announced they would soon be deleting my year’s academic Google Accounts. I realized, as I considered this change, that a lot of important files and emails were on that account that I absolutely needed.
Whether controlled by work, school, or your housemate, Google Accounts are not permanent. Depending on the type of account you have, or who controls it, you may suddenly only have limited access to the account; you might lose your passwords and not have access to the means to reset them; the domain might lapse and get picked up by someone else; or, at the extreme end, your account could be hacked.
So whether you want/need to leave your Google Account for a new service, or you just want to save a copy of all your Google data to a second source, you need to understand how one retrieves and backs up content from a cloud sync service. We’ve outlined some simple steps for you to achieve that, here.
How to Download from Google Drive
Log in to the Google Account you would like to copy your data from.
On average, people have two email accounts, so it is important to make sure you are logged in to the correct Google Account before you start this process. Once signed in, you will want to go to Google Drive itself: drive.google.com. From there, click on the top right corner of the page where your account profile image is located and a drop-down menu (like the one pictured below) will appear.
Select “Manage your Google Account” and you will be led to a new page where you will have four different options to choose from. Select the section labeled “Privacy & personalization.” This is where you will see what data, activity, and preferences your Google Account has associated with it. From here you want to select “Manage your data & personalization” which will bring you to the page where you can download your data.
Once you get to the new page, scroll down to the section labeled “Download or delete your data” and select “download your data.” This will lead you to a new website named Google Takeout. Here, you can export a copy of the content in your Google Account to keep on a local storage source. A reminder before we go forward: this is going to download your data, but it does not delete it from your Google Account.
Select the data you want to download.
On this page, you can select to download an archive of your Google Drive and also your Chrome bookmarks, transactions from various Google services, locations stored in Google Maps, Google Drive contents, and other Google-related products you may use.
When most people think about downloading the data they store in Google Drive, they’re thinking about the documents, photos, and other larger files they work with, but as Google Takeout makes clear: You have a lot more data stored with Google outside of Drive.
Here’s why you might choose to export everything: to have a copy of bookmarked websites, to have a copy of emails that may contain files you’ve lost over time, or to have a copy of important voicemails from loved ones in Google’s Voice product that you want to keep forever. Also, when you download all of your data it is a good reminder of what information Google has on you.
Decide how you would like your files to be delivered.
Once you have decided what parts of your Google data you would like to download, you will have to pick what file type you would like it sent as, the frequency you would like this action to happen (example: if you would like your data to be downloaded every six months this is where you can set that to happen), and the destination you would like your data to be sent to.
When picking a destination for where your data will be sent once you download it, you can choose from having the files emailed to you or sent to a sync service (if you use one) like Dropbox or OneDrive.
Depending on the size of your data, Google may send you multiple emails with different sizes of files. You can choose to have these files sent as a .zip file or a .tgz (tar) file. The main difference between the two options is that a .zip file compresses every file independently in the archive, but a .tgz file compresses the archive as a whole.
What to do once you have your data in your inbox.
An email will appear in a few minutes, hours, or a couple of days (depending on the size of data you are downloading), informing you that your Google data is ready to download. Once you have this email in your inbox, you have a week to download the data. Click the “download your files” button in the email and—presto—you will have a .zip file or a .tgz file (depending on what type of file you picked) on your computer with your Google data.
Backing Up Your Google Drive
You now have your data with all of your important work out of the Google cloud and on to your operating system. What’s next? Protecting your newly downloaded Google data with a good cloud backup strategy should be the next thing you do.
Make sure to have at least two backups: one local, on your desktop or on a hard drive, and one in the cloud. (The word “cloud” may be confusing since you just had your data in a sync cloud service but we’ve found a simple way to define sync vs. backup.) Having two (or three) backups of your newly downloaded data ensures that you will never lose those projects you spent hours working on.
Do you have any techniques on how you download your data from Google Drive or other Google products? Share them in the comments section below!
At the beginning of this month, I received a frantic phone call from a long time friend who teaches ninth grade English. She had just been given the news that she would have to start teaching from home. Her school district gave out Zoom accounts and external hard drives to some of the teachers in order to have them transfer their lesson plans from their school computers to the personal devices they have at home, and sent them on their way.
My friend never had to use an external hard drive before since she saved everything to the computer she used at work or on to a Google Drive account. She was nervous about using it incorrectly, breaking it, or even just finding it on her computer.
This is a reality for thousands of teachers and employees who are being asked to learn new skills at home without the aid of onsite IT help. If you’re one of many folks who are suddenly asking “what is this thing?” and “how will it be helpful to me?” and “I hope I don’t break it”—all while trying to schedule online lesson plans, big meetings, or just trying to continue your connection with your students—you’re not alone! Lots of folks are dealing with this, and we’re here to help with a guide for setting up and protecting your new hard drive.
When you first start using an external hard drive, you might be annoyed by the need to learn something new, or you may simply ignore it. But we love hard drives (obviously) and will include some information below regarding the benefits they can bring to your table: extra space on your computer for new files and applications, portability, and more!
A Guide to Setting Up Your First External Hard Drive
During this COVID-19 pandemic, many of us have found ourselves in situations where we are handed external hard drives to keep our files safe. We hope these tips will help you understand how to best utilize your external hard drive and protect your data.
While it might seem like a no brainer, the first step for setting up your hard drive is to plug it into your computer. An external hard drive typically has one or two cords, usually one for the computer which transfers the data, and another that may also go into your computer or an electric plug to power the hard drive. Small, external, portable hard drives usually need only one cable for both data and power.
Know What’s On Your External Hard Drive
Store only what’s needed. External hard drives are simple: you plug them in, they appear on your computer, and you can simply click and drag your files onto them to copy the files onto the hard drive. But it’s important to monitor what’s on your external hard drive. You can do this by periodically checking your drive to make sure your files are up to date and still needed.
To find where a connected external drive is located on your Mac, try opening Finder. You can do this either by clicking the default Finder icon at the bottom left end of your Dock, or by pressing Command + Space bar, and searching in Finder, or by pressing Shift + Command + C. Once Finder is open, you should see your drives listed either immediately or in the left-hand navigation column under “Locations.” From here, you can click on specific drives to view their contents.
For a Windows computer, you may see variations depending on the version of Windows you are using. In general, you will find your drives listed in File Explorer by clicking on Computer or This PC in the left-hand navigation bar. If you are unsure on how to open File Explorer, try looking for it in your Start Menu. You can also try clicking on your desktop and pressing Windows Key + E together. Once you have located the drives, you should be able to click on specific drives to view their contents.
Another important thing to remember when reviewing the files on your external hard drive is to delete duplicates. Occasionally we will create a copy of a project or create a final edit of a video and have multiple saved versions of the same file. Deleting the duplicates you do not need can help your drive run faster and free up space for more files. You can manually check your files for duplications or use an application that will find and delete duplicate files on your drive.
Learn How to Clean Your Drive
To keep an external hard drive clean you must clean both the hard drive itself as well as the area around the actual computer. Most important is to keep your drive and surrounding areas free of dust. Keeping the airflow in your device free of dust or other debris makes it less likely to overheat. If you’ve already run your hard drive in a dusty environment, compressed air is the best cleaning tool for remedying your situation.
To know where to blow the compressed air you should look for the fan vent, check where the USB ports are, and find other spots on the external hard drive that could collect dust over time.
Finally, it’s important to keep the area around your external hard drive uncluttered to allow for maximum airflow. Be sure to move anything around your drive that may be blocking its airflow like books, papers, etc.
While storing information in the cloud has become second nature to most, there’s still nothing like having everything saved on a physical device. A 3-2-1 backup strategy means having at least three total copies of your data, two of which are located locally but on different types of media (like an external hard drive), and at least one copy that is offsite. So, if you have your files on your computer and your hard drive (which you should store separately from your computer when not in use), you need one other copy stored separately from your house. That’s where the cloud comes in.
There are numerous cloud backup services that will service your computer and your attached drives. We’re partial to our own, of course, and, with Backblaze’s Yearly and Forever Version History features, you can back up your external hard drive easily without having to worry about plugging it in every 30 days.
Keep Your Operating System Up to Date
Your operating system (OS) is the interface of the computer that your external hard drive connects to. We have all hit “remind me later” on an update dialog from our computer at some point in our lives, but updating your OS will ensure that your computer is secure, that your system can run better, and that hard drives are able to properly connect to your files. Updating your OS can vary depending on what kind of computer you have. The best place to look for how to update your OS is in your system’s preferences.
Depending on the age of your computer, however, you should reach out to your local IT person before updating. Some older computers are not able to run, or run very poorly on newer systems.
Prepare for a Drive Failure
Don’t wait until it’s too late. The average hard drive manufacturer’s warranty is only three to five years, and budget hard drives can be even less. This number does not take into consideration physical damage, make or model, or conditions that they are stored in.
When using an external hard drive, you have to prepare for the day that it fails. There are several different ways you can monitor your external hard drive’s health. When it’s near its end, you’ll see or hear the signs like strange clicking or screeching noises, slower performance, and encountering lots of errors when trying to open folders on the drive. You can manually check the status of your drives on your computer.
For a Windows computer, you’ll use a simple command prompt that will tell your computer where to look and what to check. Just right-click the Start menu on your computer, select Run, and type “cmd” or type “cmd” into the search bar. In the Command Prompt window that opens, copy and paste “wmic diskdrive get model,status” without the quotation marks and hit enter. This command will run and it will return “Pred Fail” if your drive is not performing, or “OK” if the drive is performing well.
For a Mac computer, you can monitor the status of your external hard drive by opening Disk Utility by going to Applications and then Utilities. Next, you will click on the drive you would like to test to see how it’s performing. Once you click the drive you would like to check on in the top right corner, click on First Aid. If your drive is performing well, you’ll be able to scroll until you find where it says the volume appears to be OK. If it is not performing well, this process will automatically notify you of any problems like file corruption, an external device not working properly, or that your computer won’t start up. Disk Utility will not detect or repair all problems that a disk may have, but it can give you a general picture.
There are tools or apps you can download to monitor your external hard drive’s health on a Mac using S.M.A.R.T (Self-Monitoring, Analysis, and Reporting Technology) diagnostics. One tool that does a good job is an app called DriveDx, which costs $20 (but you can test it out with a free trial first). DriveDx will help you continuously monitor your drive with a menu bar item that you can pull down and check the status of your drive.
Starting out with an external hard drive is exactly like starting out with any piece of technology you might own. The more you educate yourself on the ins and outs of taking care of it, the better it will run for you, hopefully. But if something bad were to happen, you should always have a backup plan (we suggest Backblaze, but you probably already know that) to protect your new piece of equipment.
Are you a hard drive expert? Are there any tips you would like to share with beginners? Be sure to share them in the comments below.
Remote work, and therefore remote IT management, have become an essential part of the global fight to “flatten the curve” of COVID-19. Thankfully, it appears that widespread social distancing is working to reduce the acceleration of new cases in a number of regions, but it’s clear that the disruption this has caused for businesses is far from over. And for those tasked with IT management during this unpredictable time, their work is more challenging than ever.
With these challenges in mind, we wanted to take a moment to offer our Backblaze Business Backup customers a quick primer to make sure they understand the full range of solutions available to them if they’re experiencing disrupted workflows due to the current pandemic. We hope they help, and if you need any additional guidance, don’t hesitate to ask in the comments, or on our help page.
We understand that deploying new IT systems during this time could be impossible in many scenarios, so this guide begins with a focus on current customers. But if you’re in need of remote backup, restore, or file-sharing services over the coming weeks and months, scroll to the end of this post to learn how seamless and incremental Backblaze Business Backup onboarding can be.
Tips for Remote Backup, Restore, and File Sharing
For those of you that already use Backblaze, here are some tips and tricks to work more efficiently while you’re remote.
Remote File Access
There’s a good chance that a number of employees undergoing mandatory work from home (WFH) arrangements have lost access to files and directories they typically work with on their office devices. With a solution like Backblaze, employees can access their work files from any location, including home. To do so, they merely need to sign in to their account at Backblaze.com, and follow these easy steps.
If for some reason the user is not able to access their account, then an administrator of a managed Group can prepare a restore on behalf of that employee directly within the admin console. The admin can then either notify the employee that their file is ready to download, or download it on the admin computer and email it to them.
Groups-Level File Sharing
Alternatively, if you know exactly what you need to push to your users, Backblaze offers the option of sharing a file directly with multiple recipients without the need to download or have users log in. This can be done directly within the admin console as we outlined here.
Physical Restores for Low-Bandwidth Users
Of course, given that your teams will likely be on a wide array of networks with varying qualities of connectivity, the quantity of data you need to share could saturate a home internet connection if downloaded.
For users in this scenario, Backblaze offers the option of shipping an encrypted restore drive with your data preloaded on it to locations anywhere in the world. Admins can log into their account, prepare the restore drive with the data needed, and ship it to their employees. If the drive is returned after the files are recovered, the price of the restore is refunded, making the process of restoring via USB drive free.
For Users in Need of Remote Backup, Restore, and File Sharing
For businesses with majority onsite teams, it’s tempting to use on-premises backup tools for individual workstations and servers, with backup drives being stored remotely to satisfy a 3-2-1 backup approach. But when teams suddenly have to work off-network for long periods of time, these solutions often no longer cut it. With team members only intermittently logging on to the VPN, or working on their personal machines at home, much of the data created during WFH periods may never hit your server or your backup drives.
If this sounds familiar, we’d urge you to consider using a cloud backup service, if only for the hopefully short duration of time that your team will be out of the office.
Remote Installation of Backblaze Business Backup in Three Steps
If you’re interested in giving it a shot, Backblaze Business Backup can be set up remotely in three easy steps:
1. Administrators email an invitation to employees.2. End users click on the link in the email to install Backblaze and they’ll be backing up in minutes.3. Once the files are backed up, employees’ data is safe regardless of an employee’s physical location, whether they are in the office, working remotely, or even offline.
It really is that easy, and once you’re set up, you can scale up or down your use of Business Backup as you need to for your current business reality. You’re not locked into any level of commitment. If you’d like to learn more, you can get started here.
Staying Together, Apart
These are hard and uncertain times for all of us to navigate, but we hope this information can help those of you out there who are tasked with managing your business’ technical infrastructure find some useful information here. As our CEO, Gleb Budman, noted in his message to customers about our response to COVID-19, it’s all about being “together, apart.”
Has been rewritten to increase the upper thresholds for inheriting a backup state. In the past, some edge-cases existed where log files were too large to be inherited, resulting in a failure. This has now been fixed.
The process has also been cleaned up to remove unnecessary older files, which should result in better performance and less system resource usage.
Fixed a bug which sometimes showed duplicate volume listings in the apps, which led to confusion.
Fixed a bug with .Bzvol which resulted in no files appearing to be selected in some cases.
Minor security enhancements and improvements to logging.
Release Version Number: Mac — 7.0.1 PC — 7.0.1
Availability: April 9th, 2020
Cost: Free for Backblaze Cloud Backup consumer and business customers and active trial users. Upgrade Methods:
Immediately when performing a “Check for Updates” (right-click on the Backblaze icon and then select “Check for Updates”).
It’s a face-off we’re asked about a lot. But from our perspective, the “versus” should really be a “plus,” as the two are complimentary.
Having the right tool for the right job is something any contractor will tell you is imperative, and the same guiding principles apply to computer usage. Sync or backup, which to use? As it happens, using both sync (Dropbox, Google Drive, OneDrive, etc…) and backup (Backblaze Computer Backup) services for your Macs and PCs is now a computing best practice. But there’s still confusion about what these services do, and that leaves some users in a vulnerable state.
We’ve been keeping track of trends and use-cases over the last few years and these misunderstandings about how to leverage the “cloud” for personal use appear to be on the rise. One common quip that often comes up in conversation is “I don’t need a backup, I’m using Dropbox.” Our usual reply is, “Oh, what tier are you paying for?” The response is almost always “No, I just use the free tier.” Which means, while they may not need to back up the data they keep inside their syncing service, the rest of the data on their computer is completely at-risk. And odds are, if you are using the free-tier of a syncing service, you have a lot of data that’s not syncing.
Since we’re in the business of protecting people from data loss, we wanted to offer a little more information about the differences and similarities of Sync and Backup, so that you can make the best, most informed decision about how to adequately protect your data using either or both service types!
What is the Cloud? Sync and Backup
The cloud is still a term that causes a lot of confusion, both about what it is and how services utilize it. Put simply, the cloud is a set of computers that someone else is managing on the customer’s behalf. These computers (typically called servers, or in Backblaze’s case, Storage Pods) typically live in large buildings known as data centers, where they are fed a constant supply of power, are kept in environmentally controlled rooms, and are connected to each other with incredibly fast networking equipment. That networking equipment also connects these data centers to the outside world, where customers can interact with the service providers inside the data centers.
The cloud is perfect for Sync and Backup services, because they both require a lot of space (in the form of servers) to store the data that is being synced or backed up, and a lot of bandwidth (all of that networking equipment) to make sure that data flows to and from the services rapidly. But, while both types of services require similar infrastructure, they are very different in how they function. Let’s take a closer look below!
When considering sync and sharing services like Dropbox, Google Drive, OneDrive, or the slew of other options, people often assume they act as a backup solution as well. The word “cloud” only adds to this confusion, leading people to believe that all “cloud” services are doing the same thing. To help sort this out, we’ll define Sync and Backup below, as they apply to a traditional computer setup—a Mac or PC—with a bunch of apps installed and data on the hard drive.
These services sync (short for “synchronize”) folders on your computer or mobile device to folders on other machines or into the cloud, allowing users to access a file, folder, or directory across different devices. What this means is that you can access a file via a sync service on your computer at home in the morning, make changes, then head to work or a friend’s house and access the same file with all those changes that were made on the other computer. You can also share that file with another user and they can make changes from their computer, which will in turn appear on yours. In either scenario the file is always synced no matter where you access it from. It’s important to note that only the files, folders, or directories you put into the sync service are synced. The rest of the data on the computer is not.
Typically these services have tiered pricing, meaning you pay for the amount of data you store with the service, or for tiers of data that you are allowed to use. If there is data loss (let’s say you share a file with someone and they simply delete it), it may be lost forever. Sometimes these services have a version history feature, meaning you’re able to recover an earlier version of your work (before your friend or coworker deleted it). Of course, only files that are in the synced folders are available to be recovered.
In some cases, relying on a syncing service as a backup can be detrimental. A recent ZDNet article—”Ransomware Victims Thought Their Backups Were Safe, They Were Wrong“—made clear that some people, who thought they were protected by their syncing service, where shocked to discover that the ransomware encrypting their computers also encrypted all of their synced files. With a backup solution (discussed below) with longer version history, these people could’ve simply rolled back to earlier backups, from before the encryptions occurred, and been back up and running with a quick restore. Where sync services ensure that a certain set of data is the same across multiple devices, backup ensures that all or most of the data on one device is backed up elsewhere. In this case “elsewhere” is the cloud.
Backup services typically work automatically and in the background of a person’s computer, backing up new or changed data that is on your computer to another location. For the majority of backup services there is not much configuration involved and there is usually a fixed price (no tiering) for the service. In the event of a computer crash or data loss, all backed up files are available for recovery.
For the most part, backup services catalog and save the most recent version of all data, but many cloud backup services now offer features like extended version history, which helps recover files from past points in time. If you happen to accidentally delete or overwrite files without noticing it, or realize that an earlier version of a file is more useful than the currently saved version, you can recover that older work.
A Note on Backups: Before the cloud became an available and popular destination, the most common way to back up was primarily to a tape, a CD, or an external hard drive. As the cloud became more readily available and affordable, it quickly became the most popular offsite storage medium because it eliminated the need for manual backups by automating the process. Automation makes backing up much easier and more reliable.
Which Backup Service is Right For You?
Backblaze strongly believes in a 3-2-1 Backup Strategy. A 3-2-1 strategy means having at least three total copies of your data, two of which are local (or quickly accessible) but on different mediums (e.g. an external hard drive in addition to your computer’s local drive), and at least one copy offsite. A good way to think about this is a setup where you have data (files) on your computer, a copy of that data on a hard drive that resides somewhere not inside your computer (commonly on your desk), and another copy with a cloud backup provider.
What is the Difference Between Cloud Sync and Backup?
Sometimes it helps to have a real-world example, so let’s take a look at some sync setups that we see fairly frequently.
Example 1. Users have one folder on their computer that is designated for Dropbox, Google Drive, OneDrive, or a similar sync tool. Users save or place data into that folder when they want the data to appear on other devices. Often, they are using the free tier of the syncing and sharing services and only have a few gigabytes of data uploaded in them. This is the most common example that we see and works great for people who simply want to have a little bit of data accessible across many of their devices.
Example 2. Users pay for a higher tier of Dropbox, Google Drive, OneDrive, etc., and essentially use those services as their ‘Documents folder,’ meaning they primarily work out of that one folder. Files in that folder are available across devices, however, files outside of that folder (i.e. living on the computer’s desktop or anywhere else) are not synced or stored by those syncing and sharing services.
What both examples are missing is the backing up of any photos, movies, videos, or anything else among the rest of the data on their computer. That’s where cloud backup providers shine. They automatically back up user data with little or no setup, and no need for the dragging-and-dropping of files.
If Backblaze Computer Backup is added to this example, its application scans the hard drive(s) to find all the user’s data, regardless of where it might be stored. This means that all the user’s data is kept as a backup in the Backblaze cloud, including the data synced by sync services like Dropbox, iCloud Drive, Google Drive, or OneDrive, as long as that data resides on the computer.
Beyond just where and how your data is stored, it’s important to consider how easy it is to get your data back from all of these services. With sync and share services, retrieving a lot of data, especially if you are in a high-data tier, can be cumbersome.
Generally, the sync and share services only allow customers to download files over the internet. If you are trying to download more than a couple gigabytes of data, this process can take time and can be fraught with errors. If the process of downloading from your sync and share service will take three days, one thing to consider is having to keep the computer online the entire time or risk an error if the download were to get interrupted. One thing to be wary of with syncing and sharing services is that if you are sharing your folders or directories with others, if they add or remove files from shared directories, they will also be added or removed from your computer as well.
Cloud backup services enable you to download files over the internet too and can also suffer from long download times. At Backblaze, we never want our customers to feel like we’re holding their data hostage. That is one of the reasons why we have a lot of restore options, including our Restore Return Refund policy, which allows people to restore their data via a USB hard drive and then return that drive to us for a refund. Cloud sync providers typically do not provide this capability.
One popular data recovery use case we’ve seen when a person has a lot of data to restore is for that user to download just the files that are needed immediately, and then order a USB hard drive restore for the remaining files that are not as time sensitive. The user gets all their files back in a few days and their network is spared the download charges.
One of the most important questions for a company that’s scaling quickly is: how do you scale your culture? After the founders, initial hires are easy, but once you’re into triple digits (Backblaze just reached 145), it can be difficult to figure out how to build a community with a shared understanding of your business’s evolution.
One way that we tackle this challenge is by assigning required blog reading to many of our new hires. Along with their different logins, personnel forms, and other onboarding information, new members of the Backblaze family receive a collection of blog posts to read so they can get up to speed on the company.
As the new Publishing Associate, my orientation also featured a lot of reading, and I realized it would be helpful to share my onboarding assignment. After all, there are a lot of new readers coming to the blog these days, and I expect that they might be curious as to what makes Backblaze tick.
So go ahead and read through some, or all of these 10 blog posts, and you’ll receive the same education as every new hire at Backblaze does about what makes our company special.
Transparency is one of our most cherished values—if not the most important value of our business. So we ask anyone new to Backblaze to read this post to learn first-hand from our CEO and co-founder, Gleb Budman, about our guiding philosophy for choosing to publish our designs, statistics, and calculations.
There are always going to be positives and negatives to sharing the inner workings of our business, but for us, building credibility, trust, and awareness has only made our team more excited about transparency. And even though this approach is especially uncommon in the world of cloud storage, we plan to keep open-sourcing our designs and stats because we need the feedback on our products and services to ensure that we continually improve.
This is the second post we typically assign. Where “The Decision on Transparency” is about our values, this post is one of the best examples of our values in action.
When we were first starting out, we couldn’t afford the cost of existing cloud storage solutions at our price point, so we had to build our own multi-petabyte storage system that kept costs low. And yet, many people couldn’t believe that it was possible to provide unlimited data storage for only $5 a month. The only way we could prove our technology worked was to share it with the world. So we did. The results were incredible, but we won’t spoil the surprise.
Another Backblaze value is to be “cleverly unconventional.” And our experience during the Thailand drive shortage is a master class in unconventional thinking and doing.
A 2011 flood in Thailand, where the factories that helped produce nearly half of the world’s hard drives were located at the time, threatened the global supply chain and cut off Backblaze from affordable drive prices. In order to keep the data centers scaling at the right pace, the team came up with a plan to buy up every hard drive they could get their hands on and shuck the external cases to get to the internal drives. Thanks to the help they received from friends, family, and customers, they managed to pull it off. The rest is blog history.
To solve problems and grow as a company, we’ve learned to think creatively to access big picture solutions. And, as we move forward, we’re always thinking about how we can continue to implement our value of transparency. This post is one in a series that we share with new hires who are particularly interested in how we’ve weathered the storm of some universal startup issues. As someone who has been with Backblaze since the beginning, Gleb Budman breaks down a range of common roadblocks and gives his insight to other entrepreneurs looking for advice in this series of nine blog posts.
Curious about how Backblaze makes everything actually work? I was too! When it comes to our commitment to open-sourcing, we didn’t stop at publishing our Storage Pod design; six years later, we also released Backblaze Reed-Solomon, a Java library for erasure coding. This post is essential onboarding for our staff because, until you understand our approach to Reed-Solomon, you can’t understand how our service works. It’s not uncommon for managers to “test” new hires on their understanding of this post, which typically leads to some entertaining whiteboard sessions.
We like to share this post with new hires so they know exactly how our Storage Pod hardware and software architecture come together in the data centers. In this post, we published the final piece of the puzzle of our software architecture: the Backblaze Vaults. If you can understand “Petabytes on A Budget,” the Reed-Solomon post, and “Backblaze Vaults,” well, you might as well send in a resume because you’re ready for your interview at Backblaze!
In 2017, we celebrated the one year anniversary of the launch of Backblaze B2 Cloud Storage, but the most common questions we were still receiving was: “How can you afford to offer such reliable cloud storage at such a low price?” So we decided to craft a post about the underlying costs of running our cloud infrastructure, how our storage system keeps our costs low, and what we do to ensure we have sufficient financial and data storage buffer for unpredictable issues. If you’re interested in knowing where a dollar you give to Backblaze goes, this post will tell you in detail.
No conversation about cloud storage is complete without an argument about durability, and so this post is a must read for anyone new to our business.
Everyone wants to know, and should know, just how safe their data is with Backblaze. So to put a finer point on that knowledge, we shared that the Backblaze Vaults durability can be calculated at 11 nines, and we shared how we calculated that number. But what does 11 nines mean, and why is it important? If you’ve been thinking about how much you miss those super-complex mathematics classes in college or high school, this post is for you.
Backblaze Hard Drive Stats reports are some of the largest data sets on disk drive performance ever to be made available publicly. We also release the raw data that feeds these reports, so that anyone can take a closer look or even recreate the calculations for themselves. We ask new hires to take a look at our stats so they can see the information for themselves, but also so they can see some of the content that has helped us build such an incredible readership. Take a look at one of our most recent reports and let us know what you would do with this information!
Being “fair and good” is another central value for our company. We assign this post to new hires because it shows how we always make our business decisions with our customers’ best interests in mind, even when it’s tough.
Raising prices is never easy, but almost every business is faced with the decision at some point. Since we’ve only raised prices once in Backblaze’s history, we wanted to share the fairly entertaining story of how it happened, and why it’s proven to be a valuable lesson for our team.
Want to Know More About Us?
I hope you’ve enjoyed reading about Backblaze just as much as I did when I first joined the team. For everyone in our company, sharing our journey has been a fundamental aspect of our approach to improving our products and services throughout the years. Want to learn more about us and the unlimited data backup we offer? Take a look at all of our options for getting started with backing up your data.
50,000,000,000—that’s a large number. It also happens to be the milestone that we crossed (on February 5th, 2020 at 14:47 UTC) for files restored from our Computer Backup service! Back in 2016, Backblaze hit 20 Billion files restored for our customers. It took us almost 9 years to get to that number, and only another 4 years to more than double it (and that’s not even including all the Backblaze B2 Cloud Storage files that get accessed and downloaded every day).
50 Billion is a giant number, but it’s not just a number to us. It’s baby pictures, first step videos, PhD theses, long lost tax forms from years past, powerpoint presentations, digitized family albums, art projects, documents and writing, manuscripts, book outlines, and all manner of memories. We love that we’ve built a sustainable business around restoring people’s files which they may have thought were lost forever.
The last time we wrote about a restore milestone we went in and took a look at a typical month in the life of our restore system. Lets revisit that and take a look at the stats for January 2020, with a few new ones thrown in:
January 2020 Stats:
28,841 Total Restores
1,119,500,858 (1.1 Billion) Total Files Restored
2.17 Petabytes of Data Restored
3 Terabytes per hour—equivalent to a good sized external hard drive
48 Gigabytes per minute—about one 4K UHD Blu-Ray movie
810 Megabytes per second—just over one CD’s worth of data
Restores By Operating System:
49.08% were Mac
50.92% were Windows
Of all January 2020 restores:
97.82% were Zip
1.63% were USB HD
0.54% were USB Flash Drive
The Average Amount of Files Per Restore:
29,927 files – Zip
518,756.23 – USB HD
232,711.93 files – USB Flash Drive
The Average Size Of a Restore:
42.16 GB – Zip
2,081.42 GB – USB HD
131.95 GB – USB Flash Drive
Total Data Restored:
Based on ZIP restores:
Range in GB
% of Restores
1 – 10
10 – 25
25 – 50
50 – 75
75 – 100
100 – 200
200 – 300
300 – 400
400 – 500
We started Backblaze with a goal of preventing data loss, and we’re now recovering over 2 Petabytes of data per month, which is a stat that we are, to say the least, very proud of. To put that into perspective, it took us 2 ½ years to reach 2 Petabytes of customer data under management. Now we’re helping our customers restore that amount of data on a monthly basis.
We want to thank our Backblaze customers, and remind folks of how easy it is to restore data with us. You can download it for free via the web, recover your files via a USB Hard Drive or Flash Key, and use our Mobile apps to access your data on iOS and Android! To learn more, visit our restore webpage. If you want to test a restore, try this easy web guide:
Do you have a great story of Backblaze helping you recover data? We’d love to hear it and possibly highlight it in a future blog post. Just comment below with the story of how Backblaze helped you get your data back! Need an example? Here’s a great one.
The files you use every day on your Mac or PC, whether at home or at work, carry around a slew of hidden data that can be incredibly useful to you… or problematically revealing to others. For example, the image in the header reveals latitude and longitude details in an iPhone photo that you could use to organize the photo along with others taken in the same place. But anyone else can access the same data and enter it directly into Google Maps to discover exactly where that picture was taken! Not quite as useful.
But if you know what this hidden information is—and how to use it—it can be incredibly helpful in diagnosing problems with files, organizing or protecting data, and even removing information you don’t want revealed! If you don’t, it can be a huge annoyance, and potentially even dangerous.
“It” is “metadata” and it’s something everyone works with, even if they don’t know it. Whenever you move a file—through email, into or out of a sync or cloud storage service, or to another device—you’re likely altering its metadata. It’s something we work with at Backblaze every day. And because moving files into and out of computer backup and cloud storage services can affect metadata, we thought we’d take a high-level look at how this information works in common file types to help you understand how to optimize its use in your own file management.
You can follow along as we walk through several examples, then tackle some real world file mysteries with the power of metadata. At the end of the post, you will find a list of several tools for Macs, PC’s, and command line to test out and add to your own ‘metadata toolbox.’
What is file metadata?
A great way to think of file metadata is as extra information about a file, carried along with that file, that makes it easier to use and find. So it’s not the actual document or photo itself, it’s information about it—like the file’s name, thumbnail image, or creation date. This information is embedded in or associated with the file, and helps make it easier for you, your applications, and your computer to actually use those files.
Information about a File for Humans
The most obvious kind of metadata is a file’s name, extension, icon, and the timestamp of the its creation date. This simple metadata alone makes searching across an entire hard drive of files and folders as easy as typing a part of the name into the finder or search bar, sorting the results by date, then singling out the file you want by the proper thumbnail or filename.
Information about a File for Computers
A less well-known example of file metadata is meant to make working with files easier or safer for your operating system. Your files might carry notes for the operating system that they should be opened with a specific application. Or a flag might be set on a file you’ve downloaded from the internet or mail attachment warning your OS that it may not be safe to use.
Other critical information about a file is the permissions, or privilege levels, extended to users on that computer:
For example, files on UNIX-like systems, like Linux and macOS X, are marked with the name of the user account that created them (the ‘owner’), the computer account group they belong to, and the permissions for the owner and other users to open and view that file, or make changes to it.
When permissions on files are set correctly, you rarely need to think about them as a user. But if this permissions information changes, users could lose access to files, or files could be opened by users that shouldn’t have access.
Information about a File for Applications
Another category of information is human-readable, but really intended for your applications to use. Some of this information can be incredibly detailed. The best-known example of ‘application metadata’ is camera and location data embedded in images by the cameras when you take pictures, such as the camera information and the camera’s lens and shutter setting when the particular picture was taken.
All this information is read by your image editing software to enable new features. For example, in iPhoto you can search for all images taken in the same location, or find all images shot with the same camera. That means that these files are a trove of interesting information such as the camera type, shutter speed, and even GPS coordinates where the picture was taken.
Information You Won’t Want to Share
You may already know that you do not want to broadcast the location of photos you share, but even plain old documents can have information embedded in them that you’d rather keep to yourself.
In the image above, you’ll see the file metadata of an old word processing document that happily includes names and email addresses for anyone to see! It’s common for files to include information like usernames, email addresses, GPS coordinates, or server mount paths. This is the kind of information you might want to delete before making a file public.
How Metadata Changes as You Move Files from Place to Place
As your files move around—copied from user to user and system to system—all of this useful metadata is vulnerable to being changed or lost. This has implications for your workflow, especially when you inevitably need to reconcile different versions and copies of files.
Unfortunately, the operating-system-specific tags or comments you place on files are the first to be lost when they move from location to location, and system to system.
For example, if I carefully color tag a folder of images on my Mac, then send them to be reviewed by a colleague who works on a PC, all those tags are gone when I get the files back. For this reason, true workflow-specific tags are usually applied in an external system that is dedicated to managing this kind of metadata for files—like a photo manager or a digital asset manager.
File Permissions Can Change from Macs, Windows, and Linux
It’s also common for files received on one OS to come over with non-standard permissions set. For whatever reason, documents saved on a PC end up having the executable bit set when they are moved to a Mac. The files will still open, but there’s no reason for them to be marked like an application.
File Creation and Modification Dates Can Change, Too
When you create or change a file on your computer, the time is recorded as part of the file’s metadata. But what happens when the time on one computer differs from another? Most modern OS’s do a good job of syncing to special time servers, and compensating for universal time based on location, but there are still changes introduced that make sorting files by time a challenge.
Permissions and Timestamps Can Change from Network and Cloud Storage File Metadata and Cloud Servers
When files are copied to network servers, or the cloud, things can get completely changed. Depending on how the file is moved, and how the storage provider handles files, your modification dates could get completely blown away, and since the ‘old’ file you’re uploading is new to the storage system, it becomes a new file with an entirely new creation date.
Individually, these changes are annoying, but collectively they threaten to kill with a thousand cuts. As time stamps, tags, and permissions are changed, your carefully organized file hierarchy or valuable archival information could be in tatters.
A Real World Example of Changing File Metadata
To see how metadata changes, let’s follow a single file downloaded to a Mac, then a PC, then upload and download them to different cloud storage options to see what changes get introduced.
First: A Computer-to-Computer Test
In this test I downloaded a PDF from Backblaze’s website to a Mac. On the Mac, I added color tags, and even comments using the Finder’s preview pane. Next, I downloaded that same file on a Windows system, then copied it over to the Mac.
Despite appearing to be the exact same PDF file, let’s fire up a terminal window on the Mac to inspect them further and make sure.
To follow along, navigate to the folder of files you want to inspect so that it’s handy. Then open another finder window and double click on the ‘Terminal’ application, which is found in the Utilities folder inside of your Applications folder. The terminal application will launch, and you’re placed at the ‘prompt’ ready for your command.
To navigate to the folder you want to work with, type in ‘cd’ at the terminal prompt to change directory, enter a space, then drag the folder of files you want to work with into the terminal window and drop it. You’ll see that the path to the folder is automatically resolved to that folder’s location, saving you a lot of typing.
Now that I’m in the proper folder, the tool I want to use is the humble ‘ls’ command to list a folder’s files. To do so, type in “ls” and then a space, then a dash, immediately followed by “[email protected]”—this will retrieve the long form of results, and the ‘@’ flag will explicitly show extended metadata on the Mac.
As you can already see, the following changes have been introduced:
The Windows file has non-standard permissions (the PDF file is marked as executable as if it were an application, which you can tell by the asterisk marker at the end of the file name, and the permissions sets are all marked with an ‘x,’ indicating that the file is ‘executable’ or treated like an application or command instead of a document.)
The Mac’s Finder shows that the file color tag and comments that I’ve entered are missing in the Windows version.
The Mac has flagged files downloaded on the Mac for its file Quarantine, which is part of the Gatekeeper security feature on mac OS X that marks and prevents potential malware or security risks to your system. This was completely bypassed when copying it over from Windows, so no Quarantine flags were set.
Next Stop, the Cloud
Now, I’ll move these files to and from three different types of cloud storage—Backblaze B2 Cloud Storage, Google Drive, and Dropbox—and see how they change.
To move the files to Backblaze B2, I used rclone, which is an extremely popular tool to copy and sync files from any mix of storage and cloud systems. For Google Drive, I used their web interface, and for Dropbox I uploaded via the web, then retrieved the files as a compressed file.
Now, when I compare all the files side by side I can see how different all of the file metadata is.
First, all of my user-entered metadata, like tags and comments, were not picked up by cloud storage, as expected. Secondly, the Mac’s Gatekeeper security feature also promptly labeled every file downloaded with the ‘Quarantine’ flag. Backblaze B2 returned files with proper file permissions, (644 or read/write for the user, read for the group, and read for all others) and preserves the creation date of the original file.
Both GDrive and Dropbox applied new file creation and file modification timestamps—and bizarrely, the files returned by Dropbox have a “modified date” 8 hours in the future! Does Dropbox know something we don’t?
You can see how searching and sifting through all of these copies on my Mac has become tremendously complicated now.
Solving Metadata Workflow Mysteries and Challenges
Hopefully it’s clear that unless your files only live on your local system, as they move from system to system, the metadata they carry around will change.
Workflow Example 1: Using Metadata Tools to Learn About a ‘Mystery’ File
Let’s apply what we’ve learned in some common examples of how metadata is changed in files, how to inspect them, and some suggestions to correct them.
Inspecting a file’s metadata information can be helpful in diagnosing misnamed files, or files that have lost their file extension. The operating system usually blindly trusts the file extension. For example any file named with a .pdf extension will try to open it as a PDF file even if it’s really something else!
Above, I have a file from a very old backup that is missing an extension. The Mac is having trouble interpreting the way the original Windows OS file system encoded the date, so my Mac thinks the file was created December 31, 1969! (I’m pretty sure I wasn’t using MS Office in 1969.)
Without an extension, my Mac assumes this file must be a text file, and offers to open it in TextEdit, the default app for opening text files. When I double click on the file, the OS tries to open it but throws an error.
Reaching into the toolbox, I use a command-line program called exiftool, a powerful tool to reveal a file’s embedded file metadata. (Navigate to the bottom of the post to read more about exiftool and where you can learn more about how to use it). By calling the exiftool from the terminal application, and passing in the name of the file I want to inspect, all is revealed! This is, in fact, a Microsoft Word file.
Looking closer, I can even see that this isn’t the original file, it was autosaved from the original file, which has an entirely different name. Mystery solved! I can now safely add the ‘.doc’ extension to the file, and it will open properly with my word processor that can still import this version of Microsoft Word.
Workflow Example 2: Uncovering Duplicate Files
Next, let’s take this entire folder of PDF copies that I used for upload tests. After all that uploading and downloading, my single original file has 8 copies. I ‘know’ that I only need one of these, so let’s try de-duping them!
When I try to dedupe this folder using a tool like Gemini, a duplicate file finding tool, I’m presented with several choices of duplicates for me to remove. In other words, Gemini 2 was able to determine that there are duplicates, but isn’t sure which set of files it should keep.
If I select by ‘oldest’ duplicates, it leaves me with the Dropbox versions, by ‘newest’ it leaves me with the GDrive versions, etc. In this particular case, the ‘automatic’ selection tool lets me mark the GDrive and Dropbox versions as the duplicates I will delete. However, the differences in file permissions and extended attributes in Mac’s Finder are preventing these files from being de-duped any further.
I still have two files—the ‘original’ files downloaded to my Mac and PC. Gemini insists they are different files, but we know they are not, so let’s meet some new tools.
Setting Proper Permissions
I could, of course, use Mac’s Finder to reset the permissions of this single file downloaded from Windows. But what if I’m faced with having to reset permissions on thousands of files at once?
To show how you can combine several tools at once, chain the ‘find’ and the ‘chmod’ commands together to first find all documents in my current folder, then change permissions on all of them at once.
Cleaning Mac Extended Attributes
Next, I’ve decided that I want to clear all of the extended attributes that the Mac has set on these files. For this task, I’ll use Apple’s xattr tool.
Now, when I rerun Gemini 2 on this folder, I identify the last duplicate, delete it and I’m back to one file again.
File Metadata Takeaways
As we’ve seen, the metadata carried by the files you use every day changes over the life of the file as it moves from system to system, and server to server. And those changes can be problematic when it comes to the usefulness and security of your data.
You now have the power to see that information, inspect it, and—with the tools listed below—you can change it, solve the mysteries that crop up trying to mediate those changes, and clean up metadata you don’t want made widely known when you share the files.
Do you have more questions about file metadata and how it affects how you use and save your files? Let us know! Meanwhile, the tools listed below are excellent starting points to aid in further exploration.
Addendum: Tools Reference
Here is a list of tools referenced in the article, and other interesting command-line and GUI tools to move, dedupe, and rename files:
exiftool—Hands-down the most widely used metadata exploration tool, which lets you inspect and manipulate standard EXIF and other associated metadata. Latest Windows and macOS downloads are available on the exiftools.org website, via Linux package system, or on a mac with ‘brew install exiftool.’ There are many GUI ports available from the website as well.
rclone—Uses rsync style syntax to copy and sync file locations to and from the widest variety of destinations including almost every known cloud storage choice.
xattr—A macOS system tool to inspect, create, or remove file extended attributes.
ranger—An old school ‘file commander’ that includes an embedded metadata pane. Binaries available, build from source, or on a Mac install with ‘brew install ranger.’
MacPaw Gemini2—Still one of the most widely-used GUI de-dupe tools on the Mac.
fdupes—One of several available command-line de-duping tools.
A Better Finder Rename—A GUI tool to rename batches of files, and even rename according to parent folder structure and EXIF information.
rename—(or ‘brew install rename’) A truly impressive tool to rename entire batches of files with regex, or simple text replacement or addition. Be sure to use the “–dry-run” flag to test what changes it will make first!
The cookie settings on this website are set to "allow cookies" to give you the best browsing experience possible. If you continue to use this website without changing your cookie settings or you click "Accept" below then you are consenting to this.