实现从vector中过滤重复的数据

0、前言

相信有很有种情况下面要在vector下过滤掉重复的数据就比如在数组中需要过滤重复数据一样重要一般常用的方法，好像还是在学校中教的，进行匹配一遍，然后再进行插入既然有了STL容器，那么我们可以完全抛弃上面提到的一般方法，把效率提高至少1倍

1、思路

map也是我们常用的一种容器，他主要的保存方式是，也就是通过key来进行索引，而且这个key是不重复的，那么在我们过滤的前提下，map就是成败的关键了，看看如何把它用好

2、容易陷入的错误

a、一想到容器，很容易想到使用迭代器(iterator)，但是对于map，加入数据量大的时候，进行iterator的操作很慢很慢！

b、容易想到，先把数据都插入到map中，再最后直接在写入vector中，其实这样又回到了a中

3、流程分析

假设我们要存放N多的数据(假设是海量的)，那么我们需要避免对map进行iterator的操作(很慢很慢！)

于是在数据插入的时候，我先去从map中find一下，加入map中并不存在这个key，那么我们放心的插入到map中，同时插入到vector中。总之，每一次想到要插入vector我都想到map.find、map.insert，那么最终的vector就是不存在重复的，也同时避免了在map中使用迭代器(iterator)

4、一个简单的实例(class)

play_list.h：

#include "iostream"
#include "string"
#include "vector"
#include "unordered_map"
using namespace std;

class PlayList
{
public:
	PlayList();
	~PlayList();

	bool ReadFile(const string &path);
	bool WriteFile(const string &path);

	void AddPlayVideoPath(string play_video_path);

private:
	bool ReadFile(const string &path,  vector<string> &vec);
	bool WriteFile(const string &path, vector<string> &vec);

private:
	vector<string> all_video_path_vec_;
	unordered_map<string, string> all_video_path_map_;
};

play_list.cpp：

#include "stdafx.h"

#include "fstream"
#include "play_list.h"

PlayList::PlayList()
{

}

PlayList::~PlayList()
{

}

bool PlayList::ReadFile(const string &path)
{
	return ReadFile(path, all_video_path_vec_);
}

bool PlayList::WriteFile(const string &path)
{
	return WriteFile(path, all_video_path_vec_);
}

bool PlayList::ReadFile(const string &path, vector<string> &vec)
{
	ifstream fin;
	fin.open(path);
	if (!fin.is_open())
	{
		return false;
	}

	// 取出数据
	string tmp;
	while (fin >> tmp)
	{
		cout << tmp << endl;
//		vec.push_back(tmp);
		AddPlayVideoPath(tmp);
	}

	fin.close();

	return true;
}

bool PlayList::WriteFile(const string &path, vector<string> &vec)
{
	ofstream fout;
	fout.open(path);

	vector<string>::iterator it;
	for (it = vec.begin(); it != vec.end(); it++)
	{
		fout << *it << endl;
	}
	fout.close();

	return true;
}

void PlayList::AddPlayVideoPath(string play_video_path)
{
	unordered_map<string, string>::iterator it;
	it = all_video_path_map_.find(play_video_path);
	if (it != all_video_path_map_.end())
	{
		return;
	}

	all_video_path_map_.insert(pair<string, string>(play_video_path, ""));
	all_video_path_vec_.push_back(play_video_path);
}

5、整个实例代码下载

下载传送门：http://download.csdn.net/detail/zengraoli/7868149

时间： 2024-07-30 04:38:30

实现从vector中过滤重复的数据

0、前言

1、思路

2、容易陷入的错误

3、流程分析

4、一个简单的实例(class)

5、整个实例代码下载

实现从vector中过滤重复的数据的相关文章

通过双重for循环来找到JSON中不重复的数据

使用aggregate在MongoDB中查找重复的数据记录

python删除列表中得重复得数据

删除vector中的重复元素

C++中删除重复的数据并且输出(相当与shell脚本里面的sort -u)

mysql 中删除重复字段数据的方式

从数据表中筛选重复的数据

手把手教你如何用java8新特性将List中按指定属性排序，过滤重复数据

MySQL 处理重复数据：防止表中出现重复数据、统计、过滤、删除重复数据